-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 11.7k • 320 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 9.03k • • 215 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.02M • 772 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 288k • 1.55k
Gain.Energy
company
Verified
AI & ML interests
At Gain Energy, we are committed to harnessing the power of Artificial Intelligence (AI) and Machine Learning (ML) to revolutionize the oil and gas industry. Our focus spans a wide range of AI and ML applications aimed at enhancing efficiency, safety, and sustainability.
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 29 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 9 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 13
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 682 • 158 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 2.04M • • 1.89k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 338k • 940 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 123k • • 1.55k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 30 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 61 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46
-
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation • 50B • Updated • 11.7k • 320 -
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation • 8B • Updated • 9.03k • • 215 -
google/gemma-3-1b-it
Text Generation • 1.0B • Updated • 2.02M • 772 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 288k • 1.55k
Sparse Mixture of Experts datasets for mathematical reasoning and complex calculations.
-
On Domain-Specific Post-Training for Multimodal Large Language Models
Paper • 2411.19930 • Published • 29 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 113 -
Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer
Paper • 2503.02495 • Published • 9 -
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective
Paper • 2503.01933 • Published • 13
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 682 • 158 -
meta-llama/Llama-3.2-3B-Instruct
Text Generation • 3B • Updated • 2.04M • • 1.89k -
microsoft/Phi-3.5-mini-instruct
Text Generation • 4B • Updated • 338k • 940 -
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • 11B • Updated • 123k • • 1.55k
-
Stream of Search (SoS): Learning to Search in Language
Paper • 2404.03683 • Published • 30 -
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Paper • 2411.10442 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 61 -
Hymba: A Hybrid-head Architecture for Small Language Models
Paper • 2411.13676 • Published • 46