Mert Ege

mertege

AI & ML interests

None yet

Recent Activity

updated a model 2 months ago

mertege/moda

published a model 2 months ago

mertege/qwen2.5-7b-lora-tr_v3_epoch0_5-merged

published a model 2 months ago

mertege/moda

Organizations

updated a model 2 months ago

mertege/moda

8B • Updated Oct 27 • 16

published 2 models 2 months ago

mertege/qwen2.5-7b-lora-tr_v3_epoch0_5-merged

8B • Updated Aug 18 • 4

mertege/moda

8B • Updated Oct 27 • 16

updated 2 models 4 months ago

databoss/bge_reranker_v2_m3_db_v1

0.6B • Updated Sep 10 • 7

mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Aug 18 • 6

published a model 4 months ago

mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Aug 18 • 6

updated a model 4 months ago

mertege/qwen2.5-7b-lora-tr_v3_epoch0_5-merged

8B • Updated Aug 18 • 4

upvoted 3 papers 5 months ago

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 50

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

Paper • 2402.14086 • Published Feb 21, 2024 • 12

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published Aug 9 • 117

liked a Space 10 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

liked a model 10 months ago

humain-ai/ALLaM-7B-Instruct-preview

Text Generation • 7B • Updated Jul 14 • 25.1k • 157

upvoted a paper 11 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429

liked 2 models 11 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 2.67M • 1.48k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 609k • 12.9k

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

liked a dataset about 1 year ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 291 • 10

liked a dataset over 1 year ago

BAAI/Infinity-Instruct

Viewer • Updated 23 days ago • 21.9M • 10.6k • 688

liked a model over 1 year ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • 8B • Updated Jun 25, 2024 • 181 • 57

liked a dataset over 1 year ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 121 • 21
