Curated SFT datasets for instruction-following and conversational fine-tuning
Behrooz Azarkhalili
ermiaazarkhalili
AI & ML interests
LLMs, VLMs, PEFT, RL for LLMs and VLMs.
Recent Activity
Published a model about 2 hours ago: ermiaazarkhalili/Qwen3-0.6B-GRPO-NuminaMath-10K
Liked a dataset 2 days ago: Jackrong/Qwen3.5-reasoning-700x
Liked a dataset 2 days ago: nohurry/Opus-4.6-Reasoning-3000x-filtered