Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Omartificial-Intelligence-Space
/
Fanar-Math-R1-GRPO

Text Generation
PEFT
TensorBoard
Safetensors
Arabic
English
Generated from Trainer
trl
grpo
math
reasoning
R1
conversational
Model card Files Files and versions
xet
Metrics Training metrics Community

Instructions to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • PEFT

    How to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with PEFT:

    from peft import PeftModel
    from transformers import AutoModelForCausalLM
    
    base_model = AutoModelForCausalLM.from_pretrained("QCRI/Fanar-1-9B-Instruct")
    model = PeftModel.from_pretrained(base_model, "Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO")
  • Notebooks
  • Google Colab
  • Kaggle
Fanar-Math-R1-GRPO / runs /Jun13_02-21-01_lambda-hyperplane
7.88 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
Omartificial-Intelligence-Space's picture
Omartificial-Intelligence-Space
Training in progress, step 10
2b0fb2e verified 11 months ago
  • events.out.tfevents.1749770465.lambda-hyperplane.3173309.0
    7.88 kB
    xet
    Training in progress, step 10 11 months ago