Text Generation
Safetensors
English
Chinese
qwen3
reward-model
rlhf
principle-following
qwen
conversational
Instructions to use WisdomShell/RewardAnything-8B-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Inference