quinn
jwhe
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
6 days ago: new activity in harborframework/parity-experiments: [Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)
16 days ago: new activity in harborframework/parity-experiments: [Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)
2 months ago: authored a paper: SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
Organizations
jwhe's activity
New activity in harborframework/parity-experiments, 6 days ago
[Parity] CL-bench: codex/gpt-5.2 vs infer_codex.py (50 tasks, 3 trials, MATCHING)
#230 opened 6 days ago by jwhe
New activity in harborframework/parity-experiments, 16 days ago
[Parity] CL-bench: codex/gpt-5.1 vs original pipeline (50 tasks, 3 trials)
#210 opened 16 days ago by jwhe