Nikita Kezins

entfane

AI & ML interests

AI safety and Alignment

Recent Activity

updated a collection 18 days ago

CoT-Signal Classifiers

Organizations

Collections 3

spaces 3

Gpt2 Harmful Classifier

🚀

Gpt2 Harmful Classifier

🚀

Visualize token scores from a GPT-2 classifier

Math Virtuoso

🧮

Ask math questions and get detailed answers

models 17

datasets 13

entfane/jailbreaks-only

Viewer • Updated May 8 • 666 • 29

entfane/construction_points

Viewer • Updated Apr 22 • 10k • 14

entfane/violent_eval

Viewer • Updated Apr 9 • 22.4k • 7

entfane/harmful_subsets

Viewer • Updated Apr 7 • 571k • 19

entfane/preprocessed_toxigen

Viewer • Updated Apr 3 • 10.1k • 23

entfane/toxic_classification

Viewer • Updated Apr 3 • 38.9k • 10

entfane/toxic_chat

Viewer • Updated Mar 1 • 1.25M • 9

entfane/EmotionAtlas-chat

Viewer • Updated Jun 1, 2025 • 3.3k • 20

entfane/EmotionAtlas

Viewer • Updated Jun 1, 2025 • 3.3k • 6

entfane/professor-mathematics

Viewer • Updated Apr 17, 2025 • 64.2k • 7 • 1
