Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a model 20 minutes ago
entfane/gpt2_constitutional_classifier_violence published a model 20 minutes ago
entfane/gpt2_constitutional_classifier_violence updated a dataset about 2 hours ago
entfane/harmful_subsets