Project: Turkish Embeddings from Scratch and CPT Decoders
Infrastructure: MareNostrum 5 (BSC)
AI & ML interests
Where data finds its mind
Papers
Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain
TurkEmbed: Turkish Embedding Model on NLI & STS Tasks
Models (66)
newmindai/Mecellem-Qwen3-1.7B-TR
Text Generation • Updated • 83 • 4
newmindai/Mursit-Large
Feature Extraction • Updated • 66 • 4
newmindai/Mursit-Base
Feature Extraction • Updated • 7 • 3
newmindai/Muhakim
Text Generation • Updated • 15 • 4
newmindai/Mecellem-Qwen3-4B-TR
Text Generation • 4B • Updated • 59 • 3
newmindai/Mursit-Large-TR-Retrieval
Sentence Similarity • 0.4B • Updated • 1.3k • 6
newmindai/Mursit-Embed-Qwen3-1.7B-TR
Sentence Similarity • 2B • Updated • 69 • 3
newmindai/Mursit-Base-TR-Retrieval
Sentence Similarity • 0.2B • Updated • 1.63k • 4
newmindai/Mursit-Embed-Qwen3-4B-TR
Sentence Similarity • 4B • Updated • 1 • 2
newmindai/bge-m3-stsb
Sentence Similarity • 0.6B • Updated • 64 • 3
Datasets (9)
newmindai/contract-retrieval
Viewer • Updated • 816 • 62 • 2
newmindai/regulation-retrieval
Viewer • Updated • 264k • 120 • 2
newmindai/caselaw-retrieval
Viewer • Updated • 4.15k • 94 • 3
newmindai/ms-marco-turkish-triplets
Viewer • Updated • 920k • 74
newmindai/stsb-deepl-tr
Viewer • Updated • 8.63k • 8
newmindai/EuroHPC-Legal
Viewer • Updated • 43k • 137 • 1
newmindai/RAGTruth-TR
Viewer • Updated • 17.8k • 55 • 7
newmindai/siu-rag-data
Viewer • Updated • 507 • 41 • 2
newmindai/mezura-eval-data
Viewer • Updated • 650 • 27 • 1