Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding
submitted a paper 10 days ago
GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding
upvoted a paper 17 days ago
MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference
Organizations
None yet