Publications
[Arxiv]
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models- Zhenghao Lin\(^\heartsuit\), Zihao Tang\(^\heartsuit\), Xiao Liu\(^\heartsuit\), Yeyun Gong\(^\heartsuit\), Yi Cheng, Qi Chen, Hang Li, Ying Xin, Ziyue Yang, \(\cdots\), Peng Cheng, Mao Yang.
- Paper
[Arxiv]
ModelGPT: Unleashing LLM’s Capabilities for Tailored Model Generation[ICLR'24]
AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation[ICML'25]
Latent Score-Based Reweighting for Robust Classification on Imbalanced Tabular Data- Yunze Tong, Fengda Zhang, Zihao Tang, Kaifeng Gao, Kai Huang, Pengfei Lyu, Jun Xiao, Kun Kuang.
- Paper