Publications

Publications

  • [Arxiv] Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
    • Zhenghao Lin\(^\heartsuit\), Zihao Tang\(^\heartsuit\), Xiao Liu\(^\heartsuit\), Yeyun Gong\(^\heartsuit\), Yi Cheng, Qi Chen, Hang Li, Ying Xin, Ziyue Yang, \(\cdots\), Peng Cheng, Mao Yang.
    • Paper
  • [Arxiv] ModelGPT: Unleashing LLM’s Capabilities for Tailored Model Generation
    • Zihao Tang, Zheqi Lv, Shengyu Zhang, Fei Wu, Kun Kuang.
    • Paper | Code
  • [ICLR'24] AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation
    • Zihao Tang, Zheqi Lv, Shengyu Zhang*, Yifan Zhou, Xinyu Duan, Fei Wu, Kun Kuang*.
    • Paper | Code
  • [ICML'25] Latent Score-Based Reweighting for Robust Classification on Imbalanced Tabular Data
    • Yunze Tong, Fengda Zhang, Zihao Tang, Kaifeng Gao, Kai Huang, Pengfei Lyu, Jun Xiao, Kun Kuang.
    • Paper

Trending Tags