YIZHE

Yizhe Xiong (熊翊哲)

I am a Ph.D. candidate in the Multimedia Intelligence Group, School of Software, Tsinghua University. Before that, I received my bachelor's degree from the Department of Computer Science and Technology, Tsinghua University. I am advised by Prof. Guiguang Ding.

Email  /  Google Scholar  /  LinkedIn  /  CV

News

  • 12/2024, One paper has been accepted by ICASSP 2025.
  • 12/2024, Scaffold-BPE has been accepted by AAAI 2025.
  • 12/2024, One paper has been accepted by COLING 2025.
  • 07/2024, Our paper on PEFT has been accepted by ECCV 2024.
  • 04/2024, We have explored new directions in LLM pre-training. Check out Temporal Scaling Law and Scaffold-BPE on arXiv.
  • 03/2024, Check out our latest work on fine-tuning & task adaptation.
  • 07/2023, Our paper on domain adaptation has been accepted by ICCV 2023.
Research

    I'm interested in transfer learning for computer vision. I mainly focus on research topics such as transferring to downstream tasks, domain adaptation/generalization, and continual learning.

    Conference Papers:

    1. PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
      Yizhe Xiong, Hui Chen, Tianxiang Hao, Zijia Lin, Jungong Han, Yuesong Zhang, Guoxin Wang, Yongjun Bao, Guiguang Ding
      Keywords: Transfer Learning, Parameter-Efficient Fine-Tuning (PEFT), Task Adaptation, Model Pruning, Token Pruning
      ECCV 2024 | paper  GitHub repo

    2. Confidence-based Visual Dispersal for Few-shot Unsupervised Domain Adaptation
      Yizhe Xiong, Hui Chen, Zijia Lin, Sicheng Zhao, Guiguang Ding
      Keywords: Domain Adaptation, Transfer Learning, Few-Shot (Low Shot), Few-Shot Unsupervised Domain Adaptation (FUDA)
      ICCV 2023 | paper  GitHub repo

    3. Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models
      Haoran Lian∗, Junmin Chen∗, Wei Huang∗, Yizhe Xiong∗, Wenping Hu∗, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang, Di Zhang (∗ denotes equal contribution)
      Keywords: Large Language Model (LLM), Language Modeling, Long Context Extrapolation
      COLING 2025

    4. LBPE: Long-token-first Tokenization to Improve Large Language Models
      Haoran Lian, Yizhe Xiong, Zijia Lin, Jianwei Niu, Shasha Mo, Hui Chen, Peng Liu, Guiguang Ding
      Keywords: Large Language Model (LLM), Language Modeling, Machine Translation, Byte-Pair Encoding (BPE)
      ICASSP 2025

    5. Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal
      Haoran Lian, Yizhe Xiong, Jianwei Niu, Shasha Mo, Zhenpeng Su, Zijia Lin, Peng Liu, Hui Chen, Guiguang Ding
      Keywords: Large Language Model (LLM), Language Modeling, Machine Translation, Byte-Pair Encoding (BPE)
      AAAI 2025

    Others:

    1. Temporal Scaling Law for Large Language Models
      Yizhe Xiong, Xiansheng Chen, Xin Ye, Hui Chen, Zijia Lin, Haoran Lian, Jianwei Niu, Guiguang Ding
      Keywords: Large Language Model (LLM), Language Modeling, Scaling Law
      Under review | paper

Academic Services

  • Reviewer for NeurIPS 2024, IJCAI 2024, ACL Rolling Review 2024, ICLR 2024.

  • Reviewer for IEEE Transactions on Image Processing.

Awards

  • 2024, Academic Scholarship, School of Software, Tsinghua University.

  • 2024, First Place and Gold Prize, VISION'24 Data Challenge, ECCV 2024.

  • 2023, Academic Scholarship, School of Software, Tsinghua University.

  • 2022, Outstanding Graduate, Department of Computer Science and Technology, Tsinghua University.

Many thanks go to Dr. Yunhe Wang, who shared the source code of his homepage.