Posts by Collection

portfolio

publications

Prototypical Reward Network for Data-Efficient RLHF

Published in Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), 2024

Proto-RM introduces prototypical networks into reward modeling to enhance data efficiency in RLHF, achieving robust preference learning with limited human feedback.

Download Paper

LEKA: LLM-Enhanced Knowledge Augmentation

Published in Proceedings of the 34th International Joint Conference on Artificial Intelligence (IJCAI 2025), 2025

This paper proposes LEKA, a Large Language Model–Enhanced Knowledge Augmentation framework that actively retrieves and aligns transferable knowledge across domains for improved data efficiency and transfer learning performance.

Download Paper

Distilling Empathy from Large Language Models

Published in Proceedings of the 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2025), 2025

This paper presents a two-step fine-tuning framework for distilling empathy from Large Language Models (LLMs) into Smaller Language Models (SLMs), achieving a 90% win rate in empathetic response generation.

Download Paper

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.