Kejian (Mark) Shi

mark.kejianshi[AT]gmail.com

Goodreads

I work on LLM post-training, RL systems, and scalable oversight evaluation. I'm joining Scale AI as a Research Scientist in NYC (and SF). Previously, I was an MS student advised by Arman Cohan in the YaleNLP group, and I did my undergraduate research with Sam Bowman in the NYU CS ARG group, where I remain actively involved in research.

Publications

* denotes equal contributions. See Google Scholar for full list.

Improving LLM Alignment with References

Kejian Shi, Yixin Liu, Peifeng Wang, Alexander Fabbri, Shafiq Joty, Arman Cohan

Submitted to ICLR 2026

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature PDF Code HF

Kejian Shi*, David Wadden*, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan

In submission to ACL 2025; in NeurIPS 2024 Workshop FM4Science

ReIFE: Re-evaluating Instruction-Following Evaluation PDF Code HF

Yixin Liu*, Kejian Shi*, Alexander R. Fabbri, Yilun Zhao, Peifeng Wang, Chien-Sheng Wu, Shafiq Joty, Arman Cohan

In NAACL 2025 Main Conference

Pretraining Language Models with Human Preferences PDF

Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez

ICML'23: Proceedings of the 40th International Conference on Machine Learning

Oral Presentation

People Who Have Empowered Me

  • My family
  • Prof. Sam Bowman, Prof. Arman Cohan, Dr. Tomasz Korbak, Yixin Liu, Rachel Li, among others

Acknowledgement

This website uses the design and template by Rose Wang and Martin Saveski.