mark.kejianshi[AT]gmail.com
I work on LLM post-training, RL systems, and scalable oversight evaluation. I'm joining Scale AI as a Research Scientist in NYC (and SF). Previously, I was an MS student advised by Arman Cohan in the YaleNLP group, and I did my undergraduate research with Sam Bowman in the NYU CS ARG group, where I remain actively involved in research.
* denotes equal contribution. See Google Scholar for the full list.
Improving LLM Alignment with References
Kejian Shi, Yixin Liu, Peifeng Wang, Alexander Fabbri, Shafiq Joty, Arman Cohan
Submitted to ICLR 2026
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
PDF
Code
HF
Kejian Shi*, David Wadden*, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan
In submission to ACL 2025; in NeurIPS 2024 Workshop FM4Science
ReIFE: Re-evaluating Instruction-Following Evaluation
PDF
Code
HF
Yixin Liu*, Kejian Shi*, Alexander R. Fabbri, Yilun Zhao, Peifeng Wang, Chien-Sheng Wu, Shafiq Joty, Arman Cohan
In NAACL 2025 Main Conference
Pretraining Language Models with Human Preferences
PDF
Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez
ICML'23: Proceedings of the 40th International Conference on Machine Learning
Oral Presentation
This website uses the design and template by Rose Wang and Martin Saveski.