PhD Student
Indiana University Bloomington
I am a PhD student in Information Science with a minor in Computer Science at Indiana University Bloomington, advised by Prof. Susan Herring, Prof. Siqi Wu, and Prof. Thai Le. My research lies at the intersection of computational social science and artificial intelligence. I employ computational methods, including natural language processing, statistical analysis, and machine learning, to understand how AI influences human behaviors, particularly in communication and information seeking.
Currently, I'm working on large-scale conversational AI datasets, algorithm auditing, and LLM fairness evaluation. Previously, I worked at the Language Understanding Lab at Samsung Research Center on machine learning approaches for mental health detection.
Download CVarXiv:2507.04224
Evaluates whether LLMs differentiate responses across user identities by prompting six state-of-the-art LLMs to assist patrons differing in sex, race/ethnicity, and institutional role.
Designed and built a large-scale TikTok political video & comment dataset (4,691 videos, 10M+ comments) and used LLM to analyze how user ideological leanings influence comment ranking under political videos.
iConference 2025
Analyzed TikTok hardship live streaming where people stream to raise money from 14 creators using topic modeling, sentiment analysis, and thematic coding, finding that multimodal authenticity cues and positive tone relate to stronger sales, hardship storytelling increases viewer retention, and women emphasize hardship and use more negative, faster speech than men.
IEEE BSN 2022
Developed and applied sequential machine learning models, including CNN, LSTM, and a custom-designed DEP-CASER model for depression detection, achieving a state-of-the-art accuracy of 0.83.
HCI International 2022 – Late Breaking Papers
Proposed a multi task, sequence based mobile sensing model that jointly predicts depression, anxiety, and stress, achieving 0.78 average accuracy and 0.78 average AUC, and outperforming single task and statistical feature baselines.