Deema Alnuhait

I am Deema Alnuhait, a second-year computer science Ph.D. student at the University of Illinois Urbana-Champaign (UIUC) , advised by Prof. Hao Peng. I am also a visiting student at Argonne National Laboratory and previously interned at Amazon.
Before this, I completed my master’s degree at Columbia University, where I worked with Prof. Zhou Yu.

I am looking for research internship opportunities for 2025 summer

Email  /  CV  /  Scholar  /  Twitter  /  Github

Research

My research focuses on enhancing the robustness and trustworthiness of large language models (LLMs), including hallucination detection, bias and toxicity mitigation, and model alignment.

FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
Deema Alnuhait, Neeraja Kirtane, Muhammad Khalifa, Hao Peng
arXiv, 2024
arXiv

FactCheckmate, a framework for preemptively detecting and mitigating hallucinations in LLMs by analyzing hidden states to identify issues before they appear in outputs, using effective interventions with minimal overhead to improve factuality.

AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic
Emad A. Alghamdi, Reem I. Masoud, Deema Alnuhait, Afnan Y. Alomairi, Ahmed Ashraf, Mohamed Zaytoon
COLING, 2025
arXiv

AraTrust, a comprehensive Arabic-specific trustworthiness benchmark for LLMs, covering categories such as ethics, safety, and offensive language. It evaluates various models, demonstrating GPT-4's superior performance over open-source alternatives like AceGPT and Jais.

CIDAR: Culturally Relevant Instruction Dataset For Arabic
Zaid Alyafeai, Khalid Almubarak, Ahmed Ashraf, Deema Alnuhait, Saied Alshahrani, Gubran A. Q. Abdulrahman, Gamil Ahmed, Qais Gawah, Zead Saleh, Mustafa Ghaleb, Yousef Ali, Maged S. Al-Shaibani
ACL Findings, 2024
ACL Findings

CIDAR, an open Arabic instruction-tuning dataset developed with extensive manual review for cultural and linguistic alignment, addressing biases in machine-translated datasets. Models fine-tuned on CIDAR demonstrate improved cultural relevance and performance in Arabic-specific tasks compared to those fine-tuned on larger but less tailored datasets.

Education

UIUC logo University of Illinois Urbana-Champaign
2023 - Present
Ph.D. in Computer Science
Advisor: Prof. Hao Peng
Columbia Uni logo Columbia University in the City of New York
2021 - 2023
MSc in Computer Science

Internship Experience

ANL logo Argonne National Laboratory
05.2024 - Present
Visiting Student
Manager: Prof. Eliu Huerta
Amazon logo Amazon Company, Search and AI (A9 team)
06.2022 - 09.2022
Software Development Engineer Intern

Copyright © Jon Barron