Research
My research focuses on enhancing the robustness and trustworthiness of large language models (LLMs), including hallucination detection, bias and toxicity mitigation, and model alignment.
|
|
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
Deema Alnuhait,
Neeraja Kirtane,
Muhammad Khalifa,
Hao Peng
arXiv, 2024
arXiv
FactCheckmate, a framework for preemptively detecting and mitigating hallucinations in LLMs by analyzing hidden states to identify issues before they appear in outputs, using effective interventions with minimal overhead to improve factuality.
|
|
AraTrust: An Evaluation of Trustworthiness for LLMs in Arabic
Emad A. Alghamdi,
Reem I. Masoud,
Deema Alnuhait,
Afnan Y. Alomairi,
Ahmed Ashraf,
Mohamed Zaytoon
COLING, 2025
arXiv
AraTrust, a comprehensive Arabic-specific trustworthiness benchmark for LLMs, covering categories such as ethics, safety, and offensive language. It evaluates various models, demonstrating GPT-4's superior performance over open-source alternatives like AceGPT and Jais.
|
|
CIDAR: Culturally Relevant Instruction Dataset For Arabic
Zaid Alyafeai,
Khalid Almubarak,
Ahmed Ashraf,
Deema Alnuhait,
Saied Alshahrani,
Gubran A. Q. Abdulrahman,
Gamil Ahmed,
Qais Gawah,
Zead Saleh,
Mustafa Ghaleb,
Yousef Ali,
Maged S. Al-Shaibani
ACL Findings, 2024
ACL Findings
CIDAR, an open Arabic instruction-tuning dataset developed with extensive manual review for cultural and linguistic alignment, addressing biases in machine-translated datasets. Models fine-tuned on CIDAR demonstrate improved cultural relevance and performance in Arabic-specific tasks compared to those fine-tuned on larger but less tailored datasets.
|
|
University of Illinois Urbana-Champaign
2023 - Present
Ph.D. in Computer Science
Advisor: Prof. Hao Peng
|
|
Columbia University in the City of New York
2021 - 2023
MSc in Computer Science
|
|
Argonne National Laboratory
05.2024 - Present
Visiting Student
Manager: Prof. Eliu Huerta
|
|
Amazon Company, Search and AI (A9 team)
06.2022 - 09.2022
Software Development Engineer Intern
|
|