PhD Candidate — Computational Linguistics & NLP Researcher
Department of Near Eastern Languages & Cultures (NELC) | Indiana University, Bloomington
I am a PhD candidate in Computational Linguistics at Indiana University, Bloomington, working in the Department of Near Eastern Languages & Cultures (NELC). My research focuses on natural language inference (NLI), mechanistic interpretability of multilingual language models, and Arabic NLP.
My dissertation investigates the internal reasoning mechanisms of multilingual transformer models for cross-lingual entailment, with a special focus on Arabic and its typological properties. I use circuit analysis, probing classifiers, and attention visualization to understand how models represent semantic relationships across languages.
Beyond technical NLP, I am also interested in language policy, minority language rights, and computational approaches to endangered language documentation.
Dissertation research investigating how multilingual transformer models internally represent and process natural language entailment across languages, with special focus on Arabic cross-lingual phenomena. Employs circuit analysis, activation patching, and probing methods.
Dissertation In Progress Indiana UniversityResearch on semantic inference across Arabic and English using multilingual transformer models. Includes construction of custom Arabic NLI datasets, cross-lingual annotation protocols, and evaluation of state-of-the-art multilingual models.
NLI Arabic NLP Dataset ConstructionBuilding structured knowledge graphs from academic corpora using Neo4j and Python-based NLP pipelines. Involves entity extraction, relation linking, and semantic querying for computational linguistics literature.
Knowledge Graphs Neo4j Digital HumanitiesResearch on authorship attribution and verification in Arabic and English texts using machine learning and stylometric approaches, with applications to historical manuscript analysis and forensic linguistics.
Authorship Analysis Arabic & EnglishDeveloping data augmentation pipelines for speech analysis of individuals with ALS and dysarthria, supporting assistive technology research and atypical speech modeling.
Speech Processing ALS Assistive TechNLI datasets and benchmarks used and contributed to in my research:
I have experience in graduate-level instruction and research mentoring in computational linguistics, NLP methods, and Arabic language studies at Indiana University, Bloomington. My teaching integrates hands-on coding with theoretical linguistic foundations.
I am actively engaged in academic service including conference reviewing (targeting ACL 2026, NESY 2026, and American Research Center in Egypt Annual Meeting), paper publication in NLI, multilingual NLP, and Arabic computational linguistics venues.
I also contribute to community outreach on language policy, minority language rights, and computational approaches to the documentation of endangered and under-resourced languages.
NLI & Interp.