PhD Candidate at NYU Tandon & NYU Abu Dhabi, working to make multimodal large language models safer, more robust, and more trustworthy.
I am a PhD candidate in Computer Science & Engineering at NYU Tandon School of Engineering and NYU Abu Dhabi, co-advised by Prof. Siddharth Garg and Prof. Muhammad Shafique.
My research investigates the safety, adversarial robustness, and vulnerability detection of multimodal large language models — spanning jailbreak attacks, multimodal hallucination, and automated code vulnerability analysis.
Born and raised in Beijing, China, I attended the first year of the Early Development Program (EDP) at RDFZ (The High School Affiliated to Renmin University of China) before moving abroad. I then completed a double major in Computer Science and Mathematics at Pomona College, California. I care about work that is both technically sharp and genuinely useful, and I believe the best ideas come from the meeting of different disciplines and cultures.
Peer-reviewed work on adversarial machine learning, multimodal reasoning, and secure code — presented at AAAI, IJCNN, and WCCI.
An RL-driven multi-agent framework using dynamic cipher selection to bypass LLM safety guardrails. A modular Q-table selector adapts cipher strategy against evolving defenses, achieving over 92% ASR on non-reasoning LLMs and over 74% on reasoning-capable models within 10 queries.
Three object-detection-model / VLM integration strategies for counting tasks. Prompt augmentation achieves up to 81.3% counting accuracy (+6.6pp) while cutting inference time by 22%, with ablations revealing when positional grounding helps versus hurts.
A synthetic data-collection pipeline for code vulnerability reports, powering a 7B LLM fine-tuned with QLoRA via curriculum training. Led a research team of 12 to analyze memory corruption across user code and binary programs.
A black-box multi-model cascading framework for cost-efficient code completion with self-testing — routing queries to smaller models when feasible and escalating only when necessary, validated through large-scale ablations on HPC.
An open-source public health surveillance system deployed for the NYC Department of Health, processing syndromic data from 50 NYC hospitals. Built contrastive topic modeling and Poisson scan-statistic pipelines, with an 11× speedup via Numba JIT.
Research sharpens the mind, but the body and the board keep it honest. A few pursuits keep me grounded, competitive, and endlessly curious.
Rated around 1800, with multiple UAE tournament championships. Chess taught me to think many moves ahead — a habit that quietly shapes how I design experiments and adversarial strategies.
A lifelong love of the fastest racket sport in the world. There is nothing quite like a long rally to reset the mind — pure reflex, rhythm, and the joy of a perfectly placed loop drive.
Explosive footwork and delicate touch in equal measure. Badminton is where I go to move fast, laugh loud, and remember that the best play is often the deceptively simple one.
#1 Arlie in the Middle East, with 10,000+ power heroes across all five lanes. Years of ranked play sharpened my instincts for tempo, positioning, and reading an opponent — the same intuitions I bring to adversarial research.