Fardin Ahsan Sakib
PhD Candidate, Computer Science
George Mason University
I am a doctoral researcher in the Department of Computer Science at George Mason University and a member of the MasonNLP Research Group. My research spans the large language model lifecycle — interpretability, post-training, and evaluation — aimed at understanding how these models behave and making them dependable for real-world, high-stakes use such as clinical NLP and healthcare, with work published at venues like ACL and EMNLP. A central thread is AI safety: making models and the systems built on them fairer, more robust, and better aligned — from mitigating bias and reliability failures in LLMs to AI safety in multi-agent systems. Through research internships at Comcast, Fujitsu Research, and Amazon, I extend this to applied agentic AI — including agentic memory, agent evaluation, coding agents, and tool-calling workflows.
Recent News
-
06/2026
Started AI Research Internship at Comcast, working on agentic memory!
-
11/2025
Started Agentic AI Research Internship at Fujitsu Research of America!
-
08/2025
Successfully completed a rewarding applied science internship at Amazon and back to school.
-
06/2025
I am joining Amazon as an applied science intern for the summer!
-
05/2025
Paper on "Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models" accepted to ACL 2025!
-
08/2024
Completed Applied Research Internship at Brillient Corporation, working on BRAG (Brillient Retrieval-Augmented Generation)
-
06/2024
Started Graduate Research Assistant position focusing on LLMs in health information retrieval
-
05/2023
Awarded NSF Research Trainee Fellowship at the Center for Adaptive Systems of Brain-Body Interactions
Experience
AI Research Intern
Comcast- Researching agentic memory for large language model agents.
Agentic AI Research Intern
Fujitsu Research of America- Trained a verifier model for coding agents that evaluates and selects the best solution from multiple agent attempts on real-world software engineering tasks (SWE-Bench), improving resolve rate by 15%.
- Designed and implemented an end-to-end evaluation pipeline for scoring and ranking agent-generated solutions across multiple runs, supporting the team's research on inference-time scaling for coding agents.
Applied Research Scientist Intern
Amazon (AWS Support)- Built CloudNEST, an expert-validated dataset for multi-step AWS tool calling (5–10+ chained APIs with cross-service dependencies) to assess agentic-AI workflows pre-production.
- Developed AWSBench, a deterministic AWS response simulator and evaluation harness enabling credential-less, cost-free, reproducible benchmarking with standardized precision/recall scoring for API selection, parameter extraction, and step sequencing.
- Benchmarked leading LLMs; identified failure modes (wrong API, name/parameter-format mismatches) and established baseline metrics (best 38% recall, 19% precision) for internal evaluations.
Applied Research Intern
Brillient Corporation- Developed key components of BRAG (Brillient Retrieval-Augmented Generation) using PyTorch, implementing query distillation and synonym generation modules to enhance contextual understanding across multiple domains.
- Optimized information retrieval by designing efficient chunking algorithms and re-ranking methods, improving relevance by 22% (MAP) and reducing processing time by 35%.
- Finetuned domain-specific LLMs and performed prompt engineering (chain-of-thought, tree-of-thought), deploying across medical, HR, and IRS domains with an 85% user satisfaction rate.
Graduate Teaching and Research Assistant
George Mason University- Researching applications of Foundation Models to health data, focusing on social determinants of health for improved medical information retrieval and analysis.
- Exploring methods to identify and mitigate bias in LLMs to improve fairness and accuracy in health-related AI systems, including explainability for transparency and trust.
- Conducting labs and facilitating discussions for 100+ students; providing individualized feedback on assessments to improve engagement and understanding.
Machine Learning and AI Intern
Brillient Corporation- Engineered and fine-tuned NLP models for QA, summarization, emotion detection, and readability analysis, improving performance by 12–18% (F1, ROUGE, accuracy) for IRS and USCIS projects.
- Built an end-to-end ML pipeline from model development to AWS deployment and API creation (SageMaker, Lambda, API Gateway), reducing deployment time by 40%.