* below indicates equal contribution
Polarity-Aware Probing for Quantifying Latent Alignment in Language Models
S. Sadiekh, E. Ericheva, C. Agarwal: AAAI, 2026
Oral Acceptance
Paper | Code | HuggingFace | Video
CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare
A. Ghosh, S. Sridhar, RK Ravi, M. Muhsin, S. Saha, C. Agarwal: arxiv, 2025
Paper | Code | HuggingFace

