The Nguyen SAML (Statistical and Algorithmic ML) Group @ NJIT focus on the statistical and algorithmic aspects of learning for three core AI problems: Sequential Decision-Making, Responsible AI and Reasoning. We seek a mathematical understanding of the underlying algorithmic principles for learning with strong adaptivity to problem structures, and thereby design efficient machine learning algorithms with strong theoretical guarantees.

Reinforcement Learning: Offline RL, Multi-agent RL, multi-task RL
Reasoning: Inductive bias for reasoning in transformers, auto-regressive learning
Responsible AI: robustness, unlearning, privacy

Theory of (Multi-agent) Reinforcement Learning

Many real-world systems—such as recommender platforms, personalized healthcare tools, and digital assistants—are inherently interactive, with data generated through temporally extended experiences rather than isolated observations. Reinforcement learning (RL) offers a foundational paradigm for optimizing decision-making in such environments. Despite decades of progress, however, RL remains insufficiently equipped to address the evolving demands of practice. Key challenges include (i) leveraging rich logged datasets to support robust and efficient decision-making, (ii) developing agents capable of acting reliably in the presence of strategic or adaptive opponents, and (iii) integrating multiple data sources and learning modalities to achieve provable performance improvements. Advancing solutions to these challenges is central to our research agenda, with the ultimate goal of building principled, reliable, and practical RL systems for high-impact applications.

Representative papers:

Thanh Nguyen-Tang, Raman Arora. Policy regret minimization in Markov games with function approximation. ICML, 2025.
Thanh Nguyen-Tang, Raman Arora. Learning in Markov games with adaptive adversaries: Policy regret, fundamental barriers, and efficient algorithms. NeurIPS, 2024 [pdf].
Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup. Offline multitask representation learning for reinforcement learning. NeurIPS, 2024 [pdf].
Thanh Nguyen-Tang, Raman Arora. On the statistical complexity of offline decision-making. ICML, 2024 [pdf].
Thanh Nguyen-Tang, Raman Arora. On sample-efficient offline reinforcement learning: Data diversity, posterior sampling and beyond. NeurIPS, 2023 [pdf].
Anh Do, Thanh Nguyen-Tang, Raman Arora. Multi-agent learning with heterogeneous linear contextual bandits. NeurIPS, 2023 [pdf].
Thanh Nguyen-Tang, Ming Yin, Sunil Gupta, Svetha Venkatesh, Raman Arora. On instance-dependent bounds for offline reinforcement learning with linear function approximation. AAAI, 2023 [arXiv] .
Thanh Nguyen-Tang, Sunil Gupta, A.Tuan Nguyen, and Svetha Venkatesh. Offline neural contextual bandits: Pessimism, optimization, and generalization. ICLR, 2022 [pdf] [poster]

Theory of Reasoning with Transformers

Reasoning in ML/AI refers to a model’s capacity to perform multi-step inference, abstraction, and compositional problem-solving—going beyond pattern recognition to systematically connect information in ways that support generalization and decision-making. Why do transformers and other large language models achieve strong performance on many tasks that demand complex reasoning? What are the underlying learning mechanisms for reasoning in such situations? Do these reasoning capabilities arise from the inductive bias of training transformers with gradient descent, or from the new learning paradigm (e.g., autoregressive learning)? Answering these questions will improve our understanding of how to design reasoning-capable agents and represent a step toward developing even better or more efficient AI systems for specialized needs.

Representative papers:

Quan Nguyen, Thanh Nguyen-Tang. One-Layer Transformers are Provably Optimal for In-context Reasoning and Distributional Association Learning in Next-Token Prediction Tasks. [ArXiv]

Responsible AI

Responsible AI requires ML algorithms not only to achieve statistical and computational efficiency but also to remain aligned with societal needs. In our lab, we integrate responsible design principles directly into algorithmic constraints. This includes ensuring robustness, so that algorithms remain reliable even in adversarial or uncertain deployment environments; enabling differential privacy, to protect sensitive user information; and supporting the efficient removal of user data, allowing individuals to request the elimination of their data’s influence on model behavior.

Representative papers:

Yassine Chemingui, Aryan Deshwal, Alan Fern, Thanh Nguyen-Tang, Jana Doppa. O3SRL: Online Optimization for Offline Safe Reinforcement Learning. NeurIPS, 2025.
Nguyen Hung-Quang, Ngoc-Hieu Nguyen, The-Anh Ta, Thanh Nguyen-Tang, Kok-Seng Wong, Hoang Thanh-Tung, and Khoa D Doan. Wicked oddities: Selectively poisoning for effective clean-label backdoor attacks. ICLR, 2025 [pdf].
Ragja Palakkadavath, Hung Le, Thanh Nguyen-Tang, Svetha Venkatesh, Sunil Gupta. Fair domain generalization with heterogeneous sensitive attributes across domains. WACV, 2025 [pdf].
Austin Watkins, Thanh Nguyen-Tang, Enayat Ullah, Raman Arora. Adversarially robust multi-task representation learning. NeurIPS, 2024 [pdf].
Thanh Nguyen-Tang, Sunil Gupta, Svetha Venkatesh. Distributional reinforcement learning via moment matching. AAAI, 2021 [arXiv] [code]
Thanh Nguyen-Tang, Sunil Gupta, Huong Ha, Santu Rana, Svetha Venkatesh. Distributionally robust Bayesian quadrature optimization. AISTATS, 2020 [arXiv] [code]