Dhawal Gupta

Krishna Agrawal, Kushagra Jain, Dhawal Gupta, Raunak Srivastav
ASME 2018 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference

2018

Structural credit assignment in neural networks using reinforcement learning

Dhawal Gupta, Gabor Mihucz, Matthew Schlegel, James Kostas
Advances in Neural Information Processing Systems 34, 30257-30270

2021

Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF

Simeng Sun, Dhawal Gupta, Mohit Iyyer
arXiv preprint arXiv:2309.09055

2023

Reinforcement learning based dialogue management strategy

Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya
Neural Information Processing: 25th International Conference, ICONIP 2018, Siem Reap, Cambodia, December 13–16, 2018, Proceedings, Part III 25, 359-372

2018

A unified dialogue management strategy for multi-intent dialogue conversations in multiple languages

Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya
Transactions on Asian and Low-Resource Language Information Processing 20 (6), 1-22

2021

Coagent Networks: Generalized and Scaled

James E Kostas, Scott M Jordan, Yash Chandak, Georgios Theocharous
arXiv preprint arXiv:2305.09838

2023

Behavior Alignment via Reward Function Optimization

Dhawal Gupta, Yash Chandak, Scott Jordan, Philip S Thomas
Advances in Neural Information Processing Systems 36

2024

ICU-Sepsis: A Benchmark MDP Built from Real Medical Data

Kartik Choudhary, Dhawal Gupta, Philip S Thomas
arXiv preprint arXiv:2406.05646

2024

Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework

Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya
Cognitive Computation, 1-13