Kunal Banerjee

Principal Data Scientist
Walmart Global Tech

SMIEEE, SMACM

Address:	Walmart Labs Pardhanani Wishire II, Cessna Business Park Kadubeesanahalli, Varthur Hobli, Outer Ring Road Bengaluru, Karnataka, India - 560103
Email:	kunal [dot] banerjee1 [at] walmart [dot] com [CV]

About

I am currently a Principal Data Scientist at Walmart Global Tech (formerly, known as Walmart Labs). My team "Data Science Foundation" works on 2 types of ML problems: (1) foundational problems: the problems whose solutions are likely to benefit multiple orgs within Walmart and its subsidiaries (e.g., text extraction from images, super-resolution, sentiment analysis, topic modelling, anomaly detection), (2) ad-hoc problems: the problems that may occur only to specific orgs, which typically do not have the necessary ML expertise to resolve these problems (e.g., sequential testing, diversity in recommendation). In addition, I try to leverage my experience in high-performance computing (HPC) to scale these solutions, whenever possible. I have had the opportunity to work on various projects encompassing diverse fields including Computer Vision, Natural Language Processing, Recommendation, A/B Testing, Responsible AI, AutoML, and AI Governance.

Earlier I was a Research Scientist at Parallel Computing Lab, Intel Labs, India, where my primary focus was on kernel optimization of deep learning workloads on Intel architectures (IA). For example, my code for convolution using Winograd, RNN, LSTM and GRU are available in open source libraries: LIBXSMM and Intel MKL-DNN. These libraries have been adopted in several software products including TensorFlow, Caffe, MS CNTK, Apache MXNet, Chainer, OpenVINO among others for enhanced performance on IA.
I am also interested in low-precision deep neural networks. Specifically, together with my colleagues in Intel Labs, we have developed and implemented Ternary Residual Networks which uses 8-bits for activations and 2-bits for weights (with residual edges, if required) for neural networks. I have also helped showcase the efficacy of BFLOAT16 datatype on IA.
These works have been accepted in venues such as, SuperComputing, IPDPS, ICLR, CLUSTER, and have been recognized with awards such as, ISC Best Research Poster Award (AI & ML track), Intel's Gordy Award (Intel Labs' highest award) and Divisional Recognition Award.
I have also contributed to Intel's accelerator for deep learning training as part of Intel Artificial Intelligence Products Group.

Prior to joining Intel, I received my PhD from the Department of Computer Science and Engineering, IIT Kharagpur. My research areas encompassed program analysis, formal methods and verification. I was a recipient of Senior Research Fellowship from the Department of Science and Technology, India, and TCS Research Fellowship from Tata Consultancy Services for supporting my doctoral studies. My dissertation work won Best PhD Thesis Award at VLSI Design, Best PhD Forum Paper at ISVLSI and Techno Inventor Award (PhD) from India Electronics & Semiconductor Association (IESA).

Research Interests

Deep Learning
Responsible AI
High-Performance Computing
Program Analysis
Formal Methods

Education

2016: Ph.D. in Computer Science and Engineering from Indian Institute of Technology Kharagpur

Translation Validation of Optimizing Transformations of Programs using Equivalence Checking

Prof. Chittaranjan Mandal

Prof. Dipankar Sarkar

2008: B.Tech. (Honors) in Computer Science and Engineering from Heritage Institute of Technology, Kolkata

Prof. Nabanita Das

Indian Statistical Institute, Kolkata

2004: Higher-Secondary (Class XII Board Exam)

West Bengal Council of Higher Secondary Education

Ramakrishna Mission Residential College, Narendrapur

2002: I.C.S.E. (Class X Board Exam)

Council for the Indian School Certificate Examinations

Lycée

Professional Experience

Data Science Foundation, Walmart Global Tech

Artificial Intelligence Products Group, Intel Corp.

Parallel Computing Lab, Intel Labs

Sponsored Research and Industrial Consultancy, Indian Institute of Technology Kharagpur

Department of Science and Technology

Tata Consultancy Services

McGraw-Hill Education

Awards/Achievements

2025: Received Excellence Award from Walmart

2023: Received AV Luminary Award (Top 10 Data Scientists) from Analytics Vidhya

2022: Received Emerging Leaders in Data & Analytics award at Data World Summit

2021: Elevated to ACM Senior Member

2021: Nominated for Best Poster Award in the conference DeLTA 2021

2020: Elevated to IEEE Senior Member

2019: Gordy Award (Intel Labs' highest award)

2019: Invited paper in the journal Supercomputing Frontiers and Innovations

2019: Best Research Poster Award in "Artificial Intelligence and Machine Learning" track in the conference ISC 2019

2018: Divisional Recognition Award from Intel

2018: Techno Inventor Award (PhD) 2017 from India Electronics & Semiconductor Association (IESA)

2018: Best PhD Thesis Award at VLSID 2018

2017: Invited to present our TSE 2017 paper at ESEC/FSE 2017

2015: Best PhD Forum Paper Award in the conference ISVLSI 2015

2013: Best Paper Award in the conference I-CARE 2013

2012: TCS Research Fellowship Award

Selected Publications

Journals

Optimizing Deep Learning RNN Topologies on Intel Architecture.
Kunal Banerjee, Evangelos Georganas, Dhiraj D. Kalamkar, Barukh Ziv, Eden Segal, Cristina Anderson, Alexander Heinecke.
Supercomputing Frontiers and Innovations, vol. 6, no. 3, 2019, pp: 64-85, invited paper.
A Counter-Example Generation Procedure for Path based Equivalence Checkers.
Ramanuj Chouksey, Chandan Karfa, Kunal Banerjee, Pankaj Kalita, Purandar Bhaduri.
IET Software, vol. 13, no. 4, 2019, pp: 280-285.
Deriving Bisimulation Relations from Path Extension Based Equivalence Checkers.
Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
IEEE Transactions on Software Engineering (TSE), vol. 43, no. 10, 2017, pp: 946-953.
Deriving bisimulation relations from path based equivalence checkers.
Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
Formal Aspects of Computing (FAC), vol. 29, no. 2, 2017, pp: 365-379.
A Path Construction Algorithm for Translation Validation using PRES+ Models.
Soumyadip Bandyopadhyay, Dipankar Sarkar, Chittaranjan Mandal, Kunal Banerjee, Krishnam Raju Duddu.
Parallel Processing Letters (PPL), vol. 26, no. 2, 2016, pp: 1-25.
Extending the FSMD Framework for Validating Code Motions of Array-Handling Programs.
Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 33, no. 12, 2014, pp: 2015-2019.
Verification of Code Motion Techniques using Value Propagation.
Kunal Banerjee, Chandan Karfa, Dipankar Sarkar, Chittaranjan Mandal.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 33, no. 8, 2014, pp: 1180-1193.
Verification of Loop and Arithmetic Transformations of Array-Intensive Behaviours.
Chandan Karfa, Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 32, no. 11, 2013, pp: 1787-1800.

Conferences / Workshops

Real-time Anomaly Prediction at Scale using Anomaly Detection Augmented with Regression.
Kunal Banerjee, Binay Gupta, Meet Maheshwari, Lalitdutt Parsai, Geet Vudata, Soumik Dasgupta, Anirban Chatterjee.
International Conference on Data Science & Management of Data (CODS-COMAD) Dec'24, December 2024, pp: 183-191.
waLLMartCache: A Distributed, Multi-Tenant and Enhanced Semantic Caching System for LLMs.
Soumik Dasgupta, Anurag Wagh, Lalitdutt Parsai, Binay Gupta, Geet Vudata, Shally Sangal, Sohom Majumdar, Hema Rajesh, Kunal Banerjee, Anirban Chatterjee.
International Conference on Pattern Recognition (ICPR), December 2024, pp: 232-248.
BARGAIN: A Super-Resolution Technique to Gain High-Resolution Images for Barcodes.
Saptarshi Misra, Kunal Banerjee, Anirban Chatterjee.
International Conference on Data Science & Management of Data (CODS-COMAD), January 2024, pp: 464-468.
Are you a Foodie looking for New Cookies to try out? Better not ask an LLM.
Binay Gupta, Saptarshi Misra, Anirban Chatterjee, Kunal Banerjee.
AIMLSystems, October 2023, pp: 40:1-40:4.
These Deals Won't Last! Longevity, Uniformity and Bias in Product Badge Assignment in E-Commerce Platforms.
Archit Bansal, Kunal Banerjee, Abhijnan Chakraborty.
SIGIR Workshop On eCommerce (eCOM)@SIGIR, July 2023, pp: 1-17. [arXiv]
Designing a Vision Transformer based Enhanced Text Extractor from Product Images.
Saptarshi Misra, Pranay Dugar, Anirban Chatterjee, Lalitdutt Parsai, Kunal Banerjee.
International Conference on Data Science & Management of Data (CODS-COMAD), January 2023, pp: 208–212.
A Dynamic Attention Based Graph Neural Network for Anomaly Prediction in Multi-Variate Time-Series & Its Application in Network Monitoring.
Meet Maheshwari, Binay Gupta, Anirban Chatterjee, Kunal Banerjee.
International Conference on Data Science & Management of Data (CODS-COMAD), January 2023, pp: 233–237.
Developing a Noise-Aware AI System for Change Risk Assessment with Minimal Human Intervention.
Subhadip Paul, Anirban Chatterjee, Binay Gupta, Kunal Banerjee.
CIKM Workshop on Human-in-the-Loop Data Curation (HIL-DC), October 2022, pp: 1-5.
WALTS: Walmart AutoML Libraries, Tools and Services.
Rahul Bajaj, Kunal Banerjee, Lalitdutt Parsai, Deepansh Goyal, Sachin Parmar, Divyajyothi Bn, Balamurugan Subramaniam, Chaitanya Sai, Tarun Balotia, Anirban Chatterjee, Kailash Sati.
Euromicro Conference on Software Engineering and Advanced Applications (SEAA), August 2022, pp: 21-28.
Don't Miss the Fine Print! An Enhanced Framework To Extract Text From Low Resolution Images.
Pranay Dugar, Aditya Vikram, Anirban Chatterjee, Kunal Banerjee, Vijay Agneeswaran.
International Conference on Computer Vision Theory and Applications (VISAPP), February 2022, pp: 664-671.
Look Before You Leap! Designing a Human-Centered AI System for Change Risk Assessment.
Binay Gupta, Anirban Chatterjee, Subhadip Paul, Matha Harika, Lalitdutt Parsai, Kunal Banerjee, Vijay Agneeswaran.
International Conference on Agents and Artificial Intelligence (ICAART), February 2022, pp: 655-662. [arXiv]
Designing, Developing and Deploying an Enterprise Scale Network Monitoring System.
Arkadip Basu, Rishi Singh, Chenyang Yu, Amarjeet Prasad, Kunal Banerjee.
Innovations in Software Engineering Conference (ISEC), February 2022, pages: 18:1-18:5.
From Pixels To Words: A Scalable Journey Of Text Information From Product Images To Retail Catalog.
Pranay Dugar, Rajesh Shreedhar Bhat, Asit Sharad Tarsode, Uddipto Dutta, Kunal Banerjee, Anirban Chatterjee, Vijay Srinivas Agneeswaran.
International Conference on Information and Knowledge Management (CIKM), November 2021, pp: 3787-3795.
Exploring Alternatives to Softmax Function.
Kunal Banerjee, Vishak Prasad C, Rishi Raj Gupta, Karthik Vyas, Anushree H, Biswajit Mishra.
Deep Learning Theory and Applications (DeLTA), July 2021, pp: 81-86. [arXiv]
Nominated for "Best Poster Award"
Designing a Bot for Efficient Distribution of Service Requests.
Arkadip Basu, Kunal Banerjee.
Bots in Software Engineering (BotSE)@ICSE, June 2021, pp: 16-20. [arXiv]
Harnessing Deep Learning via a Single Building Block.
Evangelos Georganas, Kunal Banerjee, Dhiraj Kalamkar, Sasikanth Avancha, Anand Venkat, Michael Anderson, Greg Henry, Hans Pabst, Alexander Heinecke.
International Parallel & Distributed Processing Symposium (IPDPS), May 2020, pp: 222-233. [arXiv]
(Preliminary version accepted as research poster in SuperComputing 2019.)
Reliability Evaluation of Compressed Deep Learning Models.
Brunno F. Goldstein, Sudarshan Srinivasan, Dipankar Das, Kunal Banerjee, Leandro Santiago, Victor C. Ferreira, Alexandre S. Nery, Sandip Kundu, Felipe M. G. Franca.
Latin American Symposium on Circuits and Systems (LASCAS), February 2020, pp: 1-5.
Training Google Neural Machine Translation on an Intel CPU Cluster.
Dhiraj Kalamkar, Kunal Banerjee, Sudarshan Srinivasan, Srinivas Sridharan, Evangelos Georganas, Mikhail E. Smorkalov, Cong Xu, Alexander Heinecke.
International Conference on Cluster Computing (CLUSTER), September 2019, pp: 1-10.
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures.
Evangelos Georganas, Sasikanth Avancha, Kunal Banerjee, Dhiraj Kalamkar, Greg Henry, Hans Pabst, Alexander Heinecke.
International Conference for High Performance Computing, Networking, Storage, and Analysis (SuperComputing), November 2018, pp: 66:1-66:12. [arXiv]
Poster: Automatic Detection of Inverse Operations while Avoiding Loop Unrolling.
Kunal Banerjee, Ramanuj Chouksey, Chandan Karfa, Pankaj Kumar Kalita.
International Conference on Software Engineering (ICSE), May 2018, pp: 175-176.
Mixed Precision Training of Convolutional Neural Networks using Integer Operations.
Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesus Corbal, Nikita Shustrov, Roma Dubtsov, Evarist Fomenko, Vadim Pirogov.
International Conference on Learning Representations (ICLR), April 2018, pp: 1-11. [arXiv]
An Equivalence Checking Framework for Array-Intensive Programs.
Kunal Banerjee, Chittaranjan Mandal, Dipankar Sarkar.
Automated Technology for Verification and Analysis (ATVA), October 2017, pp: 84-90.
An End-to-end Formal Verifier for Parallel Programs.
Soumyadip Bandyopadhyay, Santonu Sarkar, Kunal Banerjee.
International Conference on Software Technologies (ICSOFT), July 2017, pp: 388-393.
Translation Validation of Loop and Arithmetic Transformations in the Presence of Recurrences.
Kunal Banerjee, Chittaranjan Mandal, Dipankar Sarkar.
Languages, Compilers, Tools, and Theory for Embedded Systems (LCTES), June 2016, pp: 31-40.
Data-Race Detection: The Missing Piece for an End-to-End Semantic Equivalence Checker for Parallelizing Transformations of Array-Intensive Programs.
Kunal Banerjee, Soumyadip Banerjee, Santonu Sarkar.
International Workshop on Libraries, Languages, and Compilers for Array Programming (ARRAY)@PLDI, June 2016, pp: 1-8.
A Translation Validation Framework for Symbolic Value Propagation Based Equivalence Checking of FSMDAs.
Kunal Banerjee, Chittaranjan Mandal, Dipankar Sarkar.
Source Code Analysis and Manipulation (SCAM), September 2015, pp: 247-252.
A Path-Based Equivalence Checking Method for Petri net based Models of Programs.
Soumyadip Bandyopadhyay, Dipankar Sarkar, Kunal Banerjee, Chittaranjan Mandal.
International Conference on Software Engineering and Applications (ICSOFT-EA), July 2015, pp: 319-329.
Translation Validation of Transformations of Embedded System Specifications using Equivalence Checking.
Kunal Banerjee, Chittaranjan Mandal, Dipankar Sarkar.
IEEE Computer Society Annual Symposium on VLSI (ISVLSI), July 2015, pp: 183-186.
Received "Best PhD Forum Paper Award"
Circuits and Synthesis Mechanism for Hardware Design to Counter Power Analysis Attacks.
Partha De, Kunal Banerjee, Chittaranjan Mandal, Debdeep Mukhopadhyay.
Euromicro Conference on Digital System Design (DSD), August 2014, pp: 520-527.
Extending the Scope of Translation Validation by Augmenting Path Based Equivalence Checkers with SMT Solvers.
Kunal Banerjee, Chittaranjan Mandal, Dipankar Sarkar.
International Symposium on VLSI Design and Test (VDAT), July 2014, pp: 1-6.
Experimentation with SMT Solvers and Theorem Provers for Verification of Loop and Arithmetic Transformations.
Chandan Karfa, Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
IBM Collaborative Academia Research Exchange (I-CARE), October 2013, pp: 3:1-3:4.
Received "Best Paper Award"
Designing DPA Resistant Circuits Using BDD Architecture and Bottom Pre-charge Logic.
Partha De, Kunal Banerjee, Chittaranjan Mandal, Debdeep Mukhopadhyay.
Euromicro Conference on Digital System Design (DSD), September 2013, pp: 641-644.
A Value Propagation Based Equivalence Checking Method for Verification of Code Motion Techniques.
Kunal Banerjee, Chandan Karfa, Dipankar Sarkar, Chittaranjan Mandal.
International Symposium on Electronic System Design (ISED), December 2012, pp: 67-71.
Equivalence Checking of Array-Intensive Programs.
Chandan Karfa, Kunal Banerjee, Dipankar Sarkar, Chittaranjan Mandal.
IEEE Computer Society Annual Symposium on VLSI (ISVLSI), July 2011, pp: 156-161.

Others

Detecting Concept Drift in the Presence of Sparsity - A Case Study of Automated Change Risk Assessment System.
Vishwas Choudhary, Binay Gupta, Anirban Chatterjee, Subhadip Paul, Kunal Banerjee, Vijay Srinivas Agneeswaran.
Engineering Dependable and Secure Machine Learning Systems (EDSMLS)@AAAI, March 2022.
K-TanH: Hardware Efficient Activations For Deep Learning.
Abhisek Kundu, Alexander Heinecke, Dhiraj Kalamkar, Sudarshan Srinivasan, Eric C. Qin, Naveen K. Mellempudi, Dipankar Das, Kunal Banerjee, Bharat Kaul, Pradeep Dubey.
Preprint on arXiv, September 2019, arXiv:1909.07729.
A Study of BFLOAT16 for Deep Learning Training.
Dhiraj Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey.
Preprint on arXiv, May 2019, arXiv:1905.12322.
A Quick Introduction to Functional Verification of Array-Intensive Programs.
Kunal Banerjee, Chandan Karfa.
Preprint on arXiv, May 2019, arXiv:1905.09137.
Optimizing Deep Learning LSTM Topologies on Intel Xeon Architecture.
Kunal Banerjee, Evangelos Georganas, Dhiraj Kalamkar, Alexander Heinecke.
ISC High Performance, June 2019, Research Poster.
Received "Best Research Poster Award" in "Artificial Intelligence and Machine Learning" track
Understanding the Performance of Small Convolution Operations for CNN on Intel Architecture.
Alexander Heinecke, Evangelos Georganas, Kunal Banerjee, Dhiraj Kalamkar, Narayanan Sundaram, Anand Venkat, Greg Henry, Hans Pabst.
International Conference for High Performance Computing, Networking, Storage and Analysis (SuperComputing), November 2017, Research Poster.
Ternary Residual Networks.
Abhisek Kundu, Kunal Banerjee, Naveen Mellempudi, Dheevatsa Mudigere, Dipankar Das, Bharat Kaul, Pradeep Dubey.
Preprint on arXiv, July 2017, arXiv:1707.04679.
(Accepted as extended abstract in SysML 2018. Presented at Intel AI DevCon 2018.)
An Equivalence Checking Mechanism for Handling Recurrences in Array-Intensive Programs.
Kunal Banerjee.
Principles of Programming Languages (POPL): Student Research Competition, January 2015, pp: 1-2.