Prasanna Biswas's photo

AI Software Solutions Engineer
Intel Corporation
Intel Habana AI
AI Kernel Development Team
Bangalore, KA - 560037

LinkedIn | Resume
| CV | Github Profile


About me | Research | Patent | Publications | Youtube | Projects | Technologies I am currently working as an AI Software Solutions Engineer at Intel Habana, Bangalore, where I specialize in developing high-performance kernels for Intel’s next-generation GPUs. My work involves optimizing latency, memory bandwidth, I/O access, and compute utilization using SYCL, with a particular focus on enabling dynamic shape support for advanced workloads.
I have co-authored two research papers, one submitted to CVR 2025 and another to IEEE CONNECT 2025.
Prior to joining Intel, I worked as a Senior Machine Learning Engineer at Qualcomm, where I was responsible for optimizing ONNX models for Natural Language Processing (NLP), Computer Vision (CV), and Large Language Models (LLMs) to enhance inference speed on Qualcomm’s AI100 accelerator. My role involved designing and implementing software modules for AI/Deep Neural Network frameworks using C++ and Python. Additionally, I contributed to the implementation of node fusion as a custom operation for performance optimization.
Before my tenure at Qualcomm, I was a Research Assistant at the Center for Indian Language Technology (CFILT), Indian Institute of Technology Bombay, where I was part of the IBM-AIHN Network. My research focused on analyzing the impact of sarcasm on emotion detection using a multi-modal approach. My primary area of interest is Machine Learning for Natural Language Processing (NLP), with a strong passion for language generation, language models, and multimodal NLP.
At Intel, I was involved in research on generating machine-style handwriting using a diffusion-based text-conditioned latent model combined with VAE decoding.
Previously, as a Research Assistant, I developed a multimodal approach to understanding emotions in sarcasm, focusing on the incongruities in language and the hidden emotions behind textual expressions. My work in text-to-text NLP has revolved around emotion classification, sentiment analysis, and sarcasm detection.
Over the years, I have been particularly fascinated by generative modeling for conversational AI and have dedicated significant time to optimizing computational efficiency and memory requirements for such models. Additionally, I have worked with Graph Neural Networks (GNNs) for efficient compilation of ML models, enabling improved model performance and scalability.
  1. Himanshu Upreti , Dheeraj Gattupalli , Vinayak Baddi, Mohit Sharma, Prasanna Biswas and Anuj Gupta. June 06,2023. "Pre-Processing For Deep Neural Network Compilation Using Graph Neural Networks". (Pending)
Python Instructor
YouTube Video Thumbnail

Learn how to automate web tasks using Python and Selenium! Click the thumbnail or button below to watch.

Watch Video

YouTube Playlist Thumbnail

Check out this complete Python tutorial series covering everything from basics to advanced topics!

Watch Playlist


Projects for fun
  1. K-Means clustering from scratch: K-Means-Clustering
  2. GPT from scratch - by Andrej: NanoGPT
  3. Term-Deposit Prediction: Term-Deposit-Subscription-Prediction-via-PySpark
  4. Classic Snake Game: Classic-Snake-Game
  5. Coding Questions: Questions for fun
Computational Model for Understanding Emotions in Sarcasm Investigating Importance of Multiple Modalities in Sarcasm Detection Technology I use for Machine Learning
  1. PyTorch: Machine-Learning-With-Pytorch
  2. ONNX
  3. ONNX Runtime
Programming
  1. Python
  2. C++
ML Domain & Techniques:
  1. Natural Language Processing
  2. Computer Vision
  3. Quantization
  4. Pruning
  5. Node Fusion
  6. Graph Optimization
Others:
  1. Docker
  2. Git
  3. AWS
Last updated on 13/03/2025 at 08:57 p.m. IST