prakashjay prakashjayy

Hi there 👋

My name is Prakash,

Current - AI Research Scientist at invideo.ai working as Generative AI team lead building multi-model systems from last 1.5 year, Previously

At qure.ai I worked as Senior Director - Data science leading 3D (CT scan - Chest and Brain) team for 3 years.
At fractal.ai, worked as Data scientist for one year and later worked as Senior Data scientist for 3 years.

You can find me at

About me 🕴️

I have been working in the deep learning field for over 9 years. I enjoy learning things from scratch and love writing and teaching about them. I am known for my attention to detail—you'll often find me documenting my findings in thorough reports. For any project, I start by gathering information, collecting data, and establishing a solid evaluation framework. I focus on understanding the fundamentals and take a mathematical approach to every problem.

List of projects I worked and went into production.

Talking heads: Multimodal Generative AI system. Trained GAN models on 5k+ hours of audio+video data. experimenting with diffusion foundational models now [2025]
SyncNet: Contrastive learning, CLIP type model for audio+video alignment. Trained on 10k+ hours of audio+video data. [2024]
Identifying defects from Product X-rays. Non-destructive testing . self supervision (200k+ images) + segmentation (25k+ images).
3D object detection: Finding nodules on Chest CT. Trained on 200k CT Scans. [2023]
2D object detection: Identifying Humans in drone footage - thermal and normal cameras - Trained on 100+ hours of data. [2022]
2D object detection and long tail classification: Quantifying brand presence in retail shelfs, Detecting 200+ objects from single image and identifying 5000+ SKUs. Trained on 100k+ images [2020-21]
Image classification: Identifying diabetic retinopathy from Eye Images. Trained on 50k images [2018]. This is more of a research project

Competitions

Below are some competations where I was in Top-25

Using focal loss on deep recommender systems - 14th Position
AV-Fractal-hiring hackathon on Forecasting - 9th Position

Blogs

The following blogs are written by me on various platforms.

All my generative AI blogs are written here. details on diffusion, flow, score and GAN based models are detailed.
some of blogs on foundation models were written here

GenAI

Foundation models

object detection

image classification

Structural reparameterization
CSPnet
Classification architecture review Alexnet-SENet
Understanding Resnet and Resnext Part1 Part2
Almost any image classification using Pytorch-2018
Transfer learning using keras - 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

prakashjay prakashjayy

Achievements

Achievements

Block or report prakashjayy

Hi there 👋

About me 🕴️

List of projects I worked and went into production.

Competitions

Blogs

GenAI

Foundation models

object detection

image classification

Others

Engineering

languages and tools

Pinned Loading

Uh oh!