Spaces:

prakashknaikade
/

Ask-About-Me

Running

App Files Files Community

Ask-About-Me / docs /biography.txt

prakashknaikade

intial commit

359520d 4 months ago

raw

history blame contribute delete

8.19 kB

	Prakash Naikade
	Computer Vision & Machine Learning Engineer

	PROFILE
	I am passionate about Machine Learning, especially Computer Vision and Generative AI. I have hands-on experience
	from academia and industry. My research interests span in the broad areas of 3D-Reconstruction, Scene Understanding,
	Neural Rendering, Radiance Field, Motion Capture, Digital Twins, LLMs, AR/VR, and generally Computer Vision,
	Computer Graphics, GenAI, Human Computer Interaction, Deep Learning, Machine Learning, and Data Science, to
	solve real-world problems with impactful AI aided solutions.

	EDUCATION
	MS Media Informatics Saarland University, Germany
	Oct 2020 – Present
	Grade: 1.6/5.0 (1.0 being the best possible score)
	Thesis: Novel View Synthesis of Structural Color Objects Created by Laser Markings. (1.3)
	Relevant Courses: Computer Graphics, Image Processing & Computer Vision, Neural Networks: Theory & Implementation,
	High-Level Computer Vision, Statistics with R, Adversarial Reinforcement Learning, Human Computer Interaction,
	Games & Interactive Media.
	[Audited]: Geometric Modeling, Machine Learning, AI, Ethics for Nerds

	BEng Computer Engineering Pune University, India
	June 2011 – May 2015
	Grade: 65% (First Class)
	Thesis: Secure Data Storage on Multi-Cloud Using DNA Based Cryptography.
	Relevant Courses: Data Structures and Algorithms, Design & Analysis of Algorithms, Software Architecture, Software
	Engineering, Software Testing & Quality Assurance, Microprocessors & Microcontrollers

	PROFESSIONAL EXPERIENCE
	Junior Researcher (HiWi) Saarbrücken, Germany
	August-Wilhelm Scheer Institute
	Sept 2023 – Dec 2024
	• Worked on several applied research projects, including MediHopps, iperMö, FläKI and VuLCAn.
	• Implemented advanced deep learning methods for human action recognition (HAR) and body pose estimation (HPE),
	and delivered detailed performance evaluations of these models, along with a trained HAR model (ST-GCN++) for
	custom rehabilitation exercise data captured in the lab.
	• Contributed significantly to the feature extraction, generation, and visualization of furniture functionalities in the
	Python codebase for the iperMö project, developing an AR application to turn individual furniture wishes into reality.
	• Systematic Literature Research and Reviews, Project Proposals and Scientific Literature Writing.
	• Generally worked on computer vision, computer graphics, and machine/deep learning tasks like human pose esti-
	mation, human action recognition, and some XR tasks.

	Research Assistant Saarbrücken, Germany
	AIDAM, Max Planck Institute for Informatics Advisor: Dr Vahid Babaei
	July 2023 – Aug 2024
	• Worked on Radiance Field methods for Novel View Synthesis of structural color objects created by laser markings.
	• Benchmarked SOTA radiance methods for synthetic scene involving Structural Color Object created in Blender.
	• Developed capture setup to capture highly reflective and shiny structural color paintings on metal substrates.
	• Improved the scene optimization using geometric prior and Anisotropy Regularizer in 3D Gaussian-Splatting method.
	• Presented comprehensive experiments to demonstrate methods for simulating structural color objects before printing
	them using only captured images of laser-printed primaries.
	• Facilitated interactive visualization of view-dependent structural color objects in web viewer.

	Computer Vision Intern Münster, Germany
	BASF-Coatings GmbH
	March 2023 – May 2023
	• Developed dataset for adhesive test and corrosion detection on images of test panels of metal substrates.
	• Developed framework and trained YOLOv8 model for adhesive tests’ detection and UNet for corrosion detection
	using created dataset for automation project.

	Computer Vision Intern Aachen, Germany
	Fenris GmbH
	May 2022 – Sept 2023
	• Contributed to markerless motion capture solutions using single and multiple cameras for athlete motion tracking
	and analysis.
	• Conducted a comprehensive literature research and review focused on deep learning approaches for human pose
	estimation and benchmarked SOTA approaches for domain specific video data.
	• Worked on different tasks such as camera calibration, deep learning based human pose estimation & golf sequence
	detection, estimating joint angles from 3D body poses, comparing two pose sequences and visualization of results
	in Blender and Unity.

	Indian Civil Services Exam Preparation
	Jun 2015 – July 2019
	During the preparation of this exam, I gained Under-Graduate level knowledge of Anthropology, Polity, Governance,
	Indian Constitution, Social Justice, International Relations, Economics (Macro), Indian & World Geography, Indian &
	World History, Indian Culture & Society, Environment, and Ethics. (Overall pass percentage of candidates ≈ 0.1%)

	SKILLS
	• Programming: Python, C#, C++, R, SQL, Matlab
	• Frameworks: PyTorch, TensorFlow, NumPy, Pandas, SKLearn, OpenCV, Open3D, Matplotlib, HuggingFace
	• Tools: Conda, Jupyter Notebook, Git, Unity, Blender, Metashape, Colmap, Meshlab, Docker, Slurm/HPC, DevOps
	• OS: Linux, Windows, Shell/Dos Scripting
	• Concepts: Regression, k-NN, k-Means Clustering, PCA, SVM, Neural Networks, CNN, RNN, LSTM, Transformers,
	ViT, CLIP, Autoencoders, VAE, GAN, Diffusion Models, LLMs, NLP, GPT, Prompt Engineering, LangChain, 2D/3D
	Image Processing, Object Detection, Classification, Localization, Segmentation, NeRF, 3DGS, 3D Reconstruction,
	Scene Understanding, Scene Interaction, HCI, XR, Reinforcement Learning

	PROJECTS
	Learn-LLMs GenAI, Information Retrieval
	Getting a hands-on experience of using different LLM models and tools, to understand the finetuning, data preparation,
	evaluation & other techniques related to LLMs such as RAG.

	Diffusion Models Computer Vision, GenAI
	This Project is a basic implementation of Diffusion Model to understand how diffusion works.

	Human Action Recognition (HAR) Computer Vision
	Investigating the performance of different deep learning models and their ensembles used for HAR in still images.

	Image Segmentation on PASCAL VOC and Cityscapes Datasets Computer Vision
	Training and Evaluation of CNNs like UNet, RU-Net and R2U-Net for Image Segmentation.

	COVID-19 Detection Computer Vision
	TensorFlow implementation of model based on ResNet50 architecture for COVID-19 detection on Chest X-rays using
	dataset sourced from Kaggle.

	Object Detection Computer Vision
	Training an object detection model on custom dataset (Oxford Pets dataset) using TensorFlow Object Detection API 2.

	Easy Flappy Bird Game Development
	An simple implementation of Flappy Bird game using Unity and C#.

	Roman Villa Nennig Bot - Your virtual guide to Roman Villa Nennig NLP
	This chatbot helps the user throughout their journey of visiting a museum of the Roman Villa Nennig, developed using
	Google Cloud, Dialogflow Essentials and Telegram.

	Ludwig Palette - an AR painting game AR/VR
	App developed in Unity and C# allows visitors of Ludwigskirche to explore its architecture by painting on its surfaces
	and understand the intricacies of sculptures inside the church.

	Mini-RayTracer Computer Graphics
	Simple ray tracing engine developed in C++.

	Synthetic Dataset Computer Graphics
	Generate simple 3D rendered datasets in Blender and Unity.

	PUBLICATIONS
	• Secure Data Storage on Multi-Cloud Using DNA Based Cryptography. D Zingade, S Dhuri, P Naikade, N Gade,
	A Teke, International Journal of Advance Engineering and Research Development March 2015

	CERTIFICATIONS
	• Kaggle: Python, ML, Pandas, Feature Engineering, Data Visualization, Data Cleaning, SQL, Reinforcement Learning
	& Game AI, Time Series
	• Udacity: C++, AWS ML Foundations
	• Coursera: Mathematics for Machine Learning and Data Science, Structuring ML Project, Neural Network and Deep
	Learning, Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
	• Udemy: Foundations of MR, XR, VR Development on Quest headsets with Meta’s Presence Platform and Unity.
	• DataCamp: Intermediate R, Data in R
	• Memgraph: Graph Analytics

	LANGUAGES
	English (Fluent), Hindi (Fluent), Marathi (Native), German (Elementary)

	HOBBIES
	Biking, Running, Hiking, Movies, Music