SHIVAM KUMAR

Senior AI Software Developer & Machine Learning Engineer

Specializing in Large Language Models, Big Data Analytics, and Government-scale AI solutions with 4+ years of experience

LLMs & Transformers
Big Data Analytics
Government Projects
Full Stack AI

About Me

I'm a Senior AI Software Developer with over 4 years of experience building cutting-edge AI solutions for government agencies, international clients, and enterprise applications. My expertise spans from fine-tuning Large Language Models to developing Big Data analytics platforms that process diverse data formats for critical decision-making.

I've had the privilege of working on high-impact projects for Indian government focusing on brand monitoring, cyber security, Security Operations Center (SOC), and Big Data Analytics, developing systems that analyze threats and extract entity relationships from complex datasets. My international experience includes EdTech platforms for UK clients and healthcare document analysis for German research institutions.

Skills Overview

Professional Experience

Senior AI Software Developer

SveltetechGurgaon, Haryana
July 2023 - Present

Contributing to Big Data and Generative AI projects, fine-tuning LLMs with multimodal data, working with NLP, Django, PySpark, Apache Flink, Kafka, Docker, and Langchain.

Senior AI Technical Specialist

Skill-Up TechnologiesNoida, UP
Jan 2022 - July 2023

Developed AI and Cloud content for IBM learning programs on Coursera, edX. Worked with Computer Vision, NLP, ML, DL, Django, MERN, Docker, LLM, Diffusion, Transformers.

AI Technical Specialist

Skill-Up TechnologiesNoida, UP
Jun 2021 - Jan 2022

AI and Cloud Subject Matter Expert for IBM learning programs across multiple platforms.

AI Engineer

Pink Tech DesignGurgaon, Haryana
Nov 2020 - Jun 2021

Computer Vision with satellite imagery, CNN, YOLO, UNET, Mask R-CNN, GAN, Autoencoder, NLP with RNN LSTM, Transformer. API development with Django REST.

Featured Projects

AI • Cybersecurity • Red Team • Blue Team

AI-Driven Cybersecurity Simulation Platform

Developed an AI-driven cybersecurity simulation platform with Red Team and Blue Team agents. The app autonomously conducts offensive (Red Team) and defensive (Blue Team) operations using AI agents, simulating real-world cyberattack and defense scenarios. Designed to enhance security testing by providing continuous, adaptive security assessments. Integrated machine learning algorithms to allow agents to evolve strategies over time. Enabled seamless user interaction for customizable environments and automated threat-response exercises.

Python
AI Agents
Cybersecurity
Machine Learning
Red Team
Blue Team
Threat Simulation
Adaptive Security
Government • Police

Government Big Data Analytics Platform

Developed a comprehensive Big Data Analytics platform for Indian government agencies including Police. Enables ingestion of diverse data formats (PDF, DOCX, images, audio, video) and performs complex queries to retrieve information such as family background and past activities of criminals. Features entity relationship extraction across multiple documents using graph database technology.

Python
Big Data
Neo4j
Apache Spark
Graph Analytics
Document Processing
Government • Cyber Security • Indian Railways • SOC

Cyber Security & SOC Platform

Built a specialized Security Operations Center (SOC) platform for Indian Railway brand monitoring and cyber security operations. System provides real-time threat analysis, brand reputation monitoring, and security incident response capabilities with advanced analytics and automated threat detection using machine learning algorithms.

Python
Cyber Security
SOC
Threat Analysis
Brand Monitoring
ML Analytics
Mobile • React Native • Published

AI Image Enhancement Mobile App

Developed and published an AI-powered image enhancement Android application using React Native. Features advanced image processing algorithms for noise reduction, sharpening, color correction, and quality enhancement. Successfully deployed on Google Play Store with positive user reviews and active user base.

React Native
AI Image Processing
Android
Computer Vision
Mobile Development
Play Store
EdTech • UK Client

EdTech Platform (GL11)

Built a competitive exam platform allowing PDF uploads, AI-generated practice questions, contextual Q&A, and intelligent feedback systems for enhanced learning experiences.

React
Node.js
Python
NLP
PDF Processing
AI Feedback
Healthcare • German Client

Medical Document Readability System

Created a solution for research students to simplify complex medical leaflets based on readability metrics, determining text comprehension levels for different demographic groups.

Python
NLP
Readability Analysis
Medical Text Processing
AI • SaaS

Advanced Virtual Assistant Platform

AI-based real-time virtual assistant with subscription integration, faster than mobile ChatGPT. Features chatbot/voice assistant embedding, custom LLMs with LangChain agents.

LangChain
LLMs
WebSocket
Payment Integration
Voice AI
AI • Analytics

Real-Time Communication Analysis

Real-time call analysis tool using MediaStream Recording API, WebSocket for speaker diarization, Whisper STT, and Pyannote for speaker embedding.

WebSocket
Whisper
Pyannote
Real-time Processing
Audio Analysis
Computer Vision • Smart City

Smart City Traffic Optimization

Traffic flow optimization system analyzing real-time video feeds, object detection with YOLOv5, predictive modeling with LSTM, and simulation environment for testing.

YOLOv5
OpenCV
LSTM
FastAPI
React
Time Series

Technical Skills

AI/ML

Machine Learning
Deep Learning
LLMs
Hugging Face Transformers
Computer Vision
NLP
Diffusion
Prompt Engineering
Time Series Forecasting
Bayesian Statistics

Programming

Python
JavaScript
SQL
NoSQL
HTML/CSS

Frameworks

React
Next.js
Node.js
Django REST API
FastAPI
MERN Stack
React Native
Bootstrap 5
Redux
Streamlit

Big Data

PySpark
Apache Kafka
Apache Flink
Elasticsearch
Neo4j
Big Data Analysis

Cloud/DevOps

Docker
Kubernetes
GCP
IBM Cloud
Terraform
Git
Agile Development

Libraries

TensorFlow
PyTorch
Keras
Scikit-Learn
LangChain
OpenCV
Transformers
Diffusers

Certifications

IBM Applied AI Professional Course - EDX

IBM AI Engineering Professional Course - Coursera

IBM DevOps and Software Engineering Professional Certificate - Coursera

IBM Full Stack Cloud Developer Professional Course - EDX

Google Cloud Platform Professional Course - Coursera

Let's Connect

Interested in collaborating on AI projects or discussing opportunities? I'm always open to new challenges and innovative solutions.