AI Engineer

Full-time available

Bio

Sahab M T is a highly experienced Senior Machine Learning Engineer with a strong background in building end-to-end AI, machine learning, and data-driven systems across NLP, computer vision, and generative AI. He specializes in Python-based ML development, leveraging frameworks such as Scikit-learn, TensorFlow, PyTorch, OpenCV, and YOLO for tasks including object detection, tracking, pose estimation, 3D computer vision with LiDAR, and real-time analytics. His expertise in NLP and Retrieval-Augmented Generation (RAG) spans…

Expertise

PythonFlaskMongoDBPyTorchSQLAWSFast APIWebScrapingDatabase:DocumentDatabase:RelationalGitHTMLDatabase:CacheingKubernetes

Video Intro

Assessments

Project Assessment

Great score on Implement LRU Cache Algorithm, Difficulty Hard · Great score on Understanding Rate Limiting in APIs, Difficulty Medium · Great score on Ensuring Real-time Data Accuracy in Mobile App, Difficulty Medium

Projects

Locomotive-AI

Worked on the 'Locomotive-AI' project, utilizing LiDAR and 2D imagery for locomotive detection and track analysis. Developed algorithms to detect locomotives on the main track, identify turnaround points, segment the track, and trigger alerts as necessary. Main technical focus on Python-based AI and computer vision.

Gas-Station

Executed a straightforward Image Processing project for real-time vehicle detection and classification from live camera streams. Implemented functions for vehicle counting and peak-hour vehicle analysis. Tasked with identifying and categorizing vehicles into five distinct classes, such as red taxi, red bus, and more. Developed the project within the Django framework. Leveraged AWS cloud services for hosting and deployment. Utilized the Python programming language for image processing and classification tasks.

Data Scrapping

Collaborated with a prominent big data scraping firm, specializing in frequent data extraction from major client websites. Proficiently employed a tech stack centered around Python, including Scrapy, an in-house API for handling captchas and bans, Beautiful Soup, HTML, and regular expressions (regex). Addressed challenges related to website bans by implementing proactive measures, such as rotating proxies and IP management, ensuring uninterrupted data scraping operations. Processed and delivered scraped data for utilization in AI algorithms, contributing to valuable insights and informed decision-making. Developed a Selenium-based web scraper specifically for extracting job listings from LinkedIn, followed by data analysis. Applied question-answering algorithms to extract seniority levels, salary information, and other relevant details from the scraped job data.

Danish-NER

Received hotel vendor invoices written in Danish, a major European language, for processing. Conducted Name Entity Recognition (NER) and data analysis on these invoices, which were provided in PDF and image formats. Overcame the significant challenge of dealing with inconsistent and non-standard data formats. Designed a robust solution, involving the training of a computer vision algorithm, YOLOv8, to identify relevant areas on the invoices. Utilized Optical Character Recognition (OCR) to extract text from the identified areas accurately. Trained BERT (Bidirectional Encoder Representations from Transformers) for Named Entity Recognition (NER) to categorize and extract specific entities from the extracted text. Designed the architectural framework of the product, incorporating it into a Flask-based API for seamless integration and scalability. Managed a team of 5 individuals, overseeing their roles and responsibilities in the project. Established direct communication channels with clients to ensure project alignment and meet their specific requirements.

SalesArt-AI

Employed computer vision algorithms to train models for product recognition on supermarket shelves. Implemented image stitching techniques to create comprehensive views of supermarket shelves for improved analysis. Designed the architecture of the product, incorporating microservices for scalability and modularity. Utilized Minio as a storage solution to store machine learning models and associated data. Leveraged Docker for containerization, ensuring consistency and portability of the deployment environment. Acquired proficiency in FastAPI and utilized it to wrap the trained models, making them accessible via API endpoints. Managed data storage using PostgreSQL, ensuring efficient and secure data management within the project.

Cultural-Influencers

Developed matching algorithms to identify relevant connections and associations within the scraped data. Trained a classifier to categorize creators into their primary niches based on their content and profiles. Collaborated with a team to design and build an analytics dashboard for data visualization and insights. Utilized GPT-3 to generate biographies for Instagram profiles, enhancing profile descriptions. Leveraged Python for Natural Language Processing (NLP) and Computer Vision tasks. Employed ElasticSearch and Kibana for data indexing, searching, and visualization within the project.

Experience

Senior Machine Learning Engineer
Complya LLCJan 2024 — Present
- Workflow Automation – built n8n-style engine enabling staff/clients to design and run custom workflows
- Data Engineering – created ETL pipelines on GCP (Data Stream, Eventarc) for workflow automation
- Agentic AI Development – built onboarding agent using LiveKit (real-time comms), DeepGram (speech-to-text), Google TTS (voice output)
- RAG Development: Utilized Scrapy and Selenium for data scraping, focusing on childcare websites. Implemented scraping on Digital Ocean VM in headless mode.
- Generated text embeddings with OpenAI’s Text Embedding 002, initially storing data in MongoDB Atlas, migrating to Elasticsearch for search optimization, and later moving to Pinecone for efficient vector management and integration.
- Developed a Retrieval-Augmented Generation (RAG) system using Google Gemini Vertex AI, Claude by Anthropic, and OpenAI. Built the RAG application with LangChain and monitored performance using LangSmith.
- Created a FastAPI backend for the RAG bot, implementing real-time streaming with WebSockets.
- Build Automatic embedding pipeline using Cloudflare, VoyageAI
- Build Agents using crewAI, and deploy them over Cloud Run using FastAPI
- Set up CI/CD pipelines with GitHub Actions and deployed the system via Google Cloud Run and Cloud Build.
n8nGCPLiveKitDeepGramGoogle TTSScrapySeleniumOpenAIMongoDBElasticsearchPineconeGoogle Gemini Vertex AIClaude by AnthropicFastAPICloudflareVoyageAIcrewAILangChainLangSmithGitHub ActionsGoogle Cloud RunCloud Build
Machine Learning Engineer
SalesArt AIJun 2022 — Jan 2024
- Implemented FastAPI, Docker, Pydantic, and PostgreSQL to wrap models, providing user-friendly interfaces.
- Trained YOLOv5 and UltraLytics models for precise product detection in the retail industry.
- Utilized VGG16 for image feature extraction, enhancing model accuracy and performance.
- Contributed to automatic training and testing pipelines for efficient machine learning model development.
- Managed datasets through dedicated pipelines, ensuring organized and accessible data.
- Utilized Celery, Redis, and RabbitMQ for background tasks and queue-based communication.
- Deployed complete pipeline on ec2
- Applied Python for image processing and computer vision tasks.
- Implemented Docker for containerization, enabling microservices deployment and system scalability.
- Integrated sockets for real-time communication within the system.
FastAPIDockerPydanticPostgreSQLYOLOv5UltraLyticsVGG16CeleryRedisRabbitMQPython
Machine Learning Engineer
SlashNext IncJun 2020 — Jan 2022
- Developed natural language processing models in Python for phishing content prediction using TF-IDF features and Random Forest, with three models in production.
- Integrated computer vision features with TF-IDF for enhanced model performance.
- Conducted feature engineering, model training, and testing, continually exploring techniques to improve precision and recall, and proficient in reporting model metrics.
- Trained Doc2vec on phishing content for word embedding.
- Proficient in Python, Linux, Shell scripting, AWK, handling large CSV files, and using development tools like PyCharm, scikit-learn, and PyTorch, with experience in model stacking techniques.
PythonTF-IDFRandom ForestDoc2vecLinuxShell scriptingAWKPyCharmscikit-learnPyTorch
Computer Vision Developer
SixlogicsMay 2019 — Jun 2021
- Developed and implemented object detection and tracking algorithms for real-time applications.
- Utilized homography techniques to map world coordinates to a 2D plane, enhancing spatial understanding.
- Conducted training of the InceptionV2 model and developed a Keras image classifier for improved accuracy.
- Engineered a real-time analytics system, significantly boosting performance from 5 frames per second (fps) to 33 fps.
InceptionV2Keras
Machine Learning Engineer
XeruixMay 2020 — Aug 2020
- Resume Parsing and Question & Answering Techniques on unstructured data (Spacy, NLTK, Stanford NLP).
- Analyzed facial expression of candidate for emotions, voice and stress level during the interview
SpacyNLTKStanford NLP
Software Engineering Intern
Grey NeonJan 2019 — May 2019
- Human Resource Management System, Develop new modules in mobile application. SMS verification, Push Notifications, and One Signal. Uploading builds for IOS and android.

Education

Bachelor Computer Science
COMSATS UniversityLahore2015 — 2019
ICS
Government College UniversityLahore2013 — 2015

AI Engineer

Full-time available

Sahab is available for hire

Schedule an interview

14 days risk-free trial · No commitments · We handle contracts and payroll

Bio

Projects completed6
Hourly rate
4+ years experience
Member since Jan 2026

Expertise

PythonFlaskMongoDBPyTorchSQLAWSFast APIWebScrapingDatabase:DocumentDatabase:RelationalGitHTMLDatabase:CacheingKubernetes

Video Intro

Assessments

Project Assessment

Projects

Locomotive-AI

Gas-Station

Data Scrapping

Danish-NER

SalesArt-AI

Cultural-Influencers

Experience

Senior Machine Learning Engineer
Complya LLCJan 2024 — Present
- Workflow Automation – built n8n-style engine enabling staff/clients to design and run custom workflows
- Data Engineering – created ETL pipelines on GCP (Data Stream, Eventarc) for workflow automation
- Agentic AI Development – built onboarding agent using LiveKit (real-time comms), DeepGram (speech-to-text), Google TTS (voice output)
- RAG Development: Utilized Scrapy and Selenium for data scraping, focusing on childcare websites. Implemented scraping on Digital Ocean VM in headless mode.
- Generated text embeddings with OpenAI’s Text Embedding 002, initially storing data in MongoDB Atlas, migrating to Elasticsearch for search optimization, and later moving to Pinecone for efficient vector management and integration.
- Developed a Retrieval-Augmented Generation (RAG) system using Google Gemini Vertex AI, Claude by Anthropic, and OpenAI. Built the RAG application with LangChain and monitored performance using LangSmith.
- Created a FastAPI backend for the RAG bot, implementing real-time streaming with WebSockets.
- Build Automatic embedding pipeline using Cloudflare, VoyageAI
- Build Agents using crewAI, and deploy them over Cloud Run using FastAPI
- Set up CI/CD pipelines with GitHub Actions and deployed the system via Google Cloud Run and Cloud Build.
n8nGCPLiveKitDeepGramGoogle TTSScrapySeleniumOpenAIMongoDBElasticsearchPineconeGoogle Gemini Vertex AIClaude by AnthropicFastAPICloudflareVoyageAIcrewAILangChainLangSmithGitHub ActionsGoogle Cloud RunCloud Build
Machine Learning Engineer
SalesArt AIJun 2022 — Jan 2024
- Implemented FastAPI, Docker, Pydantic, and PostgreSQL to wrap models, providing user-friendly interfaces.
- Trained YOLOv5 and UltraLytics models for precise product detection in the retail industry.
- Utilized VGG16 for image feature extraction, enhancing model accuracy and performance.
- Contributed to automatic training and testing pipelines for efficient machine learning model development.
- Managed datasets through dedicated pipelines, ensuring organized and accessible data.
- Utilized Celery, Redis, and RabbitMQ for background tasks and queue-based communication.
- Deployed complete pipeline on ec2
- Applied Python for image processing and computer vision tasks.
- Implemented Docker for containerization, enabling microservices deployment and system scalability.
- Integrated sockets for real-time communication within the system.
FastAPIDockerPydanticPostgreSQLYOLOv5UltraLyticsVGG16CeleryRedisRabbitMQPython
Machine Learning Engineer
SlashNext IncJun 2020 — Jan 2022
- Developed natural language processing models in Python for phishing content prediction using TF-IDF features and Random Forest, with three models in production.
- Integrated computer vision features with TF-IDF for enhanced model performance.
- Conducted feature engineering, model training, and testing, continually exploring techniques to improve precision and recall, and proficient in reporting model metrics.
- Trained Doc2vec on phishing content for word embedding.
- Proficient in Python, Linux, Shell scripting, AWK, handling large CSV files, and using development tools like PyCharm, scikit-learn, and PyTorch, with experience in model stacking techniques.
PythonTF-IDFRandom ForestDoc2vecLinuxShell scriptingAWKPyCharmscikit-learnPyTorch
Computer Vision Developer
SixlogicsMay 2019 — Jun 2021
- Developed and implemented object detection and tracking algorithms for real-time applications.
- Utilized homography techniques to map world coordinates to a 2D plane, enhancing spatial understanding.
- Conducted training of the InceptionV2 model and developed a Keras image classifier for improved accuracy.
- Engineered a real-time analytics system, significantly boosting performance from 5 frames per second (fps) to 33 fps.
InceptionV2Keras
Machine Learning Engineer
XeruixMay 2020 — Aug 2020
- Resume Parsing and Question & Answering Techniques on unstructured data (Spacy, NLTK, Stanford NLP).
- Analyzed facial expression of candidate for emotions, voice and stress level during the interview
SpacyNLTKStanford NLP
Software Engineering Intern
Grey NeonJan 2019 — May 2019
- Human Resource Management System, Develop new modules in mobile application. SMS verification, Push Notifications, and One Signal. Uploading builds for IOS and android.

Education

Bachelor Computer Science
COMSATS UniversityLahore2015 — 2019
ICS
Government College UniversityLahore2013 — 2015

Similar developers

We have moreWith similar skills.

Other vetted developers with similar skills and experience

Saad Ahmed S.

AI Engineer

3-4 Years of Experience

FlaskKubernetesGitPython

View

Bilal S.

AI Engineer

0-2 Years of Experience

NodeJSReactSQLPython

View

Roshan K.

AI Engineer

3-4 Years of Experience

PythonNodeJSReactFlask

View

View all

AI Engineer

Project Assessment

Locomotive-AI

Gas-Station

Data Scrapping

Danish-NER

SalesArt-AI

Cultural-Influencers

Senior Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer

Computer Vision Developer

Machine Learning Engineer

Software Engineering Intern

Bachelor Computer Science

ICS

AI Engineer

Project Assessment

Locomotive-AI

Gas-Station

Data Scrapping

Danish-NER

SalesArt-AI

Cultural-Influencers

Senior Machine Learning Engineer

Machine Learning Engineer

Machine Learning Engineer

Computer Vision Developer

Machine Learning Engineer

Software Engineering Intern

Bachelor Computer Science

ICS

We have moreWith similar skills.

Hire

Company

Resources

For Developers

For Bots