Great score on Implement LRU Cache Algorithm, Difficulty Hard
Great score on Understanding Rate Limiting in APIs, Difficulty Medium
Great score on Ensuring Real-time Data Accuracy in Mobile App, Difficulty Medium
Great score on Validate Parentheses in a String, Difficulty Medium
Great score on Word Search in Grid, Difficulty Hard
Great score on Reverse Digits of a 32-bit Integer, Difficulty Hard
Workflow Automation – built n8n-style engine enabling staff/clients to design and run custom workflows
Data Engineering – created ETL pipelines on GCP (Data Stream, Eventarc) for workflow automation
Agentic AI Development – built onboarding agent using LiveKit (real-time comms), DeepGram (speech-to-text), Google TTS (voice output)
RAG Development: Utilized Scrapy and Selenium for data scraping, focusing on childcare websites. Implemented scraping on Digital Ocean VM in headless mode.
Generated text embeddings with OpenAI’s Text Embedding 002, initially storing data in MongoDB Atlas, migrating to Elasticsearch for search optimization, and later moving to Pinecone for efficient vector management and integration.
Developed a Retrieval-Augmented Generation (RAG) system using Google Gemini Vertex AI, Claude by Anthropic, and OpenAI. Built the RAG application with LangChain and monitored performance using LangSmith.
Created a FastAPI backend for the RAG bot, implementing real-time streaming with WebSockets.
Build Automatic embedding pipeline using Cloudflare, VoyageAI
Build Agents using crewAI, and deploy them over Cloud Run using FastAPI
Set up CI/CD pipelines with GitHub Actions and deployed the system via Google Cloud Run and Cloud Build.
Implemented FastAPI, Docker, Pydantic, and PostgreSQL to wrap models, providing user-friendly interfaces.
Trained YOLOv5 and UltraLytics models for precise product detection in the retail industry.
Utilized VGG16 for image feature extraction, enhancing model accuracy and performance.
Contributed to automatic training and testing pipelines for efficient machine learning model development.
Managed datasets through dedicated pipelines, ensuring organized and accessible data.
Utilized Celery, Redis, and RabbitMQ for background tasks and queue-based communication.
Deployed complete pipeline on ec2
Applied Python for image processing and computer vision tasks.
Implemented Docker for containerization, enabling microservices deployment and system scalability.
Integrated sockets for real-time communication within the system.
Developed natural language processing models in Python for phishing content prediction using TF-IDF features and Random Forest, with three models in production.
Integrated computer vision features with TF-IDF for enhanced model performance.
Conducted feature engineering, model training, and testing, continually exploring techniques to improve precision and recall, and proficient in reporting model metrics.
Trained Doc2vec on phishing content for word embedding.
Proficient in Python, Linux, Shell scripting, AWK, handling large CSV files, and using development tools like PyCharm, scikit-learn, and PyTorch, with experience in model stacking techniques.
Developed and implemented object detection and tracking algorithms for real-time applications.
Utilized homography techniques to map world coordinates to a 2D plane, enhancing spatial understanding.
Conducted training of the InceptionV2 model and developed a Keras image classifier for improved accuracy.
Engineered a real-time analytics system, significantly boosting performance from 5 frames per second (fps) to 33 fps.
Resume Parsing and Question & Answering Techniques on unstructured data (Spacy, NLTK, Stanford NLP).
Analyzed facial expression of candidate for emotions, voice and stress level during the interview
Human Resource Management System, Develop new modules in mobile application. SMS verification, Push Notifications, and One Signal. Uploading builds for IOS and android.
Worked on the 'Locomotive-AI' project, utilizing LiDAR and 2D imagery for locomotive detection and track analysis. Developed algorithms to detect locomotives on the main track, identify turnaround points, segment the track, and trigger alerts as necessary. Main technical focus on Python-based AI and computer vision.
Executed a straightforward Image Processing project for real-time vehicle detection and classification from live camera streams. Implemented functions for vehicle counting and peak-hour vehicle analysis. Tasked with identifying and categorizing vehicles into five distinct classes, such as red taxi, red bus, and more. Developed the project within the Django framework. Leveraged AWS cloud services for hosting and deployment. Utilized the Python programming language for image processing and classification tasks.
Collaborated with a prominent big data scraping firm, specializing in frequent data extraction from major client websites. Proficiently employed a tech stack centered around Python, including Scrapy, an in-house API for handling captchas and bans, Beautiful Soup, HTML, and regular expressions (regex). Addressed challenges related to website bans by implementing proactive measures, such as rotating proxies and IP management, ensuring uninterrupted data scraping operations. Processed and delivered scraped data for utilization in AI algorithms, contributing to valuable insights and informed decision-making. Developed a Selenium-based web scraper specifically for extracting job listings from LinkedIn, followed by data analysis. Applied question-answering algorithms to extract seniority levels, salary information, and other relevant details from the scraped job data.
Received hotel vendor invoices written in Danish, a major European language, for processing. Conducted Name Entity Recognition (NER) and data analysis on these invoices, which were provided in PDF and image formats. Overcame the significant challenge of dealing with inconsistent and non-standard data formats. Designed a robust solution, involving the training of a computer vision algorithm, YOLOv8, to identify relevant areas on the invoices. Utilized Optical Character Recognition (OCR) to extract text from the identified areas accurately. Trained BERT (Bidirectional Encoder Representations from Transformers) for Named Entity Recognition (NER) to categorize and extract specific entities from the extracted text. Designed the architectural framework of the product, incorporating it into a Flask-based API for seamless integration and scalability. Managed a team of 5 individuals, overseeing their roles and responsibilities in the project. Established direct communication channels with clients to ensure project alignment and meet their specific requirements.
Employed computer vision algorithms to train models for product recognition on supermarket shelves. Implemented image stitching techniques to create comprehensive views of supermarket shelves for improved analysis. Designed the architecture of the product, incorporating microservices for scalability and modularity. Utilized Minio as a storage solution to store machine learning models and associated data. Leveraged Docker for containerization, ensuring consistency and portability of the deployment environment. Acquired proficiency in FastAPI and utilized it to wrap the trained models, making them accessible via API endpoints. Managed data storage using PostgreSQL, ensuring efficient and secure data management within the project.
Developed matching algorithms to identify relevant connections and associations within the scraped data. Trained a classifier to categorize creators into their primary niches based on their content and profiles. Collaborated with a team to design and build an analytics dashboard for data visualization and insights. Utilized GPT-3 to generate biographies for Instagram profiles, enhancing profile descriptions. Leveraged Python for Natural Language Processing (NLP) and Computer Vision tasks. Employed ElasticSearch and Kibana for data indexing, searching, and visualization within the project.

Sahab T. is senior Level Developer