I am a research engineer focusing upon building surpassing the latest and greatest in AI. My focus is on NLP, ASR, TTS, VC, OCR, FR with a particular emphasis on large generative models.
I helped several startups and companies to develop algorithms for product recommendation, product quality assessment, chatbots, stock/crypto trading, OCR systems, object recognition, video analysis (retrieval, anomaly detection, moment detection, highlight detection), biometric identification, etc.
I'm well-versed in working with transformer models and can help you fine-tune and transfer learning to get the most out of your own data.
If you have text data I can do text classification, natural language understanding, and natural language generation.
If you're looking for semantic search or similarity search, I can create high-quality embeddings and develop a state-of-the-art semantic search system using sentence transformers.
If you have speech corpus I can do train and finetune your own ASR/TTS/VC models.
Core Skills:
- Programming: Python, C/C++/C#, Java, Java Script
- AI & Machine Learning: TensorFlow, PyTorch, Caffe, YoLo, Gradient Boosting, Classification, Regression, Deep Learning, Model Tuning
- Computer Vision: Image / Object Recognition, Image Analysis, OpenCV, Facial Recognition, Text Recognition, Tesseract OCR, Paddle OCR
- NLP & Chatbots: Natural Language Processing, Natural Language Generation, ChatGPT, Prompt Engineering, Bot Development
- Cloud & Deployment: AWS, Amazon, Azure, Google Cloud
- Data Processing & Extraction: NLTK, Gensim, Spark, Web Scraping, Selenium