I am a research engineer focused on building on, and surpassing, the latest
and greatest in AI. My work covers NLP, ASR, TTS, VC, OCR, and FR, with a
particular emphasis on large generative models.
I have helped several startups and companies develop algorithms for product
recommendation, product quality assessment, chatbots, stock/crypto trading, OCR
systems, object recognition, video analysis (retrieval, anomaly detection,
moment detection, highlight detection), and biometric identification.
I'm well versed in transformer models and can help you apply fine-tuning and
transfer learning to get the most out of your own data.
If you have text data, I can handle text classification, natural language
understanding, and natural language generation.
If you're looking for semantic search or similarity search, I can create
high-quality embeddings and develop a state-of-the-art semantic search system
using sentence transformers.
If you have a speech corpus, I can train and fine-tune your own ASR/TTS/VC
models.
Core Skills:
- Programming: Python, C/C++/C#, Java, JavaScript
- AI & Machine Learning: TensorFlow, PyTorch, Caffe, YOLO, Gradient Boosting,
  Classification, Regression, Deep Learning, Model Tuning
- Computer Vision: Image / Object Recognition, Image Analysis, OpenCV, Facial
  Recognition, Text Recognition, Tesseract OCR, PaddleOCR
- NLP & Chatbots: Natural Language Processing, Natural Language Generation,
  ChatGPT, Prompt Engineering, Bot Development
- Cloud & Deployment: AWS, Azure, Google Cloud
- Data Processing & Extraction: NLTK, Gensim, Spark, Web Scraping, Selenium