✅ OCR:
● Document Layout detection, Parsing various documents (Invoices, Bills, Resumes, etc)
● Libraries & Deep Learning models: Tesseract, Paddle, Google & AWS OCR APIs, Yolox, Detectron2, Chipper
✅ NLP:
● Text processing and analyzing such as content, entity, semantic extraction, analyzing projects.
● Embedding feature extraction from the raw string(RNN, word2vec), semantic meaning analyzing and classification, similarity analyzing and etc.
● In case of text classification: Text processing(Regular expression), Embedding feature extraction, Classification model based on SVM to classify this feature vectors
● Chatbot development using Rasa, Google Dialogflow
● Libraries: NLTK, Gensim, Spacy