Data Engineer | Web Scraper | Python Developer | Cloud Architect
Thanks for visiting my profile
I'm Tran Tien Van – a Top Rated Plus Data Engineer, Data Architect, and Web Scraper with 5 years of experience building robust, scalable data solutions. Below are the key cutting‑edge technologies I excel in:
Programming Language:
✅ SQL
✅ Python, R, Javascript, and Java
✅ Bash
LLMs and Embeddings for RAG:
✅ Claude
✅ Deepseek
✅ GPT 4.o
✅ Llama 3.0
✅ Pretrained models from Huggingface
LangChain Agents & Workflow Automation
✅ Langgraph
✅ Langfuse
✅ Langchain: (Bedrock, OpenAI, Anthropic, ....)
✅ LangSmith
✅ Multi-modal processing with LangChain
Cloud Architect:
✅ GCP : Bigquery, Looker Studio, Cloud Run, Dataflow, Data proc, Cloud Storage, Firestore, ....
✅ AWS : Lambda Function, EC2, ECS, Fargate, AWS Glue, Athena, Redshift, EventBridge, SNS, SQS, AWS EMR, ....
✅ Azure : Data Factory, PowerBI, Blob Storage, Azure Databricks, Cosmos DB, ....
Database Development
✅ Vector database : Milvus, Pinecone, ChromaDB
✅ SQL databases : Postgres, MySQL, MSSQL, and SQLite,...
✅ NoSQL databases : MongoDB, Redis, Elasticsearch, DynamoDB
✅ Graph database : Neo4j
Web scraping (Facebook, Tiktok, Hubspot, Quickbook, Shopify, Hubspot, ....):
✅ Crawl4AI
✅ Scrapy, BeautifulSoup, Selenium, Playwright, Puppeteer
✅ 3rd party services to bypass Cloudflare, Captcha, and other bot detection
API Integration
✅ Flask, Django, FastAPI
✅ NodeJS
Orchestration tools:
✅ Airflow
✅ Dagster
Apache Software
✅ Spark, Kafka
✅ Hadoop, Hive, Pig
Other Tools:
✅ DBT
✅ Docker/Kubernetes/Helm
✅ Git, Gitlab, and Github
I can guarantee that you'll be satisfied with my work output as I am a very hard-working and diligent person.
I'm looking forward to all the work opportunities!