Thanks for visiting my profile! I'm Tran Tien Van – a Top Rated Plus Data Engineer, Data Architect, and Web Scraper with 5 years of experience building robust, scalable data solutions. Below is a concise overview of the cutting‑edge technologies I excel in:
Programming Languages:
• SQL
• Python, R, JavaScript, Java
• Bash
LLMs & Embeddings for RAG:
• Claude
• Deepseek
• GPT-4.0
• Llama 3.0
• Pretrained models from Huggingface
LangChain & Workflow Automation:• Langgraph• Langfuse• LangChain (including Bedrock, OpenAI, Anthropic, etc.)• LangSmith• Multi‑modal processing with LangChain
Cloud Architecture:
• GCP: BigQuery, Looker Studio, Cloud Run, Dataflow, Dataproc, Cloud Storage, Firestore
• AWS: Lambda, EC2, ECS, Fargate, AWS Glue, Athena, Redshift, EventBridge, SNS, SQS, EMR
• Azure: Data Factory, PowerBI, Blob Storage, Azure Databricks, Cosmos DB
Database Development:
• Vector Databases: Milvus, Pinecone, ChromaDB
• SQL Databases: Postgres, MySQL, MSSQL, SQLite
• NoSQL Databases: MongoDB, Redis, Elasticsearch, DynamoDB• Graph Database: Neo4j
Web Scraping:
• Platforms: Facebook, TikTok, HubSpot, QuickBooks, Shopify, etc.
• Tools: Crawl4AI, Scrapy, BeautifulSoup, Selenium, Playwright, Puppeteer
• Expertise in bypassing Cloudflare, Captcha, and other bot detections
API Integration:
• Flask, Django, FastAPI, NodeJS
Orchestration Tools:
• Airflow, Dagster
Apache Software:
• Spark, Kafka, Hadoop, Hive, Pig
Other Tools:
• DBT
• Docker, Kubernetes, Helm
• Git, GitLab, GitHub
I’m committed to delivering high-quality results through hard work and diligence. I look forward to exploring exciting opportunities to collaborate and innovate!