Looking for a python web scraping software engineer for our AI sales intelligence startup. We will be going to raise a round of funding post launch, and would like to make the experience as mutually beneficial as possible (full time transition potential, references, resume builders, etc).
See job requirements below:
Python Web Scraping Software Engineer
Location: Remote
Primary Working Hours: US East Coast (flexible, some of us are night owls)
Hey there!
We're a scrappy startup on the lookout for a talented Python Web Scraping Software Engineer who's ready to roll up their sleeves and dive into our diverse tech stack. This isn't your typical junior or intern gig—this is a role for someone who's been around the block and knows their stuff. Having done this in production will put you at the top of the list.
What We're Working With:
Tech Stack:
- Render.com
- We use Render.com for deploying our frontend and backend services.
- Programming Languages:
- Python
- JavaScript/TypeScript is a plus (we use these languages for our frontend and Chrome Extension, but this role is focused on Python)
- Java code is a plus (don't worry, we aren't running Java anymore! but we may reference the legacy code to understand how to do certain things in Python)
- Databases:
- Elasticsearch: We update indexes here with scraped data, which triggers email alerts for subscribed users. Hosted on AWS.
- MongoDB: This is our go-to for storing scraped data, user data, and subscription details—handling most of our read/write operations. Hosted on MongoDB Atlas.
- Redis: We use Redis for caching. Hosted on Render.com.
- Backend Services:
- Python Playwright for web scraping. BeautifulSoup. HTML to Markdown. Using LLMs to extract content from scraped data.
- Python FastAPI backend to support the web application (Next.js frontend and React Vite Chrome Extension)
- Frontend:
- Next.js (React) with TypeScript
- Chrome Extension:
What We're Looking For:
- Python Web Scraping with Headless Browser Automation: You've built web scrapers using Playwright (or Puppeteer) and BeautifulSoup. In production. We need to scrape news articles, job listings, web search results, and more.
- Prompt Engineering: You've used LLMs to extract content from scraped data or are willing and excited to learn.
- MongoDB: You've used MongoDB in production to store scraping results or are willing and excited to learn.
- Elasticsearch: You've used Elasticsearch in production to send Email Alerts to users who's Saved Filters match the newly scraped data that we inserted into MongoDB or are willing and excited to learn.
What You'll Get:
- Learn AI: Learn while doing. Our software is using AI to accelerate sales. Enjoy mentorship from our lead AI Engineer.
- Creative Freedom: Play a key role in shaping our application's architecture by implementing new tools and technologies.
- Fast-Paced Environment: If you thrive on learning and implementing new technologies in a dynamic environment, this is the perfect role for you.
- Impact: Your work will directly influence the look, feel, and functionality of our product, giving you a chance to make a significant impact from day one.
Bonus points if you have an active GitHub or contribute to Open Source Software.
Bonus points if you have a cool side project.
Why You'll Love It Here:
Join a team where your creativity, performance mindset, and passion for both Python and Web Scraping technology will be valued and put to use. If you're excited about leading the charge in building beautiful, responsive, and performant applications, we'd love to have you on board.