Data Engineer – Web Scraping & Automation (Python/AWS)

Digivanti

Mumbai, Maharashtra, IN · Full-Time

Location: Khar West, Mumbai (On-site)

Working Hours: 12:30 PM to 9:00 PM IST, Monday to Friday

Experience: 3-5 years

The Mission:

We are building a proprietary intelligence engine for the world's largest CPG brands. We need a "Data Hunter" to architect the pipelines that extract, clean, and structure organizational and marketplace data at scale.

What You’ll Do:

  • Extract: Build and manage scalable scrapers for LinkedIn, The Org, and major e-commerce marketplaces (Amazon, Walmart, etc.).
  • Bypass: Implement advanced proxy management and anti-bot solutions (Cloudflare, CAPTCHA) to ensure 99.9% uptime.
  • Automate: Use AWS Lambda and S3 to create a self-healing "Data Factory" that refreshes our brand hierarchies daily.
  • Structure: Deliver clean, hierarchical JSON data to our Frontend team for high-stakes visualization.

Requirements:

  • 3-5 years of professional experience in Python (Selenium, Playwright, Scrapy, BeautifulSoup).
  • Proven track record of scraping large-scale, authenticated platforms.
  • Experience with AWS (Lambda, DynamoDB, S3) and Git.
  • You are a "Builder"—you don't just find data; you build systems to own it.

Posted 5 Mar 2026 · Listing from OnJob.io.
