We are looking for a skilled Data Acquisition Engineer to join our data operations team. In this
role, you will be responsible for building, maintaining, and optimizing the systems that power
large-scale data acquisition through web crawling and parsing. You will ensure consistent,
high-quality data ingestion across your assigned geographical region.
Key Responsibilities
Design, develop, and maintain large-scale web scraping pipelines to extract platform data.
Implement scalable, resilient data extraction solutions, including proxy management and techniques for handling anti-bot measures and CAPTCHAs.
Maintain and troubleshoot crawlers, parsers, and data collection workflows so that data is retrieved successfully every day.
Work extensively with JSON/XML data structures for parsing and transformation.
Optimize web scraping workflows for performance, reliability, and efficiency.
Ensure data quality, integrity, and consistency across acquisition processes.
Ideal Candidate Profile
2–3 years of hands-on experience with Python, HTML, and JavaScript.
Demonstrated experience in web scraping or web development.
Strong expertise with Python-based scraping libraries and frameworks such as Scrapy, Selenium, Playwright, and BeautifulSoup.
Understanding of distributed crawling architectures, job scheduling, and automation workflows.
Comfortable working with structured data formats such as JSON, XML, and CSV.
Strong debugging skills and the ability to fix crawlers and parsers quickly when they break.
Familiarity with proxy rotation, user-agent management, and anti-bot strategies is a plus.