Datenna is a fast-growing tech scale-up combining cutting-edge open-source intelligence (OSINT) and AI technologies to provide governments worldwide with critical insights into China’s techno-economic landscape. Our platform transforms OSINT into actionable insights on China’s Defense Industrial Base and broader technological ecosystem through advanced data processing and analysis.
The Role
As a Python Engineer – Data Acquisition, you will join our Data Acquisition team and play a key role in building and maintaining web scrapers that collect data from diverse online sources across China’s technology and industrial landscape. Your work directly feeds into our OSINT platform, transforming raw web content into clean, structured data that supports high-impact geopolitical intelligence.
Your Mission at Datenna
- Build and maintain web scrapers for a wide variety of sources, including corporate registries, academic databases, and government portals
- Parse HTML, PDF, and other document formats into structured JSON outputs
- Navigate and adapt to anti-scraping measures using techniques such as proxies, browser automation, and request pattern controls
- Collaborate closely with data engineers to define output schemas that integrate seamlessly with downstream data pipelines
- Monitor scraper health, detect failures, and adapt quickly to changes in source websites
- Solid Python skills. You write clean, readable code and are comfortable with common patterns
- Strong focus on code quality: writing tests, using type hints, applying best practices, and keeping code maintainable over the long term
- Hands-on experience with scraping or web testing libraries such as Scrapy, BeautifulSoup, Playwright, Selenium, or similar tools
- Comfortable with HTTP fundamentals, including headers, cookies, sessions, and authentication flows
- Experience parsing messy, real-world data into structured formats
- Familiarity with XPath, CSS selectors, and regex for content extraction
- Proficient with Git and collaborative development workflows
- Enthusiastic about using AI tools to accelerate your work, including code assistants, agentic coding tools, or LLMs for research, workflow optimization, and problem-solving
- Experience with Chinese-language web sources or proficiency in the Chinese language
- Experience with cloud platforms and containerization solutions
- SQL experience
Please note that applicants may be subject to a screening process.
AI is part of how we build
We use AI tools extensively to move faster and raise quality, and we focus our human attention on the work where judgment, creativity, and responsibility matter most. We expect every candidate to be comfortable using AI in their day-to-day work in a responsible manner and to continuously look for better ways to apply it.
What you can expect
- Work on globally impactful projects in geopolitical intelligence
- Lead innovation in OSINT and AI technologies
- Competitive compensation and benefits
- Dynamic, international team environment
- Significant growth opportunities in a scale-up
When you apply to this position, we will process your personal data according to our Recruitment Privacy Notice.