Job Description
What we are looking for
Wellist is seeking a Senior Web Scraping Engineer to stand up and own the data acquisition operations that power the Wellist Directory. The Wellist Directory is a state-of-the-art product deployed to Wellist hospital and corporate customers to ensure that patients and employees get connected with the right resource at the right time.
The Senior Web Scraping Engineer will take over existing code and be responsible for all phases of the automated data acquisition process. They will bring experience in data extraction and web scraping techniques. They will lead efforts to build and optimize scraping processes and support our vision for helping patients and employees identify local, relevant services that fall within categories of need.
The Senior Engineer will report to Wellist's Head of Operations and complement an existing small but highly professional and effective Engineering Team.
What experience you bring:
- Demonstrated ability to extract data from complex websites with minimal supervision, with a portfolio or examples of past projects.
- Proficiency in Python and JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium.
- Knowledge of asynchronous programming, multithreading, and distributed scraping.
- In-depth knowledge of HTML, CSS, JavaScript, and the DOM.
- Experience designing efficient storage solutions and managing data integrity.
- Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis (a significant plus).
- Ability to work with current AI-based platforms (e.g., OpenAI), with examples of past projects.
- Experience with cloud services (AWS a plus) for deploying and managing scraping jobs at scale.
Location Requirements:
What You'll Be Doing:
- Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency.
- Perform data retrieval tasks, handling complexities such as pagination and dynamic content (i.e. content loaded with AJAX or JavaScript).
- Clean and format extracted data, ensuring it meets format, structure, and quality standards for further analysis or processing.
- Store and manage the scraped data appropriately, optimizing for access speed and data integrity.
- Regularly monitor the scraping processes, identifying and resolving any issues to maintain continuous data flow.
- Work seamlessly with our existing Engineering, Product, and Directory teams to ensure that data is relevant and has a positive impact on our users.
About Wellist
Wellist is a mission-driven digital health company that provides effective consumer navigation at scale. Wellist has historically served patients and care teams at leading health systems (e.g., Yale New Haven Health, HCA) and in 2022 expanded its offerings to support different types of employers. Our employer solution provides an omnichannel benefit front door that increases strategic engagement across the benefits portfolio and meaningfully reduces cost.
Wellist's outstanding contributions have been recognized by organizations including Fast Company (Most Innovative Companies 2024 - Human Resources), MassTLC (Most Innovative Technology of the Year - Healthcare), Rock Health (Top 50 in Digital Health honoree, Diversity Leadership), American Business Awards (Company of the Year, Silver Stevie - Health Products and Service) and BostInno (50onFire, Health and Medicine winner).
Why work here
- A meaningful mission - join a team that makes a difference for patients and families.
- Outstanding benefits - excellent medical and dental insurance coverage, life & disability insurance, up to 3 weeks of PTO, and a 401(k) contribution.
- Opportunities to develop your career - expand your resume with a healthcare startup!