Data Engineer, IB Clean Room

Medscape

IN Full–Time

As a Data Engineer at WebMD Health Corp., an Internet Brands Company, you will design, develop, and support multiple data engineering projects that draw on heterogeneous data sources. Your responsibilities will include:

  • Designing, developing, and supporting multiple data engineering projects with heterogeneous data sources such as files (structured & unstructured), traditional relational databases like Postgres DB, and non-traditional databases like Vertica DB (MPP)
  • Analyzing business requirements, designing and implementing the required data model, and building ETL/ELT strategies
  • Leading data architecture and engineering decision making/planning
  • Translating complex technical subjects into terms that can be understood by both technical and non-technical audiences

Qualifications:

  • 4 years of experience with database development (advanced SQL) on traditional and non-traditional databases
  • 2 years of experience with at least one ETL tool, such as Pentaho, Talend, Informatica, or DataStage
  • 1 year of experience with scheduling/orchestration tools such as Control-M, Autosys, Airflow, or JAMS
  • Basic Python scripting and troubleshooting skills
  • Strong communication and documentation skills are absolutely required for this role as you will be working directly with both technical and non-technical teams
  • Experience working closely with teams outside of IT (e.g. Business Intelligence, Finance, Marketing, Sales)
  • Experience setting up infrastructure and defining architectural requirements

Desired:

  • Working knowledge of big data databases such as Vertica, Snowflake, or Redshift
  • Experience with the Hadoop ecosystem
  • Programming or working with key data components such as Hive, Spark, and Sqoop to move and process terabyte-scale data
  • Experience with GCP/BigQuery
  • Experience with Apache Airflow, or at least an in-depth understanding of how Airflow works
  • Web analytics or Business Intelligence experience is a plus
  • Understanding of Digital Marketing Transactional Data (Click Stream Data, Ad Interaction Data, Email Marketing Data)
  • Understanding of Medical/Clinical data
  • Exposure to or understanding of scheduling tools such as Airflow
  • Experience in Linux environment is preferred but not mandatory

Posted 12 Mar 2026 · Listing from OnJob.io.
