Data Engineer, IB Clean Room

Medscape

IN Full–time
Posted on: March 12, 2026
As a Data Engineer at WebMD Health Corp., an Internet Brands Company, you will play a crucial role in designing, developing, and supporting multiple data engineering projects with heterogeneous data sources. Your responsibilities will include: - Designing, developing, and supporting multiple data engineering projects with heterogeneous data sources such as files (structured & unstructured), traditional relational databases like Postgres DB, and non-traditional databases like Vertica DB (MPP) - Analyzing business requirements, designing and implementing the required data model, and building ETL/ELT strategies - Leading data architecture and engineering decision making/planning - Translating complex technical subjects into terms that can be understood by both technical and non-technical audiences Qualifications: - 4 years of experience with database development (advanced SQL) on any traditional & non-traditional DBs - 2 years on one specific ETL tool, such as Pentaho, Talend, Informatica, DataStage - 1 year of experience in scheduler/orchestration Tools like Control-M, Autosys, Airflow, JAMS - Basic Python scripting and troubleshooting skills - Strong communication and documentation skills are absolutely required for this role as you will be working directly with both technical and non-technical teams - Experience working closely with teams outside of IT (i.e. Business Intelligence, Finance, Marketing, Sales) - Experience with setting up the infrastructure and architectural requirements Desired: - Working knowledge with big data databases such as Vertica, Snowflake, or Redshift - Experience on the Hadoop ecosystem - Programming or working with key data components such as HIVE, Spark, and Sqoop moving and processing terabyte level of data - Experience on GCP/Big Query - Experience in Apache Airflow or at least in-depth understanding of how Airflow works - Web analytics or Business Intelligence a plus - Understanding of Digital Marketing Transactional Data (Click Stream Data, Ad Interaction Data, Email Marketing Data) - Understanding of Medical/Clinical data - Exposure or understanding of scheduling tools such as Airflow - Experience in Linux environment is preferred but not mandatory As a Data Engineer at WebMD Health Corp., an Internet Brands Company, you will play a crucial role in designing, developing, and supporting multiple data engineering projects with heterogeneous data sources. Your responsibilities will include: - Designing, developing, and supporting multiple data engineering projects with heterogeneous data sources such as files (structured & unstructured), traditional relational databases like Postgres DB, and non-traditional databases like Vertica DB (MPP) - Analyzing business requirements, designing and implementing the required data model, and building ETL/ELT strategies - Leading data architecture and engineering decision making/planning - Translating complex technical subjects into terms that can be understood by both technical and non-technical audiences Qualifications: - 4 years of experience with database development (advanced SQL) on any traditional & non-traditional DBs - 2 years on one specific ETL tool, such as Pentaho, Talend, Informatica, DataStage - 1 year of experience in scheduler/orchestration Tools like Control-M, Autosys, Airflow, JAMS - Basic Python scripting and troubleshooting skills - Strong communication and documentation skills are absolutely required for this role as you will be working directly with both technical and non-technical teams - Experience working closely with teams outside of IT (i.e. Business Intelligence, Finance, Marketing, Sales) - Experience with setting up the infrastructure and architectural requirements Desired: - Working knowledge with big data databases such as Vertica, Snowflake, or Redshift - Experience on the Hadoop ecosystem - Programming or working with key data components such as HIVE, Spark, and Sqoop moving and processing terabyte level of data - Experience on GCP/Big Query - Experience in Apache Airflow or at least in-depth understanding of how Airflow works - Web analytics or Business Intelligence a plus - Understanding of Digital Marketing Transactional Data (Click Stream Data, Ad Interaction Data, Email Marketing Data) - Understanding of Medical/Clinical data - Exposure or understanding of scheduling tools such as Airflow - Experience in Linux environment is preferred but not mandatory

About Company

Your next job is waiting

Create your profile and start applying in minutes.