Sr. Data Scientist - Product Development Company - Blumetra Solutions

Blumetra Solutions

Hyderabad ,Telangana , IN Full–time
Posted on: March 07, 2026
Designation: Sr. Data Scientist Work Location: Hyderabad (Hybrid Model) Experience: 5+ Years Core Responsibilities • Multimodal Enrollment Forecasting: Build hierarchical models that forecast "Top-Down" (Country/Study) and "Bottom-Up" (Site-level) enrollment by ingesting real-time screening logs from IRT and site-activation milestones from CTMS. • Discontinuation & Attrition Modeling: Implement Survival Analysis (Cox-PH, DeepSurv) and RNNs to predict patient dropout probability using longitudinal data from EDC, which serves as the primary driver of "Maintenance Phase" demand. • Demand vs. Supply Optimization: Develop Monte Carlo simulations or Stochastic Optimization models to determine safety stock levels, balancing the variance between predicted enrollment and actual inventory on hand. • Dose Titration Logic: Build predictive ML models to anticipate dose escalations or reductions—syncing with IRT dispensing data to ensure the correct kit strength is available at the site before the patient’s next visit. • Clinical Data Lake Management: • Architect unified data pipelines that join EDC (clinical outcomes/visit data) with IRT (supply/randomisation data). • Manage the full ML lifecycle (Tracking, Registry, Serving) to ensure model reproducibility. • Build resilient, real-time pipelines for monitoring supply-demand signals and triggering automated alerts for potential stock-outs. Required Technical Expertise • Systems Integration: Proven experience processing and feature-engineering data from EDC (e.g., Medidata Rave, Veeva) and IRT/RTSM platforms. • Advanced ML Domains: • Time-Series: DeepAR, Temporal Fusion Transformers (TFT), or N-BEATS for non-linear recruitment trends. • Survival Analysis: Expert-level experience modeling "Time-to-Event" data to handle censored patient discontinuation patterns. • Probabilistic Programming: Experience with PyMC or Gurobi/OR-Tools to solve the "Supply vs. Demand" constraint problem. • Data Engineering: Expert-level Python, SQL, and distributed computing for processing large-scale, high-velocity clinical datasets. Clinical Domain Knowledge (Preferred To Have) • Clinical Systems: Deep understanding of the data schemas within IRT/RTSM (Randomisation/Dispensing) and EDC (Patient Visits/Adverse Events). • Supply Dynamics: Understanding of "Initial Seeding," "Trigger-based Resupply," and "Dose Titration" within a global trial context. • Regulatory Context: Experience working within GxP / CFR Part 11 compliant environments, ensuring model auditability. • Standards: Knowledge of CDISC (SDTM/ADaM) data structures is a significant plus. Skills: ai/ml,python,data science

About Company

Blumetra Solutions

Telangana ,IN

https://blumetra.com

Your next job is waiting

Create your profile and start applying in minutes.