Blumetra Solutions logo

Sr. Data Scientist - Product Development Company - Blumetra Solutions

Blumetra Solutions

Hyderabad, Telangana, IN Full–Time

Designation: Sr. Data Scientist

Work Location: Hyderabad (Hybrid Model)

Experience: 5+ Years

Core Responsibilities

  • Multimodal Enrollment Forecasting: Build hierarchical models that forecast "Top-Down" (Country/Study) and "Bottom-Up" (Site-level) enrollment by ingesting real-time screening logs from IRT and site-activation milestones from CTMS.
  • Discontinuation & Attrition Modeling: Implement Survival Analysis (Cox-PH, DeepSurv) and RNNs to predict patient dropout probability using longitudinal data from EDC, which serves as the primary driver of "Maintenance Phase" demand.
  • Demand vs. Supply Optimization: Develop Monte Carlo simulations or Stochastic Optimization models to determine safety stock levels, balancing the variance between predicted enrollment and actual inventory on hand.
  • Dose Titration Logic: Build predictive ML models to anticipate dose escalations or reductions—syncing with IRT dispensing data to ensure the correct kit strength is available at the site before the patient’s next visit.
  • Clinical Data Lake Management:
  • Architect unified data pipelines that join EDC (clinical outcomes/visit data) with IRT (supply/randomisation data).
  • Manage the full ML lifecycle (Tracking, Registry, Serving) to ensure model reproducibility.
  • Build resilient, real-time pipelines for monitoring supply-demand signals and triggering automated alerts for potential stock-outs.

Required Technical Expertise

  • Systems Integration: Proven experience processing and feature-engineering data from EDC (e.g., Medidata Rave, Veeva) and IRT/RTSM platforms.
  • Advanced ML Domains:
  • Time-Series: DeepAR, Temporal Fusion Transformers (TFT), or N-BEATS for non-linear recruitment trends.
  • Survival Analysis: Expert-level experience modeling "Time-to-Event" data to handle censored patient discontinuation patterns.
  • Probabilistic Programming: Experience with PyMC or Gurobi/OR-Tools to solve the "Supply vs. Demand" constraint problem.
  • Data Engineering: Expert-level Python, SQL, and distributed computing for processing large-scale, high-velocity clinical datasets.

Clinical Domain Knowledge (Preferred To Have)

  • Clinical Systems: Deep understanding of the data schemas within IRT/RTSM (Randomisation/Dispensing) and EDC (Patient Visits/Adverse Events).
  • Supply Dynamics: Understanding of "Initial Seeding," "Trigger-based Resupply," and "Dose Titration" within a global trial context.
  • Regulatory Context: Experience working within GxP / CFR Part 11 compliant environments, ensuring model auditability.
  • Standards: Knowledge of CDISC (SDTM/ADaM) data structures is a significant plus.

Skills: ai/ml,python,data science

Posted 7 Mar 2026 · Listing from OnJob.io. Create a free profile to apply and see your AI match score.

Related Data & AI jobs

Hand-picked roles that match this listing on skills, category and location — each scored to your profile inside OnJob.

Explore more on OnJob

Create my free profile — free