AI Software Stack Engineer

Squareroot Consulting Pvt Ltd.

IN Full–Time

As an AI Software Stack Engineer at a Global Semiconductor Company in Bangalore, India, your role involves working on a next-generation software stack for high-performance execution on custom hardware. Your primary mission is to develop an optimized, modular, and deeply integrated platform that ensures industry-leading low-latency systems.

Key Responsibilities:

Validate the AI software stack end-to-end by developing and running representative ML workloads, acting as a proxy for real users.
Write from scratch, port, and optimize AI workloads, identifying gaps, performance bottlenecks, and usability issues.
Develop and maintain a comprehensive suite of validation tests spanning model serving, kernel compilation, and framework integration layers.
Reproduce, triage, and characterize issues across the stack from Python-level framework behavior down to compiled kernel correctness and performance.
Benchmark and profile workloads to track performance regressions and validate optimization improvements across stack releases.
Collaborate closely with compiler, runtime, and framework teams to provide actionable feedback and drive resolution of identified issues.
Design and implement testing frameworks, platforms, and automation infrastructure to enable continuous and scalable validation across the stack.

Qualifications:

BSc or higher in Computer Science, Electrical Engineering, or a related field.
Hands-on experience with ML frameworks such as PyTorch, including model authoring and debugging.
Familiarity with model serving platforms.
Experience writing or modifying GPU kernels using Triton, CUDA, or similar kernel authoring tools.
Strong Python skills and comfort working across multiple layers of a complex software stack.
Systematic debugging mindset with the ability to isolate issues across framework, compiler, and runtime boundaries.

Strong Advantage:

Experience enabling new model architectures or workloads on AI accelerator platforms.
Hands-on experience with performance profiling and benchmarking tools for ML workloads.
Understanding of compiler-generated code behavior and ability to read and reason about IR-level representations.
Experience with CI/CD pipelines and automated test infrastructure for ML systems.
Exposure to GPU or custom accelerators ecosystems.
Familiarity with container-based deployment and orchestration for ML serving. As an AI Software Stack Engineer at a Global Semiconductor Company in Bangalore, India, your role involves working on a next-generation software stack for high-performance execution on custom hardware. Your primary mission is to develop an optimized, modular, and deeply integrated platform that ensures industry-leading low-latency systems.

Key Responsibilities:

Validate the AI software stack end-to-end by developing and running representative ML workloads, acting as a proxy for real users.
Write from scratch, port, and optimize AI workloads, identifying gaps, performance bottlenecks, and usability issues.
Develop and maintain a comprehensive suite of validation tests spanning model serving, kernel compilation, and framework integration layers.
Reproduce, triage, and characterize issues across the stack from Python-level framework behavior down to compiled kernel correctness and performance.
Benchmark and profile workloads to track performance regressions and validate optimization improvements across stack releases.
Collaborate closely with compiler, runtime, and framework teams to provide actionable feedback and drive resolution of identified issues.
Design and implement testing frameworks, platforms, and automation infrastructure to enable continuous and scalable validation across the stack.

Qualifications:

BSc or higher in Computer Science, Electrical Engineering, or a related field.
Hands-on experience with ML frameworks such as PyTorch, including model authoring and debugging.
Familiarity with model serving platforms.
Experience writing or modifying GPU kernels using Triton, CUDA, or similar kernel authoring tools.
Strong Python skills and comfort working across multiple layers of a complex software stack.
Systematic debugging mindset with the ability to isolate issues across framework, compiler, and runtime boundaries.

Strong Advantage:

Experience enabling new model architectures or workloads on AI accelerator platforms.
Hands-on experience with performance profiling and benchmarking tools for ML workloads.
Understanding of compiler-generated code behavior and ability to read and reason about IR-level representations.
Experience with CI/CD pipelines and automated test infrastructure for ML systems.
Exposure to GPU or custom accelerators ecosystems.
Familiarity with container-based deployment and orchestration for ML serving.

Apply with OnJob Browse more jobs

Posted 22 Mar 2026 · Listing from OnJob.io. Create a free profile to apply and see your AI match score.

Related Data & AI jobs

Hand-picked roles that match this listing on skills, category and location — each scored to your profile inside OnJob.

Data Entry Operator Mumbai (Work From Home) | Part Time / Full Time | Freshers Welcome | Typing Jobs Infosure Technologies Mumbai, Maharashtra, India · ₹20,000–₹40,000/mo Gen AI Engineer Work Yatra Bengaluru, Karnataka, India · ₹25,00,000–₹35,00,000 Junior Data Analyst (Fresher) DHANLAXMI TRENDS PRIVATE LIMITED Madhya Pradesh, IN · ₹18,000–₹22,000/mo International Sales Specialist (Digital Marketing & AI Automation) MarTech Union IN · ₹20,000–₹40,000/mo OT Data Analyst and Visualisation Support Officer Visy India Hyderabad, Telangana, IN Data Analyst – Demand Generation Orcapod Consulting Services Secunderabad, Telangana, IN

Explore more on OnJob

Find AI-matched jobs Browse jobs by city Salary guide by role Check your resume (ATS)