Mlengneer Cuda Sdk

Year    KA, IN, India

Job Description

Job Summary



We're looking for a highly skilled

Machine Learning Engineer

with extensive experience in

CUDA SDK

to enhance, port, and validate

PyTorch-based Large Language Models (LLMs)

for deployment on custom AI processors. This remote role offers the opportunity to work with cutting-edge hardware and collaborate with a cross-functional engineering team across the U.S.

Key Responsibilities:



Port and validate PyTorch-based LLMs on proprietary AI hardware using

CUDA SDK APIs

Extend and optimize CUDA code for compatibility and performance improvements Debug low-level integration issues between CUDA and PyTorch environments Replace off-the-shelf CUDA components with custom implementations as needed Develop tools and frameworks for validation and testing of LLMs Collaborate with AI hardware teams to ensure seamless deployment and performance tuning Profile and tune GPU kernels for speed, memory efficiency, and system scalability

Required Qualifications:



Bachelor's or Master's degree in

Computer Science, Electrical Engineering

, or a related field Strong hands-on experience with

CUDA programming

In-depth knowledge of

PyTorch

and large-scale deep learning models, especially

LLMs

Proficient in

C++ and Python

Experience debugging complex software and performance bottlenecks Solid understanding of

GPU architectures and memory management

Excellent problem-solving skills and communication abilities

Preferred Qualifications:



Familiarity with

AI accelerator architectures

Experience with

TensorFlow

or other deep learning frameworks Exposure to performance tuning tools (e.g., Nsight, nvprof, VTune) Experience working in remote, distributed engineering environments

Personal Attributes:



Passionate about AI/ML performance and innovation Strong attention to detail and ownership mindset Comfortable working independently in a fast-paced, virtual team Eager to solve real-world problems with cutting-edge technologies
Job Type: Full-time

Pay: ?30.00 - ?35.00 per hour

Experience:

CUDA programming: 4 years (Required) C++ and Python: 3 years (Required) PyTorch and LLM: 3 years (Required)
Work Location: In person

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3746086
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Contract
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    KA, IN, India
  • Education
    Not mentioned
  • Experience
    Year