AI AND MACHINE LEARNING ENGINEER - Dicetek LLC

Experience: At least 2 years required

Pay: Salary information not included

Type: Full Time

Location: All India

Skills: DevOps, containerization, Docker, Python, Bash, MLOps, backend engineering, DeepStream, GStreamer plugins, nvdsanalytics, nvstreammux, GPU scheduling, NVIDIA GPUs, TensorRT, mixed precision, CUDA toolkit, YOLO, CNNs, LLMs, CI/CD scripting, cloud GPUs, edge devices, Nsight Systems, DCGM, Triton Inference Server, distributed training, PyTorch DDP, DeepSpeed, frontend, REST/gRPC API design

Job Description

We are looking for one AI and Machine Learning Engineer to assist our Emerging Technologies team. The selected candidate will work offshore and should have the following qualifications and experience:

Must Have:

- At least 2 years of experience in MLOps, DevOps, or backend engineering for AI workloads.
- DeepStream 7.x power-user proficiency: pipelines, GStreamer plugins, nvdsanalytics, nvstreammux.
- Strong understanding of containerization (Docker) and GPU scheduling.
- Demonstrated track record of optimizing latency/throughput on NVIDIA GPUs (TensorRT, mixed precision, CUDA toolkit); see the TensorRT sketch after this section.
- Hands-on experience deploying YOLO or similar CNNs in a production environment.
- Familiarity with self-hosting and serving LLMs (vLLM, TensorRT-LLM, or similar), along with quantization, pruning, and distillation; see the vLLM sketch after this section.
- Proficiency in Python and Bash scripting, and confidence in CI/CD scripting.

Nice to Have:

- Exposure to cloud GPUs (AWS/GCP/Azure).
- Experience with edge devices such as NVIDIA Jetson (Xavier, Orin).
- Proficiency in performance profiling with Nsight Systems / DCGM.
- Knowledge of Triton Inference Server internals.
- Familiarity with distributed training (PyTorch DDP, DeepSpeed).
- Basic frontend and REST/gRPC API design skills.

Responsibilities:

- Build and automate inference pipelines:
  - Design, containerize, and deploy CV models (YOLO v8/v11, custom CNNs) with DeepStream 7.x, optimizing for the lowest latency and highest throughput on NVIDIA GPUs.
  - Migrate existing Triton workloads to DeepStream with minimal downtime.
- Serve and optimize large language models:
  - Self-host Llama 3.2, Llama 4, and future LLMs/VLMs on the cluster using best-practice quantization, pruning, and distillation techniques.
  - Expose fast, reliable APIs and monitoring for downstream teams.
- Continuous delivery and observability:
  - Automate build/test/release steps and set up health metrics, logs, and alerts to ensure model stability in production.
  - Efficiently allocate GPU resources across CV and LLM services.
- Model lifecycle support (10-20%): Assist data scientists with occasional fine-tuning or retraining runs and package models for production.
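
As a rough illustration of the TensorRT optimization work called out above, the sketch below builds an FP16 (mixed-precision) engine from an ONNX export of a detector. This is a minimal sketch under stated assumptions, not part of the role's actual pipeline: the file names yolo.onnx and yolo.engine are placeholders, and the calls follow the TensorRT 8.x Python API.

    import tensorrt as trt

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

    # Parse the ONNX export of the detector (file name is a placeholder).
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    with open("yolo.onnx", "rb") as f:
        if not parser.parse(f.read()):
            raise RuntimeError(parser.get_error(0))

    # Enable FP16 (mixed precision) and serialize the optimized engine.
    config = builder.create_builder_config()
    config.set_flag(trt.BuilderFlag.FP16)
    engine = builder.build_serialized_network(network, config)
    with open("yolo.engine", "wb") as f:
        f.write(engine)

FP16 trades a small amount of numerical precision for substantially higher throughput on Tensor Core GPUs, and is the usual first optimization step before INT8 calibration.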
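
Similarly, as a sketch of the LLM self-hosting responsibility, the snippet below runs offline batch inference with vLLM. The checkpoint name is illustrative (any licensed Hugging Face checkpoint would do), and the calls follow vLLM's documented offline-inference API; a production deployment would more likely run vLLM's OpenAI-compatible server behind monitoring and an API gateway.

    from vllm import LLM, SamplingParams

    # Load the model onto the local GPU(s); the checkpoint name is a placeholder.
    llm = LLM(model="meta-llama/Llama-3.2-3B-Instruct")

    # Sampling settings for a quick smoke test of the deployment.
    params = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=128)

    outputs = llm.generate(["Briefly explain what TensorRT does."], params)
    for out in outputs:
        print(out.outputs[0].text)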