About the job
We are seeking a Machine Learning Engineer to join the core R&D team of TRAE, responsible for model training, optimization, quantization, and deployment. You will work closely with top engineers and researchers to explore and implement LLM training and deployment for the software engineering, driving the continuous evolution of TRAE’s intelligent engineering capabilities.
Responsibilities
- Design, train, and fine-tune large language models (LLMs) that support TRAE’s core reasoning and code generation capabilities.
- Build efficient, stable, and scalable model training and evaluation pipelines.
- Collaborate with infrastructure and product teams to deploy and monitor models efficiently on GPU clusters.
- Continuously optimize models for latency, throughput, and accuracy.
- Stay up to date with and apply cutting-edge techniques in large model optimization and inference acceleration.
Qualifications
Minimum
- Bachelor’s degree or above in Computer Science, Electrical Engineering, Mathematics, or a related field.
- Solid foundation in machine learning, deep learning, and optimization algorithms.
- Proficiency in PyTorch or TensorFlow, with programming skills in Python and C++/CUDA.
- Experience with large-scale distributed training, mixed-precision training, or model parallelism.
- Hands-on experience with model quantization, pruning, or CUDA-based deployment optimization.
- Passionate about building efficient, production-ready AI systems and advancing the future of AI-driven software development.
Preferred
- PhD in AI related research fields.
- Hands-on experience in developing and deploying AI systems in a real-world setting.