Software Engineer, Acceleration Kernel Development
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. As a Software Engineer on the Acceleration Kernel Development team, you’ll write low-level code that optimizes high-efficiency machine learning workloads, focusing on performance and precision.

Responsibilities
Writing low-level code that directly powers high-efficiency machine learning workloads, optimizing every cycle, every memory move, every instruction
Building and optimizing compute kernels for parallel ML and high-performance workloads
Analyzing and tuning instruction-level performance across latency, memory, and bandwidth
Collaborating closely with ML engineers to integrate optimizations into production
Debugging, profiling, and maintaining a fast, reliable low-level software stack

Skills
Expertise in building and optimizing compute kernels for parallel ML and high-performance workloads
Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth
A collaborative mindset to work closely with ML engineers and integrate optimizations into production
Ownership of debugging, profiling, and maintaining a fast, reliable low-level software stack
Comfortable in C/C++ and able to build fast, efficient code from the ground up
Obsessed with performance and precision, especially in tensors and ML workloads
Motivated by complex problems and thriving in collaborative, fast-moving environments

Benefits
Highly competitive compensation package

Company Overview
Tenstorrent develops AI hardware and software solutions for data processing and machine learning applications. It was founded in 2016 and is headquartered in Toronto, Ontario, Canada, with a workforce of 501-1000 employees. Its website is