Software Engineer, Acceleration Kernel Development
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. As a Software Engineer on the Acceleration Kernel Development team, you will write low-level code that powers high-efficiency machine learning workloads and optimize software performance. Responsibilities Expertise in building and optimizing compute kernels for parallel ML and high-performance workloads Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth A collaborative mindset to work closely with ML engineers and integrate optimizations into production Ownership of debugging, profiling, and maintaining a fast, reliable low-level software stack Skills Expertise in building and optimizing compute kernels for parallel ML and high-performance workloads Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth A collaborative mindset to work closely with ML engineers and integrate optimizations into production Ownership of debugging, profiling, and maintaining a fast, reliable low-level software stack Comfortable in C/C++ and able to build fast, efficient code from the ground up A developer who loves high performance code, wrangling bits, optimizing compute, and making hardware fly Obsessed with performance and precision, especially in tensors and ML workloads Motivated by complex problems and thrives in collaborative, fast-moving environments Benefits Highly competitive compensation package Benefits Company Overview Tenstorrent develops AI hardware and software solutions for data processing and machine learning application. It was founded in 2016, and is headquartered in Toronto, Ontario, CAN, with a workforce of 501-1000 employees. Its website is