Blogs

The Flexibility of PyTorch with the Performance of FlashAttention

By Team PyTorch

Using FlexAttention for inference: backend optimized for decoding and PagedAttention.

By Team PyTorch

A High-Level DSL (PyTorch with Tiles) for Performant and Portable ML Kernels

By Team PyTorch

AMD Collaboration with the University of Michigan offers High Performance Open-Source Solutions to the Bioinformatics Community