Skip to content

How Intel Uses PyTorch to Empower Generative AI through Intel Arc GPUs #1897

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 12 commits into from
Jan 24, 2025
8 changes: 8 additions & 0 deletions _community_blog/bringing-the-pytorch-community-together.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
---
title: 'Bringing the PyTorch Community Together'
author: Team PyTorch
ext_url: /blog/bringing-the-pytorch-community-together/
date: January 22, 2025
---

As we step into a new year, it’s a great moment to reflect on the incredible community events that made 2024 a memorable year for the PyTorch Foundation. Global meetups, events, and conferences brought the community together to learn, connect, and grow. Here’s a quick recap of the year’s highlights and what to expect in 2025.
8 changes: 8 additions & 0 deletions _community_stories/56.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
---
title: 'How Intel Uses PyTorch to Empower Generative AI through Intel Arc GPUs'
ext_url: /blog/how-intel-uses-pytorch-to-empower-generative-ai-through-intel-arc-gpus/
date: Jan 24, 2025
tags: ["Gaming"]
---
Intel has long been at the forefront of technological innovation, and its recent venture into Generative AI (GenAI) solutions is no exception. With the rise of AI-powered gaming experiences, Intel sought to deliver an accessible and intuitive GenAI inferencing solution tailored for AI PCs powered by Intel’s latest GPUs. By leveraging PyTorch as the backbone for development efforts, Intel successfully launched AI Playground, an open source application that showcases advanced GenAI workloads.

285 changes: 285 additions & 0 deletions _posts/2025-01-21-accelerating-llm-inference.md

Large diffs are not rendered by default.

134 changes: 134 additions & 0 deletions _posts/2025-01-22-bringing-the-pytorch-community-together.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,134 @@
---
layout: blog_detail
title: "Bringing the PyTorch Community Together"
author: "Team PyTorch"
hidden: true
---

As we step into a new year, it’s a great moment to reflect on the incredible community events that made 2024 a memorable year for the PyTorch Foundation. Global meetups, events, and conferences brought the community together to learn, connect, and grow. Here’s a quick recap of the year’s highlights and what to expect in 2025\.

![PyTorch Seattle Meetup (May 23)](/assets/images/community-events-recap/fg5.jpg){:style="width:100%"}

**PyTorch Seattle Meetup (May 23\)**

We hosted a PyTorch Meetup in Seattle in May at the Meta Bellevue Office where Meta, Microsoft, and Google gave technical talks and about 60 attendees participated in discussion and networking.

**PyTorch Docathon 2024 (June 4-20)**

The PyTorch Docathon returned for its third edition, spanning over two weeks in June. This unique event focused on improving PyTorch’s documentation with contributions from community members worldwide. Documentation is the backbone of any successful open source project, and PyTorch’s Docathon fostered inclusivity and collaboration, making it easier for new users to adopt the framework and for experienced developers to maximize its potential. The 2024 Docathon resulted in more than 50 merged pull requests and was a testament to the collaborative spirit of the PyTorch community and its commitment to enhancing accessibility and usability. Watch the [PyTorch Docathon Kickoff](https://youtu.be/2D0aej50umA?feature=shared) on YouTube.

![PyTorch Shanghai Meetup (August 15)](/assets/images/community-events-recap/fg3.png){:style="width:100%"}

#### **PyTorch Shanghai Meetup (August 15\)**

In August, the [PyTorch Shanghai Meetup](https://pytorch.org/blog/pytorch-shanghai-notes/) brought together developers, researchers, and enthusiasts in Shanghai, China. This event served as a platform for knowledge sharing, with engaging talks and networking opportunities. Highlights from the agenda included insights into PyTorch’s latest developments, community-led presentations showcasing innovative use cases, and networking sessions fostering collaboration among attendees.

![PyTorch Conference 2024 (September 18-19)](/assets/images/community-events-recap/fg1.jpg){:style="width:100%"}

#### **PyTorch Conference 2024 (September 18-19)**

The PyTorch Conference in San Francisco was undoubtedly one of the year’s most significant events. This two-day gathering brought together top-tier researchers, developers, and academic communities, fostering collaboration and innovation in machine learning.

![What Made It Special](/assets/images/community-events-recap/fg6.jpeg){:style="width:100%"}

#### **What Made It Special:**

* Keynote speeches from industry leaders and PyTorch maintainers.
* In-depth sessions covering PyTorch’s end-to-end machine learning capabilities.
* Hands-on workshops and breakout sessions.
* A vibrant expo area showcasing cutting-edge tools and applications.
* Startup Showcase where early-stage founders pitched their AI startups to a panel of top venture capitalists.
* DL Compiler Mini-Summit that took a deep dive into the advances in deep learning (DL) compilers that are transforming AI workloads.
* Fine-Tuning Mini-Summit that covered everything from memory efficiency, parameter-efficient fine-tuning and quantization to performance at scale and reproducible evaluations.
* Poster Session showcasing innovations in PyTorch, including model optimization, hardware integration, generative AI, quantization, and tools for enhanced performance and usability, with contributions from industry leaders.

The conference’s focus on fostering collaboration underscored PyTorch’s role as a driving force in the open source ML community. Missed out? You can watch the [PyTorch Conference 2024 Playlist](https://youtube.com/playlist?list=PL_lsbAsL_o2B_znuvm-pDtV_cRhpqZb8l&si=mdoSkqMJYKRlzxlg) to catch any sessions you might have missed.

![GPU MODE IRL Hackathon (September 21)](/assets/images/community-events-recap/fg4.jpg){:style="width:100%"}

#### **GPU MODE IRL Hackathon (September 21\)**

PyTorch sponsored this meetup in person in San Francisco where attendees made friends, watched keynotes, hacked all day, took breaks with afternoon talks, and then hacked all night. We heard about torchao, our new quantization and sparsity library, vLLM which deploys PyTorch models in production, llm.c, and more. Key takeaways included: GPU Mode IRL Hackathon 1st place winner was inspired by PyTorch FlexAttention to improve CUTLASS, NCCL in Triton would help us do distributed programming with a minimal NCCL reimplementation in pure Python, No libtorch pytorch binaries dramatically reduces binary sizes for on device deployments.

![Consumer AI Edge Hackathon (November 22-23)](/assets/images/community-events-recap/fg8.png){:style="width:100%"}

#### **Consumer AI Edge Hackathon (November 22-23)**

The PyTorch team served as mentors and coaches in a Hackathon in Paris, co-sponsored by Hugging Face, Scaleway, and Entrepreneur First, challenging teams to create innovative consumer (B2C) applications leveraging Hugging Face, PyTorch and other open source on-device tools and models. 120+ people across 22 teams hacked for 2 days (and nights\!) building the future of AI-powered on-device solutions based on open source models and tools. Participants created innovative applications, powered by PyTorch, [ExecuTorch](https://github.com/pytorch/executorch/tree/main) and Hugging Face resources, such as an on-device yoga coach, a magical storytelling companion and a Kinect-like experience to mobile phones. The PyTorch team is planning similar events in other geographies in 2025 around innovative on-device AI applications.

![PyTorch Korea User Group Meetup (November 30)](/assets/images/community-events-recap/fg9.png){:style="width:100%"}

#### **PyTorch Korea User Group Meetup (November 30\)**

The PyTorch Korea User Group, founded in 2018, is a community dedicated to introducing PyTorch to Korean-speaking users and growing together. The group began by translating PyTorch 0.3 tutorials into Korean and has since supported PyTorch's growth in Korea. The group focuses on three primary activities:

1. Sharing knowledge for PyTorch learning and application,
2. Sharing insights and experiences in the field of artificial intelligence, and
3. Fostering growth through online and offline networking.

The PyTorch Korea User Group reaches tens of thousands of Korean AI developers every month. If you're interested in their activities, check out these links:

* [PyTorch Korea User Group](https://pytorch.kr)
* [PyTorch Korean Tutorials](https://tutorials.pytorch.kr)
* [PyTorch Korean Community](https://discuss.pytorch.kr)
* [GitHub Repository](https://github.com/PyTorchKorea)
* [YouTube Channel](https://youtube.com/@pytorchkr)

![PyTorch Korea User Group 2025 Events Overview](/assets/images/community-events-recap/fg2.jpeg){:style="width:100%"}

The PyTorch Korea User Group has planned three major activities for the year:

1. **PyTorch CoreSIG**
Since December 2024, this weekly online event has been held every Wednesday afternoon. Led by Kim Hong-Seok, CSO of Rebellions (a PyTorch member company), it provides in-depth knowledge and experience regarding PyTorch internals. Approximately 150 Korean developers participate weekly, reflecting growing interest in PyTorch Core development in Korea.
2. **Offline Meetup**
These meetups provide opportunities to share insights and experiences in PyTorch and artificial intelligence, along with networking. Around 3–4 sessions are planned for this year, focusing on key topics in PyTorch and AI.
3. **Online Community Engagement**
This activity involves sharing and discussing various projects and papers in the AI field. For more information, visit: [https://discuss.pytorch.kr](https://discuss.pytorch.kr).

#### **Open Source AI Night at NeurIPS 2024 (December 10\)**

The PyTorch Foundation co-hosted a social event at NeurIPS along with The Fin AI and Open Finance Foundation that featured engaging discussions on open source AI and applications in finance.

![PyTorch Webinars](/assets/images/community-events-recap/fg7.jpeg){:style="width:100%"}

**PyTorch Webinars**

Throughout 2024, PyTorch hosted the following virtual webinars:

Expert Exchanges:

* [How does batching work on modern CPUs?](https://www.youtube.com/live/HTcnp9NEHGY?feature=shared)
* [DistServe: disaggregating prefill and decoding for goodput-optimized LLM inference](https://www.youtube.com/live/Bh-jlh5vlF0?feature=shared)
* [Efficient Streaming Language Models with Attention Sinks](https://www.youtube.com/live/RnM84Sv9WpA?feature=shared)
* [Adapting open source models with Open-Instruct and Tulu](https://www.youtube.com/live/e1qUJFAo10s?feature=shared)
* [Efficient Generative Models: From Sparse to Distributed Inference](https://www.youtube.com/live/Eqg0VIiWrgM?feature=shared)

Summer Series:

* [Using PyTorch for Monocular Depth Estimation Webinar](https://youtu.be/xf2QgioY370?feature=shared)
* [Accelerating LLM family of models on Arm Neoverse based Graviton AWS processors with KleidiAI](https://youtu.be/NeHIhQWewug?feature=shared)
* [torch.compile: The Missing Manual](https://www.youtube.com/live/rew5CSUaIXg?feature=shared)

Release Live Q&As:

* [PyTorch 2.4: Live Q&A](https://www.youtube.com/live/ry_QgUIYX1E?feature=shared)
* [PyTorch 2.5 Live Q&A](https://www.youtube.com/live/B3IgXpl4xt4?feature=shared)

Live Webinars:

* [PyTorch Documentary Virtual Premiere](https://www.youtube.com/watch?v=EjgTv6aSeqk)
* [Using PyTorch to Help Predict Wildfires](https://www.youtube.com/watch?v=gSC_IHyx0IM)
* [Seismic Data to Subsurface Models with OpenFWI: Training an AI Model with PyTorch](https://www.youtube.com/watch?v=zvk3Rr-OjU0)
* [Dinosaur Bone Hunting with Intel AI](https://www.youtube.com/watch?v=w4JmPkqnD0E)

Each of these events underscored the importance of collaboration and community engagement in advancing AI research and applications. Thank you to everyone who participated, organized, and supported these events—your contributions make all the difference\!

---

### **Looking Ahead**

2024 was packed with opportunities to connect, learn, and contribute, and there will be even more ways to connect with the PyTorch community in 2025\.

Mark your calendar\! The [PyTorch Conference](https://events.linuxfoundation.org/pytorch-conference-2025/) is returning to San Francisco on October 22-23, 2025\. Get ready for an exciting event filled with technical deep dives, exciting announcements, insightful sessions, and enhanced opportunities for community collaboration.

Stay tuned for more upcoming events and opportunities to get involved by [subscribing to our newsletter](https://pytorch.org/newsletter).
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
---
layout: blog_detail
title: "How Intel Uses PyTorch to Empower Generative AI through Intel Arc GPUs"
author: "Team PyTorch"
---

Intel has long been at the forefront of technological innovation, and its recent venture into Generative AI (GenAI) solutions is no exception. With the rise of AI-powered gaming experiences, Intel sought to deliver an accessible and intuitive GenAI inferencing solution tailored for AI PCs powered by Intel’s latest GPUs. By leveraging PyTorch as the backbone for development efforts, Intel successfully launched AI Playground, an open source application that showcases advanced GenAI workloads.

**The Business Challenge**

Our goal was to deliver an accessible and intuitive GenAI inferencing solution tailored for AI PCs powered by Intel. We recognized the need to showcase the capabilities of the latest GenAI workloads on our newest line of client GPUs. To address this, we developed a starter application, [AI Playground](https://github.com/intel/ai-playground), which is open source and includes a comprehensive developer reference sample available on GitHub using PyTorch. This application seamlessly integrates image generation, image enhancement, and chatbot functionalities, using retrieval-augmented generation (RAG) features, all within a single, user-friendly installation package. This initiative not only demonstrates the functionality of these AI workloads but also serves as an educational resource for the ecosystem, guiding developers on effectively leveraging the [Intel® Arc™ GPU](https://www.intel.com/content/www/us/en/products/details/discrete-gpus/arc.html) product line for advanced AI applications. This solution leverages Intel® Arc™ Xe Cores and [Xe Matrix Extensions (XMX)](https://www.intel.com/content/www/us/en/support/articles/000091112/graphics.html) for accelerating inferencing.

![AI Playground](/assets/images/intel-case-study/fg1.png){:style="width:100%"}

**How Intel Used PyTorch**

PyTorch is the core AI framework for AI Playground. We extensively leverage PyTorch's eager mode, which aligns perfectly with the dynamic and iterative nature of our generative models. This approach not only enhances our development workflow but also enables us to rapidly prototype and iterate on advanced AI features. By harnessing PyTorch’s powerful capabilities, we have created a robust reference sample that showcases the potential of GenAI on Intel GPUs in one cohesive application.

**Solving AI Challenges with PyTorch**

PyTorch has been instrumental in addressing our AI challenges by providing a robust training and inference framework optimized for discrete and integrated Intel Arc GPU product lines. Choosing PyTorch over alternative frameworks or APIs was crucial. Other options would have necessitated additional custom development or one-off solutions, which could have significantly slowed our time to market and limited our feature set. With PyTorch, we leveraged its flexibility and ease of use, allowing our team to focus on innovation through experimentation, rather than infrastructure. The integration of [Intel® Extension for PyTorch](https://www.intel.com/content/www/us/en/developer/tools/oneapi/optimization-for-pytorch.html#gs.j6azz7) further enhanced performance by optimizing computational efficiency and enabling seamless scaling on Intel hardware, ensuring that our application ran faster and more efficiently.

**A Word from Intel**

*With PyTorch as the backbone of our AI Playground project, we achieved rapid development cycles that significantly accelerated our time to market. This flexibility enabled us to iteratively enhance features and effectively align with the commitments of our hardware launches in 2024\.*

*\-Bob Duffy, AI Playground Product Manager*

![PyTorch Case Stidu](/assets/images/intel-case-study/fg2.png){:style="width:100%"}

**The Benefits of Using PyTorch**

The biggest benefit of using PyTorch for us is the large PyTorch ecosystem, which connects us with an active and cooperative community of developers. This collaboration has facilitated the seamless deployment of key features from existing open source projects, allowing us to integrate the latest GenAI capabilities into AI Playground. Remarkably, we accomplished this with minimal re-coding, ensuring that these advanced features are readily accessible on Intel Arc GPUs.

**Learn More**

For more information about Intel’s AI Playground and collaboration with PyTorch, visit the following links:

* [PyTorch Optimizations from Intel](https://www.intel.com/content/www/us/en/developer/tools/oneapi/optimization-for-pytorch.html#gs.j8h6mc)
* [AI Playground GitHub](https://github.com/intel/ai-playground)
* [AI Playground](https://intel.com/ai-playground)
* [AI Playground Deep Dive Video](https://youtu.be/cYPZye1MC6U)
* [Intel GPU Support Now Available in PyTorch 2.5](https://pytorch.org/blog/intel-gpu-support-pytorch-2-5/)
Binary file added assets/images/accelerating-llm-inference/fg1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/accelerating-llm-inference/fg2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/accelerating-llm-inference/fg3.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/accelerating-llm-inference/fg4.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/accelerating-llm-inference/fg5.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg1.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg2.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg3.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg4.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg5.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg6.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg7.jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg8.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/community-events-recap/fg9.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/intel-case-study/fg1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added assets/images/intel-case-study/fg2.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.