Deploying the DeepSeek R1 distilled reasoning models on AMD Ryzen AI processors and Radeon graphics cards is easy and available now through LM Studio.
DeepSeek R1 Distilled Reasoning Models – AMD Ryzen AI and Radeon
DeepSeek R1 is a recently released ‘reasoning’ model that was distilled into highly capable smaller models. The new reasoning models are a new class of large language models (LLMs) designed to tackle highly complex tasks by using chain-of-thought (CoT) reasoning with the tradeoff of taking longer to respond.
Reasoning models add a ‘thinking’ stage before the final output which you can see by expanding the thinking window before it gives its final answer.

A reasoning model like the DeepSeek R1 Distilled Reasoning Model may first spend thousands of tokens to analyze the problem before giving a final answer. This allows it to be excellent at complex problem-solving tasks involving math and science and approach a problem from all angles before giving its response.
Depending on your AMD hardware, each of these models will offer state-of-the-art reasoning capability on your AMD Ryzen AI processor or Radeon graphics card.
Here’s how to do it:
- Make sure that you are on the 25.1.1 Optional or higher Adrenalin driver
- Download LM Studio 0.3.8 or above from the website
- Install LM Studio and skip the onboarding screen
- Click on the Discover tab
- Choose your DeepSeek R1 Distill. Smaller distills like the Qwen 1.5B offer fast performance while bigger distills will offer superior reasoning capability.
- On the right side, make sure the Q4 K M quantization is selected
- Click “Download”
- Once downloaded, head back to the chat tab
- Select DeepSeek R1 distill from the drop-down menu
- Make sure “manually select parameters” is checked
- In the GPU offload layers – move the slider all the way to the max
- Click model load
- Interact with a reasoning running model completely on your local AMD hardware
Here is a table of the maximum recommended DeepSeek R1 Distill size:
*= AMD recommends running all distills in Q4 K M quantization.
1= Requires Variable Graphics Memory set to Custom: 24GB.
2= Requires Variable Graphics Memory set to High.
*= AMD recommends running all distills in Q4 K M quantization.
1= Lists the maximum supported distill without partial GPU offload.
Source: Gadget Pilipinas
0 Comments