<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>strix halo Archives - Efison Lisan Teknologi</title>
	<atom:link href="https://efisonlt.com/tag/strix-halo/feed/" rel="self" type="application/rss+xml" />
	<link>https://efisonlt.com/tag/strix-halo/</link>
	<description>Computation for Everybody</description>
	<lastBuildDate>Thu, 28 May 2026 10:38:46 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://efisonlt.com/wp-content/uploads/2020/03/cropped-efison_logo_orange-skuer-32x32.png</url>
	<title>strix halo Archives - Efison Lisan Teknologi</title>
	<link>https://efisonlt.com/tag/strix-halo/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Our Experience with MSI EdgeXpert &#8211; Nvidia DGX Spark GB10</title>
		<link>https://efisonlt.com/our-experience-with-msi-edgexpert-nvidia-dgx-spark-gb10/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=our-experience-with-msi-edgexpert-nvidia-dgx-spark-gb10</link>
		
		<dc:creator><![CDATA[Laatansa Imroni]]></dc:creator>
		<pubDate>Thu, 28 May 2026 07:48:40 +0000</pubDate>
				<category><![CDATA[Review]]></category>
		<category><![CDATA[ai]]></category>
		<category><![CDATA[cuda]]></category>
		<category><![CDATA[dgx]]></category>
		<category><![CDATA[imagegen]]></category>
		<category><![CDATA[llm]]></category>
		<category><![CDATA[nvidia]]></category>
		<category><![CDATA[qwen]]></category>
		<category><![CDATA[r9700]]></category>
		<category><![CDATA[radeon]]></category>
		<category><![CDATA[rtx]]></category>
		<category><![CDATA[spark]]></category>
		<category><![CDATA[strix halo]]></category>
		<guid isPermaLink="false">https://efisonlt.com/?p=1974</guid>

					<description><![CDATA[<p>&#8220;Agentic AI&#8221;. That&#8217;s the key phrase. Everybody is talking about how we should utilise AI on our life. Invest on the skill, buy your own hardware, they said. And this one is probably the best gateway hardware out there. Behold! A mini AI supercomputer (Nvidia&#8217;s words, not mine) Such a big statement, I thought. But&#8230;&#160;<a href="https://efisonlt.com/our-experience-with-msi-edgexpert-nvidia-dgx-spark-gb10/" rel="bookmark">Read More &#187;<span class="screen-reader-text">Our Experience with MSI EdgeXpert &#8211; Nvidia DGX Spark GB10</span></a></p>
<p>The post <a href="https://efisonlt.com/our-experience-with-msi-edgexpert-nvidia-dgx-spark-gb10/">Our Experience with MSI EdgeXpert &#8211; Nvidia DGX Spark GB10</a> appeared first on <a href="https://efisonlt.com">Efison Lisan Teknologi</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">&#8220;Agentic AI&#8221;. That&#8217;s the key phrase.</p>



<p class="wp-block-paragraph">Everybody is talking about how we should utilise AI on our life. Invest on the skill, buy your own hardware, they said.</p>



<p class="wp-block-paragraph">And this one is probably the best gateway hardware out there.</p>



<h2 class="wp-block-heading">Behold! A mini AI supercomputer (<a href="https://nvidianews.nvidia.com/news/nvidia-dgx-spark-arrives-for-worlds-ai-developers">Nvidia&#8217;s words</a>, not mine)</h2>



<p class="wp-block-paragraph">Such a big statement, I thought. But after a short exploration, this thing is really a minified supercomputer in many different aspects. It has this enterprise-level of software support, clustering, network interface, and also the emphasis on being an AI server, not just a mere mini PC.</p>



<p class="wp-block-paragraph">I won&#8217;t rewrite the full specification table here. You can see it yourself on <a href="https://ipc.msi.com/product_detail/Industrial-Computer-Box-PC/AI-Supercomputer/EdgeXpert-MS-C931">MSI EdgeXpert</a> page or <a href="https://www.nvidia.com/en-sg/products/workstations/dgx-spark/">Nvidia DGX Spark page</a>.</p>



<p class="wp-block-paragraph">But let me tell you all the interesting parts.</p>



<h2 class="wp-block-heading">At a Glance</h2>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db151be&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db151be" class="aligncenter size-large wp-lightbox-container"><img fetchpriority="high" decoding="async" width="1024" height="705" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-5-1024x705.png" alt="" class="wp-image-1982" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-5-1024x705.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-5-300x207.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-5-768x529.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-5.png 1271w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Ignore the scratches. Not mine to blame. It has almost a half of its height dedicated for an intake grill.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db15c17&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db15c17" class="aligncenter size-large wp-lightbox-container"><img decoding="async" width="1024" height="686" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-3-1024x686.png" alt="" class="wp-image-1980" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-3-1024x686.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-3-300x201.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-3-768x515.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-3.png 1300w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Back view. On top is the exhaust grill. All the physical interfaces are on the backside. Yes, including the power button.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db16374&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db16374" class="aligncenter size-full wp-lightbox-container"><img decoding="async" width="810" height="551" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-1.png" alt="USB ports" class="wp-image-1978" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-1.png 810w, https://efisonlt.com/wp-content/uploads/2026/05/image-1-300x204.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-1-768x522.png 768w" sizes="(max-width: 810px) 100vw, 810px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">4*USB type-C, no USB type-A in sight.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db16a44&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db16a44" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="711" height="597" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-4.png" alt="" class="wp-image-1981" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-4.png 711w, https://efisonlt.com/wp-content/uploads/2026/05/image-4-300x252.png 300w" sizes="(max-width: 711px) 100vw, 711px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">The ConnectX-7 200 Gbps ports for a better clustering support compared to any mini PC I&#8217;ve ever seen.</figcaption></figure>
</div>


<p class="wp-block-paragraph">The omission of power LED or any power-on indicator is kinda heartbreaking, tho.</p>



<h2 class="wp-block-heading">Hardware Overview</h2>



<p class="wp-block-paragraph">This system is using a custom multi-chip package SoC called Nvidia GB10. The GB10 is consisted of (should be) Grace Arm CPU and Blackwell GPU. Nvidia didn&#8217;t really mention the specific name of both the CPU and GPU so I&#8217;ll put the specification below to represent the number of the compute unit count instead. </p>



<figure class="wp-block-table"><table><thead><tr><th>Component</th><th class="has-text-align-left" data-align="left">Specification</th></tr></thead><tbody><tr><td><strong>CPU</strong></td><td class="has-text-align-left" data-align="left">10-core Arm Cortex-X925, 10-core Arm Cortex-A725</td></tr><tr><td><strong>GPU</strong></td><td class="has-text-align-left" data-align="left">48-SM Blackwell</td></tr><tr><td><strong>Storage</strong></td><td class="has-text-align-left" data-align="left">1 TB NVMe</td></tr><tr><td><strong>Memory</strong></td><td class="has-text-align-left" data-align="left">128 GB 256-bit LPDDR5x unified</td></tr><tr><td><strong>Memory Bandwidth</strong></td><td class="has-text-align-left" data-align="left">273 GB/s</td></tr><tr><td><strong>Ethernet</strong></td><td class="has-text-align-left" data-align="left">10 GbE RJ-45</td></tr><tr><td><strong>High-Speed Network</strong></td><td class="has-text-align-left" data-align="left">ConnectX-7 NIC @ 200 Gbps</td></tr><tr><td><strong>Wireless Connection</strong></td><td class="has-text-align-left" data-align="left">Wifi 7, Bluetooth 5.4</td></tr><tr><td><strong>Power Supply</strong></td><td class="has-text-align-left" data-align="left">240 W AC-to-DC power brick via USB type-C</td></tr><tr><td><strong>BMC/IPMI</strong></td><td class="has-text-align-left" data-align="left">None</td></tr></tbody></table></figure>



<p class="wp-block-paragraph">As it stands, my main complaint is the lack of baseboard management controller (BMC) to remotely monitor the state of the hardware, connect to a virtual KVM to the server, and to control the power state, exclusive from the OS. This way, you can only get the monitoring of the unit <strong>after</strong> you boot into the OS. Well, unfortunately Nvidia decided it is not supercomputer enough to bear the right of having a BMC.</p>



<h2 class="wp-block-heading">Software Overview</h2>



<p class="wp-block-paragraph">This one is a more interesting one. The OS is called <a href="https://docs.nvidia.com/dgx/dgx-spark/dgx-os.html">DGX OS</a> officially, but it&#8217;s actually an Ubuntu 24.04 with some &#8220;custom sauce&#8221; from Nvidia.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db17172&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db17172" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="815" height="412" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-6.png" alt="" class="wp-image-1983" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-6.png 815w, https://efisonlt.com/wp-content/uploads/2026/05/image-6-300x152.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-6-768x388.png 768w" sizes="(max-width: 815px) 100vw, 815px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">neofetch of DGX Spark.</figcaption></figure>
</div>


<p class="wp-block-paragraph">The &#8220;custom sauce&#8221; lies on the repository being used for delivering packages optimised for the GB10 which are maintained and deployed by Nvidia.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db17785&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db17785" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="697" height="225" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-7.png" alt="" class="wp-image-1984" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-7.png 697w, https://efisonlt.com/wp-content/uploads/2026/05/image-7-300x97.png 300w" sizes="(max-width: 697px) 100vw, 697px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Nvidia repo for DGX Spark.</figcaption></figure>
</div>


<p class="wp-block-paragraph">It comes with GNOME Desktop Environment if you fancy using it as a normal desktop PC. You can install various software through command line or Software Center in a typical Ubuntu desktop fashion.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db17d6f&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db17d6f" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="576" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-8-1024x576.png" alt="" class="wp-image-1985" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-8-1024x576.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-8-300x169.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-8-768x432.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-8-1536x865.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/image-8-2048x1153.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Webcam works, too, if you feel like wanting to plug a webcam.</figcaption></figure>
</div>


<p class="wp-block-paragraph">As it is a mini server, it has this cute quirk, which is you&#8217;ll find no sleep (or hibernate) option. You can only set the power saving to blank the screen after being left idle for some time. Well you don&#8217;t want your server to suddenly sleep in the middle of serving, no?</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db18279&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db18279" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="740" height="391" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-9.png" alt="" class="wp-image-1986" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-9.png 740w, https://efisonlt.com/wp-content/uploads/2026/05/image-9-300x159.png 300w" sizes="(max-width: 740px) 100vw, 740px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">No sleep. Sleep is for the weak. Excuse my keyboard (Bridge75) for the photobomb.</figcaption></figure>
</div>


<h2 class="wp-block-heading">Being an AI Server: Default Software</h2>



<p class="wp-block-paragraph">After first boot and setup, you&#8217;ll get yourself a default dashboard called DGX Dashboard which can be accessed by a browser. The default address is <a href="http://localhost:11000">http://localhost:11000</a> which can only be accessed locally from the machine itself. But you can easily redirect it with a proxy or tunneling just fine.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1893b&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1893b" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="760" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-10-1024x760.png" alt="" class="wp-image-1990" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-10-1024x760.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-10-300x223.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-10-768x570.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-10.png 1247w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">DGX Dashboard</figcaption></figure>
</div>


<p class="wp-block-paragraph">As can be seen above you can easily see simple statuses of the hardware and launch a JupyterLab which would then can be used as PyTorch or TensorFlow development environment just as easy. From there you can use the underlying hardware both CPU and GPU to develop or test your various Jupyter workflow.</p>



<p class="wp-block-paragraph">From the same dashboard we can click on the Updates menu to&#8230; update. Nvidia recommends updating from this dashboard instead of using CLI, and you will also get your firmware or UEFI update from this same interface.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db18ea8&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db18ea8" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="761" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-11-1024x761.png" alt="" class="wp-image-1991" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-11-1024x761.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-11-300x223.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-11-768x570.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-11.png 1244w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">DGX Dashboard: Updates</figcaption></figure>
</div>


<p class="wp-block-paragraph">Continuing right there is a Settings menu which can be used to change the hostname and enable/disable telemetry.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db19440&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db19440" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="761" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-12-1024x761.png" alt="" class="wp-image-1992" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-12-1024x761.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-12-300x223.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-12-768x571.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-12.png 1245w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">DGX Dashboard: Settings</figcaption></figure>
</div>


<p class="wp-block-paragraph">The next three buttons on the right are external links in which you can interact with:</p>



<h3 class="wp-block-heading"><strong><a href="https://docs.nvidia.com/dgx/dgx-spark/index.html">Docs</a>:</strong> Product Documentation and User Manual</h3>



<p class="wp-block-paragraph">Here you can look for everything you need to know about DGX Spark down from its hardware, software, configuration, release notes, system update guide, how to get support, to legal information. This is as complete as you want it to be. Personally I found that this is the most complete manual ever written for a mini PC. Oh wait it&#8217;s a mini supercomputer, not just a mini PC.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db199a9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db199a9" class="aligncenter wp-lightbox-container"><img loading="lazy" decoding="async" width="1001" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-27-1001x1024.png" alt="" class="wp-image-2029" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-27-1001x1024.png 1001w, https://efisonlt.com/wp-content/uploads/2026/05/image-27-293x300.png 293w, https://efisonlt.com/wp-content/uploads/2026/05/image-27-768x785.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-27.png 1280w" sizes="(max-width: 1001px) 100vw, 1001px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Docs button will bring you to DGX Spark User Guide site.</figcaption></figure>
</div>


<h3 class="wp-block-heading"><strong><a href="https://forums.developer.nvidia.com/c/accelerated-computing/dgx-spark-gb10/dgx-spark-gb10/721">Forums</a>:</strong> Discussion Boards</h3>



<p class="wp-block-paragraph">Do you ever feel like missing a bulletin boards? Or interacting with a bunch of geniuses around the world who have the same systems? Afraid not! This forum is as lively as it can, with Nvidia moderators also actively participates in the discussions. Maybe someone has a crazy idea, maybe someone has a solution for a particular problem, you name it. Lovely.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db19eaa&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db19eaa" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1001" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-28-1001x1024.png" alt="" class="wp-image-2030" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-28-1001x1024.png 1001w, https://efisonlt.com/wp-content/uploads/2026/05/image-28-293x300.png 293w, https://efisonlt.com/wp-content/uploads/2026/05/image-28-768x785.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-28.png 1280w" sizes="(max-width: 1001px) 100vw, 1001px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Lively forum.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1a32e&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1a32e" class="aligncenter wp-lightbox-container"><img loading="lazy" decoding="async" width="1001" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-29-1001x1024.png" alt="" class="wp-image-2031" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-29-1001x1024.png 1001w, https://efisonlt.com/wp-content/uploads/2026/05/image-29-293x300.png 293w, https://efisonlt.com/wp-content/uploads/2026/05/image-29-768x785.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-29.png 1280w" sizes="(max-width: 1001px) 100vw, 1001px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Maybe a little bit too lively. But what is life without genius madlads around?</figcaption></figure>
</div>


<h3 class="wp-block-heading"><strong><a href="https://build.nvidia.com/spark/">Resources</a>:</strong> Guides for Deploying AI Softwares</h3>



<p class="wp-block-paragraph">For me this is one of the killer features. This is the guide for building, running, and deploying any popular AI software, some with prebuilt container images or models built by Nvidia themselves. Again, personally this has been the best resources provided by first-party for users to have the least possible headache of running any AI software they want to use.</p>



<figure class="wp-block-video aligncenter"><video height="1440" style="aspect-ratio: 2560 / 1440;" width="2560" controls src="https://efisonlt.com/wp-content/uploads/2026/05/DGX-spark-resources-overview.mp4"></video><figcaption class="wp-element-caption">They encourage you to build any AI software in the easiest way possible.</figcaption></figure>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1aae4&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1aae4" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="604" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-30-604x1024.png" alt="" class="wp-image-2033" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-30-604x1024.png 604w, https://efisonlt.com/wp-content/uploads/2026/05/image-30-177x300.png 177w, https://efisonlt.com/wp-content/uploads/2026/05/image-30.png 699w" sizes="(max-width: 604px) 100vw, 604px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Example: vLLM serving guide.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1af25&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1af25" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="690" height="280" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-31.png" alt="" class="wp-image-2034" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-31.png 690w, https://efisonlt.com/wp-content/uploads/2026/05/image-31-300x122.png 300w" sizes="(max-width: 690px) 100vw, 690px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">They have their ready-to-run models, if you prefer.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1b39e&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1b39e" class="aligncenter is-resized wp-lightbox-container"><img loading="lazy" decoding="async" width="690" height="554" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-32.png" alt="" class="wp-image-2035" style="width:690px;height:auto" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-32.png 690w, https://efisonlt.com/wp-content/uploads/2026/05/image-32-300x241.png 300w" sizes="(max-width: 690px) 100vw, 690px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">You can also use Nvidia&#8217;s prebuilt container image from the <a href="https://catalog.ngc.nvidia.com">NGC</a>.</figcaption></figure>
</div>


<h2 class="wp-block-heading">CPU (and Memory) Performance Benchmarks</h2>



<p class="wp-block-paragraph">Here I tested some benchmarks to portray how fast it is in general tasks.</p>



<h3 class="wp-block-heading">Geekbench 6</h3>



<p class="wp-block-paragraph">It slightly beat my desktop PC which has a bigger cooler and consumes more power.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1b91f&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1b91f" class="aligncenter size-large is-resized wp-lightbox-container"><img loading="lazy" decoding="async" width="855" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-13-855x1024.png" alt="" class="wp-image-1995" style="width:855px;height:auto" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-13-855x1024.png 855w, https://efisonlt.com/wp-content/uploads/2026/05/image-13-251x300.png 251w, https://efisonlt.com/wp-content/uploads/2026/05/image-13-768x919.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-13.png 858w" sizes="(max-width: 855px) 100vw, 855px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://browser.geekbench.com/v6/cpu/compare/17994641?baseline=12639240">Geekbench 6 result against an 8P core i7-12700K.</a></figcaption></figure>
</div>


<h3 class="wp-block-heading">Phoronix Linux Kernel 7.0 Compilation</h3>



<p class="wp-block-paragraph">Unfortunately it fell short in compilation performance. Personally, I also found this machine to be quite slow during Nunchaku wheel compilation which I will explain later on.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1be8e&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1be8e" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="600" height="149" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-17.png" alt="" class="wp-image-1999" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-17.png 600w, https://efisonlt.com/wp-content/uploads/2026/05/image-17-300x75.png 300w" sizes="(max-width: 600px) 100vw, 600px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/result/2605148-NE-DGXSPARK245">Compilation time of Linux Kernel 7.0.</a></figcaption></figure>
</div>


<p class="wp-block-paragraph">It is slower than an aging 8-core 8-thread i7-9700. Ouch.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1c353&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1c353" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="978" height="119" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-15.png" alt="" class="wp-image-1997" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-15.png 978w, https://efisonlt.com/wp-content/uploads/2026/05/image-15-300x37.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-15-768x93.png 768w" sizes="(max-width: 978px) 100vw, 978px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/test/pts/build-linux-kernel">Linux kernel 7.0 compilation time comparison on various different CPUs</a>.</figcaption></figure>
</div>


<h3 class="wp-block-heading">Phoronix SVT-AV1 4.0</h3>



<p class="wp-block-paragraph">Similar picture painted when tested against SVT AV1 encoding using CPU.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1c847&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1c847" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="600" height="184" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-16.png" alt="" class="wp-image-1998" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-16.png 600w, https://efisonlt.com/wp-content/uploads/2026/05/image-16-300x92.png 300w" sizes="(max-width: 600px) 100vw, 600px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/result/2605148-NE-DGXSPARK245">AV1 encoding speed using SVT library.</a></figcaption></figure>
</div>


<p class="wp-block-paragraph">It&#8217;s only marginally faster than an aging 4-core 8-thread mobile CPU i7-8550U.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1ccf9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1ccf9" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="889" height="110" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-18.png" alt="" class="wp-image-2000" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-18.png 889w, https://efisonlt.com/wp-content/uploads/2026/05/image-18-300x37.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-18-768x95.png 768w" sizes="(max-width: 889px) 100vw, 889px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/test/pts/svt-av1">SVT-AV1 performance comparison on various different CPUs.</a></figcaption></figure>
</div>


<h3 class="wp-block-heading">Phoronix 7-zip 26.01 Compression and Decompression</h3>



<p class="wp-block-paragraph">This one yielded a better outcome. My theory would be the large memory bandwidth helps with the compression/decompression performance by much and feeds the cores with the necessary data nicely.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1d1ea&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1d1ea" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="600" height="182" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-19.png" alt="" class="wp-image-2001" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-19.png 600w, https://efisonlt.com/wp-content/uploads/2026/05/image-19-300x91.png 300w" sizes="(max-width: 600px) 100vw, 600px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/result/2605148-NE-DGXSPARK245">7-zip 26.01 file compression speed.</a></figcaption></figure>
</div>


<p class="wp-block-paragraph">Only slightly lower than another mini AI machine (Ryzen AI Max+ 395) and faster than the current gen mainstream-class Intel desktop CPU (Core Ultra 5 250K Plus).</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1d6e3&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1d6e3" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="1013" height="140" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-22.png" alt="" class="wp-image-2004" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-22.png 1013w, https://efisonlt.com/wp-content/uploads/2026/05/image-22-300x41.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-22-768x106.png 768w" sizes="(max-width: 1013px) 100vw, 1013px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/test/pts/compress-7zip&amp;eval=056b87080920579f1e0b2f364cf8191f205d4f52#metrics">7-zip 26.01 file compression performance on various different CPUs.</a></figcaption></figure>
</div>


<p class="wp-block-paragraph">The decompression performance also painted a good number.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1dbac&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1dbac" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="600" height="182" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-20.png" alt="" class="wp-image-2002" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-20.png 600w, https://efisonlt.com/wp-content/uploads/2026/05/image-20-300x91.png 300w" sizes="(max-width: 600px) 100vw, 600px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/result/2605148-NE-DGXSPARK245">7-zip 26.01 file decompression speed.</a></figcaption></figure>
</div>


<p class="wp-block-paragraph">Albeit a much bigger gap against the Ryzen AI Max+ 395. Still respectable nonetheless.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1e04a&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1e04a" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="171" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-23-1024x171.png" alt="" class="wp-image-2005" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-23-1024x171.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-23-300x50.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-23-768x128.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-23.png 1030w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption"><a href="https://openbenchmarking.org/test/pts/compress-7zip&amp;eval=c492dfa3fd4ab77f88e146cd61863fe368407a56#metrics">7-zip 26.01 file decompression performance on various different CPUs.</a></figcaption></figure>
</div>


<h2 class="wp-block-heading">Comparison against A Similar Mini AI Machine</h2>



<p class="wp-block-paragraph">It&#8217;s been a year or so that AMD released the Strix Halo (Ryzen AI Max+ 395 with integrated Radeon 8060S). I actually <a href="https://www.youtube.com/watch?v=_nl9WtOEL2E">tested one</a> but haven&#8217;t written any article about it (sorry &#x1f625;). Now when you see the raw specification, they&#8217;re kinda similar in stature.</p>



<figure class="wp-block-table"><table><thead><tr><th></th><th class="has-text-align-right" data-align="right">DGX Spark</th><th>Strix Halo</th></tr></thead><tbody><tr><td><strong>Memory</strong></td><td class="has-text-align-right" data-align="right">128 GB 256-bit LPDDR5x unified</td><td>128 GB 256-bit LPDDR5x shared</td></tr><tr><td><strong>Memory Bandwidth</strong></td><td class="has-text-align-right" data-align="right">273 GB/s</td><td>256 GB/s</td></tr><tr><td><strong>Smallest Floating Point Support</strong></td><td class="has-text-align-right" data-align="right">NVFP4, MXFP4</td><td>BF16, FP16</td></tr><tr><td><strong>Smallest Integer Support</strong></td><td class="has-text-align-right" data-align="right">INT4</td><td>INT4</td></tr></tbody></table></figure>



<p class="wp-block-paragraph">But it has this glaring difference: DGX Spark <strong>supports a smaller floating point data type</strong>.</p>



<p class="wp-block-paragraph">Aside from the data type support, you can also look at the memory. DGX Spark uses <strong>unified</strong> memory while Strix Halo uses <strong>shared</strong> memory. This is different in nature.</p>



<p class="wp-block-paragraph">With unified memory, you get a transparent memory addressing in which the CPU and GPU can access the same memory address, making it possible for the CPU or GPU-bound application to get more memory available on the pool on-demand. They can access the same 128 GB available memory in whole.</p>



<p class="wp-block-paragraph">Strix Halo, uses shared memory in which you need to set a fixed amount of shared UMA (unified memory address) size available for the GPU. While the GPU can dynamically allocate the GPU memory to the available memory address, the fixed amount of shared UMA is locked for GPU and the CPU can only use the remaining memory pool. For example you can set the UMA size of 1 GB statically set to the GPU, then the </p>



<p class="wp-block-paragraph">We&#8217;ll see the difference in performance later on.</p>



<h2 class="wp-block-heading">LLM Performance</h2>



<p class="wp-block-paragraph">First, I need to talk about sidelining llama.cpp for vLLM. The reason is llama.cpp wouldn&#8217;t be representative for this machine LLM capability measurement. As I already presented above, it has a large unified memory capacity of 128 GB. With this much of a memory, you are not on the mercy of GGUF models with llama.cpp and you can just run the non-GGUF quantized model with an inference engine good enough to serve more requests.</p>



<p class="wp-block-paragraph">Wait that explanation isn&#8217;t really helpful, right?</p>



<p class="wp-block-paragraph">Let me just show you this model size comparison.</p>



<p class="wp-block-paragraph">Here are <a href="https://huggingface.co/unsloth/Qwen3.6-35B-A3B-GGUF">unsloth/Qwen3.6-35B-A3B-GGUF</a> models&#8217; size.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1e8a3&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1e8a3" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="491" height="755" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-25.png" alt="" class="wp-image-2012" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-25.png 491w, https://efisonlt.com/wp-content/uploads/2026/05/image-25-195x300.png 195w" sizes="(max-width: 491px) 100vw, 491px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">With GGUF model, you can grab a lower quantization easily to scale with your memory capacity.</figcaption></figure>
</div>


<p class="wp-block-paragraph">And this is <a href="https://huggingface.co/unsloth/Qwen3.6-35B-A3B-NVFP4">unsloth/Qwen3.6-35B-A3B-NVFP4</a> model size.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1ed4a&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1ed4a" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="398" height="259" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-24.png" alt="" class="wp-image-2011" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-24.png 398w, https://efisonlt.com/wp-content/uploads/2026/05/image-24-300x195.png 300w" sizes="(max-width: 398px) 100vw, 398px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">This non-GGUF model only has one variant, which is 23 GB in size.</figcaption></figure>
</div>


<p class="wp-block-paragraph">llama.cpp will allocate certain parts of the memory to be used as KV cache. This number is fixed based on the data type being used and the context length set up in the llama-server directive. It can&#8217;t grow or shrink to fit the available memory, nor set up on-the-fly when the inference server is running.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1f22f&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1f22f" class="aligncenter wp-lightbox-container"><img loading="lazy" decoding="async" width="960" height="76" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-33.png" alt="" class="wp-image-2037" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-33.png 960w, https://efisonlt.com/wp-content/uploads/2026/05/image-33-300x24.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-33-768x61.png 768w" sizes="(max-width: 960px) 100vw, 960px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">In this example, I used context length of 262144 tokens, K cache data type of q4_0, and V cache data type of q4_0. It used 1440 MB of the available memory for the context alone, independent from the memory needed to run the model.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Unlike llama.cpp, when you use an enterprise focused inference engine like vLLM, you can use the remaining available memory for more concurrency.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db1f6d1&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db1f6d1" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="766" height="118" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/Screenshot_20260525_121749.png" alt="" class="wp-image-2025" srcset="https://efisonlt.com/wp-content/uploads/2026/05/Screenshot_20260525_121749.png 766w, https://efisonlt.com/wp-content/uploads/2026/05/Screenshot_20260525_121749-300x46.png 300w" sizes="(max-width: 766px) 100vw, 766px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">In this example, the remaining 67.5 GB after loading the data would be allocated as KV cache memory, which then translate to 13.3 of maximum concurrencies estimation.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Now when we put the comparison side-by-side, you can see that vLLM can serve more concurrent requests easily compared to llama.cpp, even with a similarly sized GGUF model. All the benchmarks were done with <a href="https://github.com/eugr/llama-benchy">llama-benchy</a> for easy benchmarking tool and consistency.</p>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>llama.cpp serve command line, model: unsloth/Qwen3.6-35B-A3B-GGUF, UD-Q4_K_XL quantization</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;--cbp-line-number-width:calc(2 * 0.6 * .875rem);line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>llama-serve \
  --model unsloth/Qwen3.6-35B-A3B-GGUF/Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf \
  --mmproj unsloth/Qwen3.6-35B-A3B-GGUF/mmproj-BF16.gguf --image-min-tokens 1024 \
  -ctk q8_0 -ctv q8_0 -c 262144 -ub 4096 -b 4096 \
  -fa 1 \
  --temp 0.6 --top-p 0.95 --top-k 20 --min-p 0.0 \
  --presence-penalty 0.0 --repeat-penalty 1.0 \
  --parallel 1 --threads 16 \
  --host 0.0.0.0 --port 8000 \
  --jinja</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">llama-serve</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--model</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">unsloth/Qwen3.6-35B-A3B-GGUF/Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--mmproj</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">unsloth/Qwen3.6-35B-A3B-GGUF/mmproj-BF16.gguf</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--image-min-tokens</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1024</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-ctk</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">q8_0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-ctv</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">q8_0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-c</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">262144</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-ub</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">4096</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-b</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">4096</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-fa</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--temp</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.6</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--top-p</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.95</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--top-k</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">20</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--min-p</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.0</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--presence-penalty</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--repeat-penalty</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1.0</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--parallel</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--threads</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">16</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--host</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.0</span><span style="color: #CE9178">.0.0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--port</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8000</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--jinja</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>vLLM serve command line, model: unsloth/Qwen3.6-35B-A3B-NVFP4</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;--cbp-line-number-width:calc(1 * 0.6 * .875rem);line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>podman run \
  -p 8000:8000 \
  --device nvidia.com/gpu=all \
  -v ~/.cache/huggingface:/root/.cache/huggingface:Z \
  docker.io/vllm/vllm-openai:cu130-nightly unsloth/Qwen3.6-35B-A3B-NVFP4 \
  --enable-auto-tool-choice --tool-call-parser qwen3_coder --reasoning-parser qwen3 \
  --trust-remote-code \
  --dtype float16 \
  --max-model-len -1 \
  --gpu-memory-utilization 0.8</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">podman</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">run</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-p</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8000</span><span style="color: #CE9178">:8000</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">nvidia.com/gpu=all</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-v</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">~/.cache/huggingface:/root/.cache/huggingface:Z</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #CE9178">docker.io/vllm/vllm-openai:cu130-nightly</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">unsloth/Qwen3.6-35B-A3B-NVFP4</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--enable-auto-tool-choice</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tool-call-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3_coder</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--reasoning-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--trust-remote-code</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--dtype</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">float16</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--max-model-len</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--gpu-memory-utilization</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.8</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>llama-benchy benchmarking line for the llama.cpp server, model: unsloth/Qwen3.6-35B-A3B-GGUF, UD-Q4_K_XL quantization</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>llama-benchy \
  --base-url http://${HOST}:8000/v1 \
  --model Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf \
  --depth 0 8192 \
  --pp 2048 --tg 256 \
  --concurrency 1 2 4 \
  --no-results-on-fail</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">llama-benchy</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--base-url</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">http://</span><span style="color: #D4D4D4">${</span><span style="color: #9CDCFE">HOST</span><span style="color: #D4D4D4">}</span><span style="color: #CE9178">:8000/v1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--model</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">Qwen3.6-35B-A3B-UD-Q4_K_XL.gguf</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--depth</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8192</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--pp</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2048</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tg</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">256</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--concurrency</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">4</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--no-results-on-fail</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>llama-benchy benchmarking line for the vLLM server, model: unsloth/Qwen3.6-35B-A3B-NVFP4</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>llama-benchy \
  --base-url http://${HOST}:8000/v1 \
  --model unsloth/Qwen3.6-35B-A3B-NVFP4 \
  --depth 0 8192 \
  --pp 2048 --tg 256 \
  --concurrency 1 2 4 \
  --no-results-on-fail</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">llama-benchy</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--base-url</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">http://</span><span style="color: #D4D4D4">${</span><span style="color: #9CDCFE">HOST</span><span style="color: #D4D4D4">}</span><span style="color: #CE9178">:8000/v1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--model</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">unsloth/Qwen3.6-35B-A3B-NVFP4</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--depth</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8192</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--pp</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2048</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tg</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">256</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--concurrency</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">4</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--no-results-on-fail</span></span></code></pre></div>
</details>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db21a90&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db21a90" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="685" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1024x685.png" alt="" class="wp-image-2013" srcset="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1024x685.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-300x201.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-768x514.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1536x1027.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048.png 1773w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">The llama.cpp fell through in prompt processing performance when faced with more than 1 concurrent request.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db21fe5&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db21fe5" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="691" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/tg256-1024x691.png" alt="" class="wp-image-2014" srcset="https://efisonlt.com/wp-content/uploads/2026/05/tg256-1024x691.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-300x203.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-768x518.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-1536x1037.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/tg256.png 1757w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Similar story with the token generation performance. Although not as dramatic as the prompt processing one.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Now that I&#8217;ve already put out my reasons, let&#8217;s continue to the vLLM benchmarks.</p>



<p class="wp-block-paragraph">But wait, there&#8217;s more. I also put <a href="https://efisonlt.com/our-experience-with-asus-amd-radeon-ai-pro-r9700-turbo/">Radeon AI Pro R9700</a> as another data for comparison.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db22536&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db22536" class="aligncenter size-large is-resized wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="527" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-26-1024x527.png" alt="" class="wp-image-2015" style="aspect-ratio:1.9430508615453308;width:302px;height:auto" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-26-1024x527.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-26-300x154.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-26-768x395.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-26.png 1366w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>
</div>


<figure class="wp-block-table"><table><thead><tr><th></th><th class="has-text-align-right" data-align="right">DGX Spark</th><th class="has-text-align-right" data-align="right">Radeon AI Pro R9700</th><th class="has-text-align-right" data-align="right">Strix Halo</th></tr></thead><tbody><tr><td><strong>Memory</strong></td><td class="has-text-align-right" data-align="right">128 GB 256-bit LPDDR5x unified</td><td class="has-text-align-right" data-align="right">32 GB 256-bit GDDR6 dedicated</td><td class="has-text-align-right" data-align="right">128 GB 256-bit LPDDR5x shared</td></tr><tr><td><strong>Memory Bandwidth</strong></td><td class="has-text-align-right" data-align="right">273 GB/s</td><td class="has-text-align-right" data-align="right">644.6 GB/s</td><td class="has-text-align-right" data-align="right">256 GB/s</td></tr><tr><td><strong>Smallest Floating Point Support</strong></td><td class="has-text-align-right" data-align="right">NVFP4, MXFP4</td><td class="has-text-align-right" data-align="right">FP8</td><td class="has-text-align-right" data-align="right">BF16, FP16</td></tr><tr><td><strong>Smallest Integer Support</strong></td><td class="has-text-align-right" data-align="right">INT4</td><td class="has-text-align-right" data-align="right">INT4</td><td class="has-text-align-right" data-align="right">INT4</td></tr></tbody></table></figure>



<h3 class="wp-block-heading">vLLM Container Image Version</h3>



<p class="wp-block-paragraph">For the vLLM deployment, I used these container images running on Podman for each machine, as I found them to be the most performant at the time of testing (18 May 2026).</p>



<figure class="wp-block-table"><table><thead><tr><th>Machine/GPU</th><th>Container Image</th></tr></thead><tbody><tr><td>DGX Spark</td><td><a href="https://hub.docker.com/r/vllm/vllm-openai/tags?name=cu130-nightly">docker.io/vllm/vllm-openai:cu130-nightly</a></td></tr><tr><td>Radeon AI Pro R9700</td><td><a href="https://hub.docker.com/r/rocm/vllm-dev/tags?name=nightly">docker.io/rocm/vllm-dev:nightly</a></td></tr><tr><td>Strix Halo</td><td><a href="https://hub.docker.com/r/kyuz0/vllm-therock-gfx1151/tags?name=latest">docker.io/kyuz0/vllm-therock-gfx1151:latest</a></td></tr></tbody></table></figure>



<p class="wp-block-paragraph">Also here are the vLLM inference engine serving lines for each GPU:</p>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>vLLM serve command line for DGX Spark</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;--cbp-line-number-width:calc(1 * 0.6 * .875rem);line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>podman run \
  -p 8000:8000 \
  --device nvidia.com/gpu=all \
  -v ~/.cache/huggingface:/root/.cache/huggingface:Z \
  docker.io/vllm/vllm-openai:cu130-nightly ${MODEL} \
  --enable-auto-tool-choice --tool-call-parser qwen3_coder --reasoning-parser qwen3 \
  --trust-remote-code \
  --dtype float16 \
  --max-model-len -1 \
  --gpu-memory-utilization 0.8</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">podman</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">run</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-p</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8000</span><span style="color: #CE9178">:8000</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">nvidia.com/gpu=all</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">-v</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">~/.cache/huggingface:/root/.cache/huggingface:Z</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #CE9178">docker.io/vllm/vllm-openai:cu130-nightly</span><span style="color: #D4D4D4"> ${</span><span style="color: #9CDCFE">MODEL</span><span style="color: #D4D4D4">} </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--enable-auto-tool-choice</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tool-call-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3_coder</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--reasoning-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--trust-remote-code</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--dtype</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">float16</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--max-model-len</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--gpu-memory-utilization</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.8</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>vLLM serve command line for Radeon AI Pro R9700 </summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;--cbp-line-number-width:calc(1 * 0.6 * .875rem);line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly># Create distrobox
distrobox create vllm \
  --image docker.io/rocm/vllm-dev:nightly \
  -- \
  --device /dev/dri --device /dev/kfd \
  --group-add keep-groups --security-opt seccomp=unconfined

# Enter distrobox
distrobox enter vllm

# Run vllm serve inside the distrobox
vllm serve \
  ${MODEL} \
  --host 0.0.0.0 --port 8000 \
  --enable-auto-tool-choice --tool-call-parser qwen3_coder --reasoning-parser qwen3 \
  --trust-remote-code \
  --dtype float16 \
  --max-model-len -1 \
  --gpu-memory-utilization 0.9 \
  --enforce-eager</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #6A9955"># Create distrobox</span></span>
<span class="line"><span style="color: #DCDCAA">distrobox</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">create</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">vllm</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--image</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">docker.io/rocm/vllm-dev:nightly</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">/dev/dri</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">/dev/kfd</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--group-add</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">keep-groups</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--security-opt</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">seccomp=unconfined</span></span>
<span class="line"></span>
<span class="line"><span style="color: #6A9955"># Enter distrobox</span></span>
<span class="line"><span style="color: #DCDCAA">distrobox</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">enter</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">vllm</span></span>
<span class="line"></span>
<span class="line"><span style="color: #6A9955"># Run vllm serve inside the distrobox</span></span>
<span class="line"><span style="color: #DCDCAA">vllm</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">serve</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  ${</span><span style="color: #9CDCFE">MODEL</span><span style="color: #D4D4D4">} </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--host</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.0</span><span style="color: #CE9178">.0.0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--port</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8000</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--enable-auto-tool-choice</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tool-call-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3_coder</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--reasoning-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--trust-remote-code</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--dtype</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">float16</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--max-model-len</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--gpu-memory-utilization</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.9</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--enforce-eager</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>vLLM serve command line for Strix Halo</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;--cbp-line-number-width:calc(1 * 0.6 * .875rem);line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly># Create distrobox
distrobox create vllm-gfx1151 \
  --image docker.io/kyuz0/vllm-therock-gfx1151:latest \
  -- \
  --device /dev/dri --device /dev/kfd \
  --group-add keep-groups --security-opt seccomp=unconfined

# Enter distrobox
distrobox enter vllm-gfx1151

# Run vllm serve inside the distrobox
VLLM_ROCM_USE_AITER=1 vllm serve \
  ${MODEL} \
  --host 0.0.0.0 --port 8000 \
  --enable-auto-tool-choice --tool-call-parser qwen3_coder --reasoning-parser qwen3 \
  --trust-remote-code \
  --dtype float16 \
  --max-model-len -1 \
  --gpu-memory-utilization 0.8</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #6A9955"># Create distrobox</span></span>
<span class="line"><span style="color: #DCDCAA">distrobox</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">create</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">vllm-gfx1151</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--image</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">docker.io/kyuz0/vllm-therock-gfx1151:latest</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">/dev/dri</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--device</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">/dev/kfd</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--group-add</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">keep-groups</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--security-opt</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">seccomp=unconfined</span></span>
<span class="line"></span>
<span class="line"><span style="color: #6A9955"># Enter distrobox</span></span>
<span class="line"><span style="color: #DCDCAA">distrobox</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">enter</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">vllm-gfx1151</span></span>
<span class="line"></span>
<span class="line"><span style="color: #6A9955"># Run vllm serve inside the distrobox</span></span>
<span class="line"><span style="color: #9CDCFE">VLLM_ROCM_USE_AITER</span><span style="color: #D4D4D4">=</span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #DCDCAA">vllm</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">serve</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  ${</span><span style="color: #9CDCFE">MODEL</span><span style="color: #D4D4D4">} </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--host</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.0</span><span style="color: #CE9178">.0.0</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--port</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8000</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--enable-auto-tool-choice</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tool-call-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3_coder</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--reasoning-parser</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">qwen3</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--trust-remote-code</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--dtype</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">float16</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--max-model-len</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">-1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--gpu-memory-utilization</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0.8</span></span></code></pre></div>
</details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>llama-benchy benchmarking for the vLLM servers</summary>
<div class="wp-block-kevinbatdorf-code-block-pro" data-code-block-pro-font-family="Code-Pro-JetBrains-Mono" style="font-size:.875rem;font-family:Code-Pro-JetBrains-Mono,ui-monospace,SFMono-Regular,Menlo,Monaco,Consolas,monospace;line-height:1.25rem;--cbp-tab-width:2;tab-size:var(--cbp-tab-width, 2)"><span style="display:flex;align-items:center;padding:10px 0px 10px 16px;margin-bottom:-2px;width:100%;text-align:left;background-color:#2b2b2b;color:#c7c7c7">Bash</span><span role="button" tabindex="0" style="color:#D4D4D4;display:none" aria-label="Copy" class="code-block-pro-copy-button"><pre class="code-block-pro-copy-button-pre" aria-hidden="true"><textarea class="code-block-pro-copy-button-textarea" tabindex="-1" aria-hidden="true" readonly>llama-benchy \
  --base-url http://${HOST}:8000/v1 \
  --model ${MODEL} \
  --depth 0 8192 16384 32768 \
  --pp 2048 --tg 256 \
  --concurrency 1 2 4 8 \
  --no-results-on-fail</textarea></pre><svg xmlns="http://www.w3.org/2000/svg" style="width:24px;height:24px" fill="none" viewBox="0 0 24 24" stroke="currentColor" stroke-width="2"><path class="with-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2m-6 9l2 2 4-4"></path><path class="without-check" stroke-linecap="round" stroke-linejoin="round" d="M9 5H7a2 2 0 00-2 2v12a2 2 0 002 2h10a2 2 0 002-2V7a2 2 0 00-2-2h-2M9 5a2 2 0 002 2h2a2 2 0 002-2M9 5a2 2 0 012-2h2a2 2 0 012 2"></path></svg></span><pre class="shiki dark-plus" style="background-color: #1E1E1E" tabindex="0"><code><span class="line"><span style="color: #DCDCAA">llama-benchy</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--base-url</span><span style="color: #D4D4D4"> </span><span style="color: #CE9178">http://</span><span style="color: #D4D4D4">${</span><span style="color: #9CDCFE">HOST</span><span style="color: #D4D4D4">}</span><span style="color: #CE9178">:8000/v1</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--model</span><span style="color: #D4D4D4"> ${</span><span style="color: #9CDCFE">MODEL</span><span style="color: #D4D4D4">} </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--depth</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">0</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8192</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">16384</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">32768</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--pp</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2048</span><span style="color: #D4D4D4"> </span><span style="color: #569CD6">--tg</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">256</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--concurrency</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">1</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">2</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">4</span><span style="color: #D4D4D4"> </span><span style="color: #B5CEA8">8</span><span style="color: #D4D4D4"> </span><span style="color: #D7BA7D">\</span></span>
<span class="line"><span style="color: #D4D4D4">  </span><span style="color: #569CD6">--no-results-on-fail</span></span></code></pre></div>
</details>



<h3 class="wp-block-heading">vLLM Performance, Qwen/Qwen3.5-35B-A3B-GPTQ-Int4</h3>



<p class="wp-block-paragraph">First, I used an exact same model, <a href="https://huggingface.co/Qwen/Qwen3.5-35B-A3B-GPTQ-Int4">Qwen/Qwen3.5-35B-A3B-GPTQ-Int4</a>, which is Qwen3.5-35B-A3B model that has been quantized to int4 (4-bit integer). This way we can measure the performance without resorting to any hardware data type support advantage.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db23697&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db23697" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="512" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-1024x512.png" alt="" class="wp-image-2016" srcset="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-1024x512.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-768x384.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-1536x768.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-1-2048x1024.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">vLLM prompt processing performance using Qwen/Qwen3.5-35B-A3B-GPTQ-Int4 from DGX Spark, R9700, and Strix Halo.</figcaption></figure>
</div>


<p class="wp-block-paragraph">As we can see above, the Radeon AI Pro R9700 was able to beat the DGX Spark when the prompt has no meaningful context length. But, the DGX Spark was able to gain advantage, even so slightly, on deeper contexts. Unfortunately, the Strix Halo fumbled big with the performance was nowhere around those two.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db23b84&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db23b84" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="512" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-1024x512.png" alt="" class="wp-image-2019" srcset="https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-1024x512.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-768x384.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-1536x768.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-1-2048x1024.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">vLLM token generation performance using Qwen/Qwen3.5-35B-A3B-GPTQ-Int4 from DGX Spark, R9700, and Strix Halo.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Funnily enough, even with lower memory bandwidth (273 GB/s vs 644.6 GB/s), the DGX Spark was able to blow the Radeon AI Pro R9700 out of the water. Don&#8217;t even mention the Strix Halo, it failed to perform to even half of the DGX Spark performance, and even worse on deeper contexts.</p>



<h3 class="wp-block-heading">vLLM Performance, Qwen/Qwen3.6-35B-A3B, Different Quantization</h3>



<p class="wp-block-paragraph">For this one I used same models with different quantization on the DGX Spark against both set of Radeons.</p>



<p class="wp-block-paragraph">The GB10 in the DGX Spark is based on <a href="https://developer.nvidia.com/blog/introducing-nvfp4-for-efficient-and-accurate-low-precision-inference/">Nvidia Blackwell architecture which supports NVFP4</a>. This support is shared with other Blackwell-based GPU such as RTX 50 Series, RTX PRO Blackwell Series, B100, B200, B300, etc. For that reason, I used <a href="https://huggingface.co/unsloth/Qwen3.6-35B-A3B-NVFP4">unsloth/Qwen3.6-35B-A3B-NVFP4</a> to represent the lowest possible hardware quantization support.</p>



<p class="wp-block-paragraph">As the Radeon AI Pro R9700 and Strix Halo only support 4-bit on the integer unit and not on the floating point unit, I used <a href="https://huggingface.co/cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit">cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit</a>. It uses <a href="https://hanlab.mit.edu/projects/awq">activation-aware weight quantization technique from the MIT HAN Lab</a> and mainly utilises int4 (4-bit integer).</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db24176&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db24176" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="512" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-1024x512.png" alt="" class="wp-image-2041" srcset="https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-1024x512.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-768x384.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-1536x768.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/pp2048-2-2048x1024.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">vLLM prompt processing performance using Qwen/Qwen3.6-35B-A3B from DGX Spark, R9700, and Strix Halo.</figcaption></figure>
</div>


<p class="wp-block-paragraph">This one brought a more interesting comparison between the DGX Spark and the Radeon AI Pro R9700. Somehow the DGX Spark was able to gain foothold on no context depth with 4 concurrent requests. Also the deeper contexts on the DGX Spark shined this time against the R9700. And yes, the Strix Halo performance was far behind those two.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db24658&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db24658" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="512" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-1024x512.png" alt="" class="wp-image-2042" srcset="https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-1024x512.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-768x384.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-1536x768.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/tg256-2-2048x1024.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">vLLM token generation performance using Qwen/Qwen3.6-35B-A3B from DGX Spark, R9700, and Strix Halo.</figcaption></figure>
</div>


<p class="wp-block-paragraph">The data portrays similar story. But this time, the DGX Spark was slightly faster and the R9700 was slightly slower, giving more performance delta between those two. Poor Strix Halo still struggled hard.</p>



<h3 class="wp-block-heading">LLM Performance Conclusion</h3>



<p class="wp-block-paragraph">At the time of writing, DGX Spark can be found for around IDR170Mio (assuming the currency rate of IDR18k/USD), give or take, with bigger storage size would warrant higher price.</p>



<p class="wp-block-paragraph">Now compare that to R9700 which can be found for around IDR30Mio. When you factor the total system cost of around IDR80Mio for a high-end system, yet yields LLM performance not far short of the DGX Spark, you would have to answer at least these 4 questions:</p>



<ol class="wp-block-list">
<li>Do you need a smaller footprint?</li>



<li>Do you need lower heat and power envelope?</li>



<li>Do you need a better software/model/documentation/support?</li>



<li>Do you intend to setup a cluster?</li>
</ol>



<p class="wp-block-paragraph">If most of your answers are yes, I think the IDR170Mio of DGX Spark would worth the money compared to R9700 or Strix Halo.</p>



<p class="wp-block-paragraph">I&#8217;m sorry Strix Halo. But with your current expected price for 128 GB variant of around IDR75Mio, I would rather choose between R9700 or DGX Spark.</p>



<h2 class="wp-block-heading">Image Generation Performance</h2>



<p class="wp-block-paragraph">Things get more interesting here on image generation test. As I already wrote above regarding the NVFP4 support, you can expect that this mini machine would be able to run Nunchaku SVDQuant FP4 model like the RTX 5070 Ti we tested on <a href="https://efisonlt.com/our-experience-with-asus-amd-radeon-ai-pro-r9700-turbo/#test-results-qwen-image-edit-2509">this article</a>. But there&#8217;s a big <s>trouble</s> challenge. Big big challenge.</p>



<p class="wp-block-paragraph">The Nunchaku wheel binaries provided on <a href="https://github.com/nunchaku-ai/nunchaku/releases/">Nunchaku Github release</a> are only available for x86_64. Meanwhile this DGX Spark uses Arm processor inside.</p>



<p class="wp-block-paragraph">Oof.</p>



<p class="wp-block-paragraph">Oh wait. Turns out there is <a href="https://note.com/tori29umai/n/n7dea04e9281b">someone from Japan who was able to build and compile from source</a>, so that Nunchaku wheel would run for aarch64 (Arm64)!</p>



<p class="wp-block-paragraph">Yep. After following the direction in which I required to use Google Translate to decipher that article (shout out to <a href="https://note.com/tori29umai">tori29umai</a>) and waited for several minutes of compilation process (trust me it was goddamn long lol, I&#8217;m used to faster CPUs), voila!</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db24f28&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db24f28" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="911" height="1024" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/nunchaku-compilation-911x1024.png" alt="" class="wp-image-2044" srcset="https://efisonlt.com/wp-content/uploads/2026/05/nunchaku-compilation-911x1024.png 911w, https://efisonlt.com/wp-content/uploads/2026/05/nunchaku-compilation-267x300.png 267w, https://efisonlt.com/wp-content/uploads/2026/05/nunchaku-compilation-768x864.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/nunchaku-compilation.png 1245w" sizes="(max-width: 911px) 100vw, 911px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Those who do not know pain (of self-compiling) will never understand true peace (of simply installing).</figcaption></figure>
</div>


<p class="wp-block-paragraph">I used the same model (Qwen Image Edit 2509), same workflow, and same input images for the comparison as the one used in <a href="https://efisonlt.com/our-experience-with-asus-amd-radeon-ai-pro-r9700-turbo/#qwen-image-edit-2509-benchmark-setup-and-rambling-about-pytorch-for-rocm-on-windows-situation">this article</a>.</p>



<figure class="wp-block-table"><table><thead><tr><th>Type</th><th>Model</th></tr></thead><tbody><tr><td>GGUF</td><td>Base: <a href="https://huggingface.co/QuantStack/Qwen-Image-Edit-2509-GGUF/blob/main/Qwen-Image-Edit-2509-Q4_K_M.gguf">QuantStack/Qwen-Image-Edit-2509-Q4_K_M</a><br>LoRA: <a href="https://huggingface.co/lightx2v/Qwen-Image-Lightning/blob/main/Qwen-Image-Edit-2509/Qwen-Image-Edit-2509-Lightning-4steps-V1.0-bf16.safetensors">lightx2v/Qwen-Image-Edit-2509-Lightning-4steps-V1.0-bf16</a></td></tr><tr><td>Nunchaku SVDQuant NVFP4</td><td><a href="https://huggingface.co/nunchaku-ai/nunchaku-qwen-image-edit-2509/blob/main/svdq-fp4_r32-qwen-image-edit-2509-lightningv2.0-4steps.safetensors">nunchaku-ai/nunchaku-qwen-image-edit-2509/svdq-fp4_r32-qwen-image-edit-2509-lightningv2.0-4steps</a></td></tr></tbody></table></figure>



<p class="wp-block-paragraph">First, I would like to show you how the DGX Spark stands against the R9700 and the Strix Halo, using GGUF model which can be run across those three.</p>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>DGX Spark &#8211; Qwen Image Edit 2509 GGUF run screenshot</summary>
<p class="wp-block-paragraph"></p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2558d&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2558d" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="415" height="503" fetchpriority="low" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_GGUF_result-crop-4-results.png" alt="" class="wp-image-2075" srcset="https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_GGUF_result-crop-4-results.png 415w, https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_GGUF_result-crop-4-results-248x300.png 248w" sizes="(max-width: 415px) 100vw, 415px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>
</div></details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>Radeon AI Pro R9700 &#8211; Qwen Image Edit 2509 GGUF run screenshot</summary><div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db25b49&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db25b49" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="425" height="509" fetchpriority="low" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/R9700.png" alt="" class="wp-image-2076" srcset="https://efisonlt.com/wp-content/uploads/2026/05/R9700.png 425w, https://efisonlt.com/wp-content/uploads/2026/05/R9700-250x300.png 250w" sizes="(max-width: 425px) 100vw, 425px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>
</div></details>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>Strix Halo &#8211; Qwen Image Edit 2509 GGUF run screenshot</summary>
<p class="wp-block-paragraph"></p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db260d9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db260d9" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="433" height="516" fetchpriority="low" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/Strix-Halo.png" alt="" class="wp-image-2078" srcset="https://efisonlt.com/wp-content/uploads/2026/05/Strix-Halo.png 433w, https://efisonlt.com/wp-content/uploads/2026/05/Strix-Halo-252x300.png 252w" sizes="(max-width: 433px) 100vw, 433px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>
</div></details>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db265c5&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db265c5" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="1000" height="500" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/gguf_only.png" alt="" class="wp-image-2074" srcset="https://efisonlt.com/wp-content/uploads/2026/05/gguf_only.png 1000w, https://efisonlt.com/wp-content/uploads/2026/05/gguf_only-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/gguf_only-768x384.png 768w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Qwen Image Edit 2509 GGUF performance comparison.</figcaption></figure>
</div>


<p class="wp-block-paragraph">One interesting thing is, the Radeon AI Pro R9700 is now much faster than the last time <a href="https://efisonlt.com/our-experience-with-asus-amd-radeon-ai-pro-r9700-turbo/#test-results-qwen-image-edit-2509">I tested</a>.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db26a9b&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db26a9b" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="471" height="156" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-48.png" alt="" class="wp-image-2079" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-48.png 471w, https://efisonlt.com/wp-content/uploads/2026/05/image-48-300x99.png 300w" sizes="(max-width: 471px) 100vw, 471px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">52.17s then to 36.87s now translates to around 30% time shaving! Kudos to AMD!</figcaption></figure>
</div>


<p class="wp-block-paragraph">The Strix Halo is around 4x slower than the other two. Also faster than the last time I tried which requires around 188s to complete the task.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db26f8a&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db26f8a" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="249" height="821" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-49.png" alt="" class="wp-image-2080" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-49.png 249w, https://efisonlt.com/wp-content/uploads/2026/05/image-49-91x300.png 91w" sizes="(max-width: 249px) 100vw, 249px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">This was the last time I ran Qwen Image Edit 2509 GGUF on the Strix Halo.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Now, let&#8217;s get to the staple of Nunchaku SVDQuant FP4 variant.</p>



<details class="wp-block-details is-layout-flow wp-block-details-is-layout-flow"><summary>DGX Spark &#8211; Qwen Image Edit 2509 SVDQuant FP4 run screenshot</summary>
<p class="wp-block-paragraph"></p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db27462&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db27462" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="258" height="1024" fetchpriority="low" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_SVD_NVFP4_result-258x1024.png" alt="" class="wp-image-2081" srcset="https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_SVD_NVFP4_result-258x1024.png 258w, https://efisonlt.com/wp-content/uploads/2026/05/Qwen_Image_Edit_2509_SVD_NVFP4_result.png 308w" sizes="(max-width: 258px) 100vw, 258px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button></figure>
</div></details>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2795b&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2795b" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="1000" height="500" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/full_comparison.png" alt="" class="wp-image-2082" srcset="https://efisonlt.com/wp-content/uploads/2026/05/full_comparison.png 1000w, https://efisonlt.com/wp-content/uploads/2026/05/full_comparison-300x150.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/full_comparison-768x384.png 768w" sizes="(max-width: 1000px) 100vw, 1000px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Qwen Image Edit 2509 complete performance comparison.</figcaption></figure>
</div>


<p class="wp-block-paragraph">The DGX Spark finished the task and shaved around 38% of required time to complete using SVDQuant FP4 model variant. Turns out having a 4-bit floating point unit still helps even with a smaller chip and lower power.</p>



<h2 class="wp-block-heading">Desktop PC Experience</h2>



<p class="wp-block-paragraph">If you want to switch the experience from being a mini supercomputer to a mere mini PC, it handles the job well. Well enough that you can do browsing, but nothing special from the performance standpoint.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db27eb9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db27eb9" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="936" height="688" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-34.png" alt="" class="wp-image-2046" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-34.png 936w, https://efisonlt.com/wp-content/uploads/2026/05/image-34-300x221.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-34-768x565.png 768w" sizes="(max-width: 936px) 100vw, 936px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Reasonable.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db28311&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db28311" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="182" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-35-1024x182.png" alt="" class="wp-image-2047" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-35-1024x182.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-35-300x53.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-35-768x136.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-35.png 1216w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Comparison to other systems, courtesy of <a href="https://www.notebookcheck.net/Benchmarks-and-Test-Results.142793.0.html#gpulist_1952329">NotebookCheck.net</a>. Apparently faster than Strix Halo represented by the Asus ProArt PX13 there.</figcaption></figure>
</div>


<p class="wp-block-paragraph">For multimedia purposes, it supports a wide range of codec on its encoder and decoder.</p>



<figure data-wp-context="{&quot;galleryId&quot;:&quot;6a19d1db2852c&quot;}" data-wp-interactive="core/gallery" class="wp-block-gallery aligncenter has-nested-images columns-default wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db28852&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db28852" class="wp-block-image size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="254" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" data-id="2049" src="https://efisonlt.com/wp-content/uploads/2026/05/image-37-1024x254.png" alt="" class="wp-image-2049" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-37-1024x254.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-37-300x74.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-37-768x190.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-37.png 1062w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Encoder codec support: YES YES YES YES YES YES&#8230;</figcaption></figure>



<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db28c46&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db28c46" class="wp-block-image size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="423" height="263" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" data-id="2050" src="https://efisonlt.com/wp-content/uploads/2026/05/image-38.png" alt="" class="wp-image-2050" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-38.png 423w, https://efisonlt.com/wp-content/uploads/2026/05/image-38-300x187.png 300w" sizes="(max-width: 423px) 100vw, 423px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">&#8230;YES YES YES YES YES</figcaption></figure>
</figure>



<figure data-wp-context="{&quot;galleryId&quot;:&quot;6a19d1db290ec&quot;}" data-wp-interactive="core/gallery" class="wp-block-gallery has-nested-images columns-default wp-block-gallery-2 is-layout-flex wp-block-gallery-is-layout-flex">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db293b9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db293b9" class="wp-block-image size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="294" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" data-id="2051" src="https://efisonlt.com/wp-content/uploads/2026/05/image-39-1024x294.png" alt="" class="wp-image-2051" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-39-1024x294.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-39-300x86.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-39-768x221.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-39.png 1062w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Decoder codec support: ALSO GIVE ME AS MANY YES&#8230;</figcaption></figure>



<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db297b9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db297b9" class="wp-block-image size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="898" height="305" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" data-id="2052" src="https://efisonlt.com/wp-content/uploads/2026/05/image-40.png" alt="" class="wp-image-2052" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-40.png 898w, https://efisonlt.com/wp-content/uploads/2026/05/image-40-300x102.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-40-768x261.png 768w" sizes="(max-width: 898px) 100vw, 898px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">&#8230;AS POSSIBLE</figcaption></figure>
</figure>



<p class="wp-block-paragraph">And my test showed that it is possible to use the NVENC (Nvidia encoder) to do some video rendering with the state of the art AV1 codec.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db29eeb&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db29eeb" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="468" height="614" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-41.png" alt="" class="wp-image-2053" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-41.png 468w, https://efisonlt.com/wp-content/uploads/2026/05/image-41-229x300.png 229w" sizes="(max-width: 468px) 100vw, 468px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">NVENC AV1 is available!</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2a390&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2a390" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="296" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-43-1024x296.png" alt="" class="wp-image-2055" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-43-1024x296.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-43-300x87.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-43-768x222.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-43-1536x444.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/image-43.png 1604w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">AV1 video encoding demonstration using Kdenlive.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2a7f3&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2a7f3" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="380" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-51-1024x380.png" alt="" class="wp-image-2085" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-51-1024x380.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-51-300x111.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-51-768x285.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-51-1536x569.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/image-51-2048x759.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">AV1 video encoding demonstration using Handbrake.</figcaption></figure>
</div>


<h2 class="wp-block-heading">Gaming</h2>



<p class="wp-block-paragraph">Funnily enough, you can also game on DGX Spark. Despite of having a full-fledged Nvidia GPU with shader core, raytrace core, and tensor core, you won&#8217;t be expected to game on this as well as a normal gaming PC. The reason is most of the game available there are developed for:</p>



<ol class="wp-block-list">
<li>Windows (or DirectX API, to be precise)</li>



<li>x86 CPU</li>
</ol>



<p class="wp-block-paragraph">Well, this system is running on:</p>



<ol class="wp-block-list">
<li>Linux (Vulkan API)</li>



<li>Arm CPU</li>
</ol>



<p class="wp-block-paragraph">Now think about it. In order for your ordinary game to work, it needs to translate/emulate both the graphics API and the CPU ISA, before it hits the underlying low level instructions on the GPU and the CPU. The translation layers require certain computational cost in order for it to run.</p>



<p class="wp-block-paragraph">For example, certain games require you to have an x86 CPU with AVX2 instruction extension. But there&#8217;s no such thing on Arm, and no such thing on GB10 CPU as well.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2aeba&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2aeba" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="763" height="135" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-44.png" alt="" class="wp-image-2056" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-44.png 763w, https://efisonlt.com/wp-content/uploads/2026/05/image-44-300x53.png 300w" sizes="(max-width: 763px) 100vw, 763px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">GB10 CPU flags. Do you see any AVX/AVX2/AVX512 anywhere?</figcaption></figure>
</div>


<p class="wp-block-paragraph">Thanks to the people from the <a href="https://box86.org/">Box86/Box64 project</a>, we actually have an x86 to Arm emulator. Kudos to them, they&#8217;re also managed to emulate AVX(2)! Therefore, this <a href="https://discourse.ubuntu.com/t/call-for-testing-steam-snap-for-arm64/74719">Steam for Arm64 on Ubuntu was born</a>.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2b50e&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2b50e" class="aligncenter size-full is-resized wp-lightbox-container"><img loading="lazy" decoding="async" width="838" height="974" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-45.png" alt="" class="wp-image-2057" style="width:838px;height:auto" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-45.png 838w, https://efisonlt.com/wp-content/uploads/2026/05/image-45-258x300.png 258w, https://efisonlt.com/wp-content/uploads/2026/05/image-45-768x893.png 768w" sizes="(max-width: 838px) 100vw, 838px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Steam snap for Arm64</figcaption></figure>
</div>


<p class="wp-block-paragraph">And yes, with the magic of Box64 and <a href="https://github.com/valvesoftware/proton">Proton</a> and <a href="https://gitlab.winehq.org/wine/vkd3d">VKD3D</a> and many many more open source projects, I managed to play Clair Obscur: Expedition 33!</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2baa9&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2baa9" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="576" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-47-1024x576.png" alt="" class="wp-image-2060" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-47-1024x576.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-47-300x169.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-47-768x432.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-47-1536x864.png 1536w, https://efisonlt.com/wp-content/uploads/2026/05/image-47-2048x1152.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Expedition 33 in all of its glory.</figcaption></figure>
</div>


<p class="wp-block-paragraph">Unfortunately I wasn&#8217;t able to make MangoHud or Steam overlay to work, so there&#8217;s no framerate reading available. It was playable, not as smooth as my own PC, but respectable nonetheless for a machine which is not intended for gaming.</p>



<h2 class="wp-block-heading">Power Consumption</h2>



<p class="wp-block-paragraph">DGX Spark only exposes the power draw data from the GPU side. I tried probing the available sensors exposed to the OS to find a way to read the CPU or the SoC power draw using various tools like <a href="https://github.com/lm-sensors/lm-sensors">lm-sensors</a>, <a href="https://github.com/level1techs/siomon">siomon</a>, and <a href="https://github.com/prometheus/node_exporter">Prometheus node_exporter</a>, to no avail. In the end, I only had limited time doing the power draw test. Regardless I can show you some interesting data, using both the GPU power draw sensor and wall outlet watt meter.</p>


<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2c0aa&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2c0aa" class="aligncenter size-full is-resized wp-lightbox-container"><img loading="lazy" decoding="async" width="824" height="733" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-52.png" alt="" class="wp-image-2086" style="width:824px;height:auto" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-52.png 824w, https://efisonlt.com/wp-content/uploads/2026/05/image-52-300x267.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-52-768x683.png 768w" sizes="(max-width: 824px) 100vw, 824px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Video encoding GPU power draw of around 15-17W.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2c5a0&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2c5a0" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="334" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/power-imagegen-1024x334.png" alt="" class="wp-image-2087" srcset="https://efisonlt.com/wp-content/uploads/2026/05/power-imagegen-1024x334.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/power-imagegen-300x98.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/power-imagegen-768x250.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/power-imagegen.png 1080w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Image generation GPU power draw of around 85W, with idle GPU power draw of around 10-11W.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2ca80&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2ca80" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="698" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-55-1024x698.png" alt="" class="wp-image-2090" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-55-1024x698.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-55-300x204.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-55-768x523.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-55.png 1237w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Image generation wall outlet power draw of around 160W.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2d00c&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2d00c" class="aligncenter size-full wp-lightbox-container"><img loading="lazy" decoding="async" width="824" height="584" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-53.png" alt="" class="wp-image-2088" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-53.png 824w, https://efisonlt.com/wp-content/uploads/2026/05/image-53-300x213.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-53-768x544.png 768w" sizes="(max-width: 824px) 100vw, 824px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Expedition 33 gaming GPU power draw of around 45W.</figcaption></figure>
</div>

<div class="wp-block-image">
<figure data-wp-context="{&quot;imageId&quot;:&quot;6a19d1db2d56a&quot;}" data-wp-interactive="core/image" data-wp-key="6a19d1db2d56a" class="aligncenter size-large wp-lightbox-container"><img loading="lazy" decoding="async" width="1024" height="896" data-wp-class--hide="state.isContentHidden" data-wp-class--show="state.isContentVisible" data-wp-init="callbacks.setButtonStyles" data-wp-on--click="actions.showLightbox" data-wp-on--load="callbacks.setButtonStyles" data-wp-on--pointerdown="actions.preloadImage" data-wp-on--pointerenter="actions.preloadImageWithDelay" data-wp-on--pointerleave="actions.cancelPreload" data-wp-on-window--resize="callbacks.setButtonStyles" src="https://efisonlt.com/wp-content/uploads/2026/05/image-57-1024x896.png" alt="" class="wp-image-2092" srcset="https://efisonlt.com/wp-content/uploads/2026/05/image-57-1024x896.png 1024w, https://efisonlt.com/wp-content/uploads/2026/05/image-57-300x262.png 300w, https://efisonlt.com/wp-content/uploads/2026/05/image-57-768x672.png 768w, https://efisonlt.com/wp-content/uploads/2026/05/image-57.png 1189w" sizes="(max-width: 1024px) 100vw, 1024px" /><button
			class="lightbox-trigger"
			type="button"
			aria-haspopup="dialog"
			data-wp-bind--aria-label="state.thisImage.triggerButtonAriaLabel"
			data-wp-init="callbacks.initTriggerButton"
			data-wp-on--click="actions.showLightbox"
			data-wp-style--right="state.thisImage.buttonRight"
			data-wp-style--top="state.thisImage.buttonTop"
		>
			<svg xmlns="http://www.w3.org/2000/svg" width="12" height="12" fill="none" viewBox="0 0 12 12">
				<path fill="#fff" d="M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z" />
			</svg>
		</button><figcaption class="wp-element-caption">Expedition 33 gaming wall outlet power draw of around 126-130W.</figcaption></figure>
</div>


<h2 class="wp-block-heading">Noise</h2>



<p class="wp-block-paragraph">Every implementation of the DGX Spark platform from various Nvidia partners brings their own cooling solution. As for this test, it is specific to MSI EdgeXpert and not representative to all DGX Spark solutions available.</p>



<p class="wp-block-paragraph">I would describe the noise as a typical gaming laptop whisper which can get louder under load. Even at the loudest it&#8217;s still pretty reasonable and you can have this system on your desk no problem.</p>



<figure class="wp-block-video aligncenter"><video height="1440" style="aspect-ratio: 2560 / 1440;" width="2560" controls src="https://efisonlt.com/wp-content/uploads/2026/05/idle.mp4"></video><figcaption class="wp-element-caption">Idle noise.</figcaption></figure>



<figure class="wp-block-video aligncenter"><video height="1440" style="aspect-ratio: 2560 / 1440;" width="2560" controls src="https://efisonlt.com/wp-content/uploads/2026/05/load.mp4"></video><figcaption class="wp-element-caption">Under load noise.</figcaption></figure>



<h2 class="wp-block-heading">Verdict</h2>



<p class="wp-block-paragraph">What can we grab from all those tests?</p>



<p class="wp-block-paragraph">The fact that this has no AI moniker in its name yet excels in pretty much all of my AI tests, impressed me by very much.</p>



<p class="wp-block-paragraph">If you&#8217;re looking for an edge device, with excellent support, software, and matured CUDA platform, maybe this device is for you. The problem would be the price. My estimation of IDR170Mio as per 28th May 2026 is based on the uncertainty of the currency exchange rate, the memory/storage chip shortage, and the freight cargo price hike caused by the Middle East conflict. It can probably get more expensive in the future, with no option of waiting until the price drops.</p>



<p class="wp-block-paragraph">If you&#8217;re looking for alternatives, I don&#8217;t feel there&#8217;s any non-enterprise system or GPU that can give 128 GB of total available memory with this level of performance, especially around vLLM concurrent requests and deep context which are required for agentic AI of today. Not to mention the possibility of stacking multiple DGX Spark and setting up tensor parallelism to run a bigger model or to get more performance.</p>



<p class="wp-block-paragraph">Also pls don&#8217;t buy it for a gaming PC. Save your money and buy any typical desktop or laptop instead.</p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://efisonlt.com/our-experience-with-msi-edgexpert-nvidia-dgx-spark-gb10/">Our Experience with MSI EdgeXpert &#8211; Nvidia DGX Spark GB10</a> appeared first on <a href="https://efisonlt.com">Efison Lisan Teknologi</a>.</p>
]]></content:encoded>
					
		
		<enclosure url="https://efisonlt.com/wp-content/uploads/2026/05/DGX-spark-resources-overview.mp4" length="2354147" type="video/mp4" />
<enclosure url="https://efisonlt.com/wp-content/uploads/2026/05/idle.mp4" length="13696813" type="video/mp4" />
<enclosure url="https://efisonlt.com/wp-content/uploads/2026/05/load.mp4" length="25238578" type="video/mp4" />

			</item>
	</channel>
</rss>
