<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[3090还是3090 *2+NVLink]]></title><description><![CDATA[<p dir="auto">如果我想安装 qwen 3.6 27b 模型<br />
主要是当hermes agent和 openclaw的本地模型<br />
会安装 Uncensored版本的</p>
<p dir="auto">1.建议使用 3090吗?<br />
2.一张跟两张 3090 +NVLink 差异大吗? 会建议两张吗?</p>
<p dir="auto">目前海外的价格  感觉这样买<br />
比买 4090 24G 或是 5090 32G划算<br />
不在国内 没办法买到 4090 48G ........</p>
<p dir="auto">目前主要使用<br />
1.Claude Opus 4.7 + thinking xhigh<br />
2.DeepSeek V4 Pro + thinking max<br />
3.MiniMax-M2.7 + thinking high<br />
希望能力能超过 "MiniMax-M2.7+thinking high"<br />
能跟 "DeepSeek V4 Pro+thinking max" 差不多就更好了</p>
]]></description><link>https://lcz.me/topic/10/3090还是3090-2-nvlink</link><generator>RSS for Node</generator><lastBuildDate>Wed, 20 May 2026 07:04:46 GMT</lastBuildDate><atom:link href="https://lcz.me/topic/10.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 03 May 2026 15:54:44 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 18:46:24 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/peace-love" aria-label="Profile: Peace-Love">@<bdi>Peace-Love</bdi></a> 好吧，以后还真能改，这是隐藏福利。</p>
]]></description><link>https://lcz.me/post/265</link><guid isPermaLink="true">https://lcz.me/post/265</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Tue, 05 May 2026 18:46:24 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 11:44:34 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a><br />
能買三張 5090.</p>
]]></description><link>https://lcz.me/post/216</link><guid isPermaLink="true">https://lcz.me/post/216</guid><dc:creator><![CDATA[Peace Love]]></dc:creator><pubDate>Tue, 05 May 2026 11:44:34 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 08:04:54 GMT]]></title><description><![CDATA[<p dir="auto">目前单3090 跑qwen 3.6 q4km 用了Truboquant 可以跑128k上下文 没什么问题</p>
]]></description><link>https://lcz.me/post/201</link><guid isPermaLink="true">https://lcz.me/post/201</guid><dc:creator><![CDATA[muskelon]]></dc:creator><pubDate>Tue, 05 May 2026 08:04:54 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 05:48:31 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/peace-love" aria-label="Profile: Peace-Love">@<bdi>Peace-Love</bdi></a> 那何必呢，为什么不直接用Pro6000，性价比不是更高？</p>
]]></description><link>https://lcz.me/post/190</link><guid isPermaLink="true">https://lcz.me/post/190</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Tue, 05 May 2026 05:48:31 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 05:44:13 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a><br />
5090 的溢價 , 來自於將來能改 64G , 甚至 96G .<br />
<img src="https://lcz.me/assets/plugins/nodebb-plugin-emoji/emoji/android/1f644.png?v=d348ca29232" class="not-responsive emoji emoji-android emoji--face_with_rolling_eyes" style="height:23px;width:auto;vertical-align:middle" title=":face_with_rolling_eyes:" alt="🙄" /></p>
]]></description><link>https://lcz.me/post/189</link><guid isPermaLink="true">https://lcz.me/post/189</guid><dc:creator><![CDATA[Peace Love]]></dc:creator><pubDate>Tue, 05 May 2026 05:44:13 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 04:53:25 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/%E6%9A%A7%E6%98%A7%E5%85%89%E5%BD%B1" aria-label="Profile: 暧昧光影">@<bdi>暧昧光影</bdi></a> 挺好的，做好散热都没啥问题。</p>
]]></description><link>https://lcz.me/post/178</link><guid isPermaLink="true">https://lcz.me/post/178</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Tue, 05 May 2026 04:53:25 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Tue, 05 May 2026 04:47:44 GMT]]></title><description><![CDATA[<p dir="auto">看到up推荐3090，担心背面显存温度过高，加了点入了3090ti，up觉得怎么样@terry</p>
]]></description><link>https://lcz.me/post/176</link><guid isPermaLink="true">https://lcz.me/post/176</guid><dc:creator><![CDATA[暧昧光影]]></dc:creator><pubDate>Tue, 05 May 2026 04:47:44 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 08:46:34 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/%E9%8D%BE%E5%AD%90%E6%8F%9A" aria-label="Profile: 鍾子揚">@<bdi>鍾子揚</bdi></a> 不建议折腾35b，它不如27b强，甚至差距明显</p>
]]></description><link>https://lcz.me/post/78</link><guid isPermaLink="true">https://lcz.me/post/78</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 04 May 2026 08:46:34 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 08:43:40 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.reddit.com/r/LocalLLaMA/comments/1sw5fb7/qwen36_35b_a3b_heretic_kld_00015_incredible_model/" rel="nofollow ugc">https://www.reddit.com/r/LocalLLaMA/comments/1sw5fb7/qwen36_35b_a3b_heretic_kld_00015_incredible_model/</a></p>
<p dir="auto">這個技術可以把整個qwen 3.6 35bA3B Q8量化+256k上下文塞進去24g vram～有點想跑看看</p>
]]></description><link>https://lcz.me/post/77</link><guid isPermaLink="true">https://lcz.me/post/77</guid><dc:creator><![CDATA[鍾子揚]]></dc:creator><pubDate>Mon, 04 May 2026 08:43:40 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 08:13:55 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 感谢提供意见，在海外买不到4080S 32G  我找另外两张 再次感谢</p>
]]></description><link>https://lcz.me/post/76</link><guid isPermaLink="true">https://lcz.me/post/76</guid><dc:creator><![CDATA[starryskyknight]]></dc:creator><pubDate>Mon, 04 May 2026 08:13:55 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 03:58:35 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/starryskyknight" aria-label="Profile: starryskyknight">@<bdi>starryskyknight</bdi></a> 你买4080S 32G，或加几千买RTX Pro4500 32G。如果想便宜3090 24G。</p>
]]></description><link>https://lcz.me/post/72</link><guid isPermaLink="true">https://lcz.me/post/72</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 04 May 2026 03:58:35 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 03:48:15 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 好的，谢谢。我试一下</p>
]]></description><link>https://lcz.me/post/71</link><guid isPermaLink="true">https://lcz.me/post/71</guid><dc:creator><![CDATA[刘海彬]]></dc:creator><pubDate>Mon, 04 May 2026 03:48:15 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 03:38:24 GMT]]></title><description><![CDATA[<p dir="auto">Q4_K_XL.gguf 这个模型比较大，不太好，不是越大越好的，你换成Q4KM，因为做的人多，兼容性更好。推理关掉， --reasoning-budget 512 改为0，跑Agent它推理极大影响效率，智力提升微乎其微，kv改为80k，可以尝试Truboquant版本。</p>
]]></description><link>https://lcz.me/post/69</link><guid isPermaLink="true">https://lcz.me/post/69</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 04 May 2026 03:38:24 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 03:16:15 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 哥，我的启动参数如下：<br />
/root/llama.cpp/build/bin/llama-server -m /data/models/gguf/Qwen3.6-27B-UD-Q4_K_XL.gguf --mmproj /data/models/gguf/Qwen3.6-27B-mmproj-F16.gguf --mmproj-offload --alias qwen36-27B-Q4 --jinja -ngl 999 -c 128000 -fa on --cache-ram 16384 --cache-type-k q8_0 --cache-type-v q8_0 -np 1 --sampling-seq k --top-k 1 --host 0.0.0.0 --port 11434 --reasoning on --reasoning-format deepseek --reasoning-budget 512</p>
]]></description><link>https://lcz.me/post/67</link><guid isPermaLink="true">https://lcz.me/post/67</guid><dc:creator><![CDATA[刘海彬]]></dc:creator><pubDate>Mon, 04 May 2026 03:16:15 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 02:54:13 GMT]]></title><description><![CDATA[<p dir="auto">terry 抱歉 预算大概一万七一万八人民币内</p>
]]></description><link>https://lcz.me/post/66</link><guid isPermaLink="true">https://lcz.me/post/66</guid><dc:creator><![CDATA[starryskyknight]]></dc:creator><pubDate>Mon, 04 May 2026 02:54:13 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 01:32:32 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/starryskyknight" aria-label="Profile: starryskyknight">@<bdi>starryskyknight</bdi></a> 你预算都不说，a100最好</p>
]]></description><link>https://lcz.me/post/61</link><guid isPermaLink="true">https://lcz.me/post/61</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 04 May 2026 01:32:32 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 01:32:10 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/%E5%88%98%E6%B5%B7%E5%BD%AC" aria-label="Profile: 刘海彬">@<bdi>刘海彬</bdi></a> 可能吧，我暂时没遇到，你是不是用了q4ks? Kv怎么量化的？</p>
]]></description><link>https://lcz.me/post/60</link><guid isPermaLink="true">https://lcz.me/post/60</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Mon, 04 May 2026 01:32:10 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Mon, 04 May 2026 00:44:49 GMT]]></title><description><![CDATA[<p dir="auto">我目前使用rtx3090 跑qwen3.6 27B Q4量化，给hermes用基本可以的，就是有时候偶发工具调用死循环，我已经在hermes的人设内容限制很死了，概率降低了很多，但是偶尔还是会，我感觉是模型问题了。</p>
]]></description><link>https://lcz.me/post/57</link><guid isPermaLink="true">https://lcz.me/post/57</guid><dc:creator><![CDATA[刘海彬]]></dc:creator><pubDate>Mon, 04 May 2026 00:44:49 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Sun, 03 May 2026 19:34:22 GMT]]></title><description><![CDATA[<p dir="auto"><a class="plugin-mentions-user plugin-mentions-a" href="/user/terry" aria-label="Profile: terry">@<bdi>terry</bdi></a> 海外我的用途 最推荐的是哪张卡啊? 我看您比较推荐nvidia的生态</p>
]]></description><link>https://lcz.me/post/52</link><guid isPermaLink="true">https://lcz.me/post/52</guid><dc:creator><![CDATA[starryskyknight]]></dc:creator><pubDate>Sun, 03 May 2026 19:34:22 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Sun, 03 May 2026 17:21:47 GMT]]></title><description><![CDATA[<p dir="auto">Claude Opus 4.5时代最好用，最近幻觉太多，minimax也不错，但是deepseek v4 flash便宜。本地就只有qwen 27b，但是速度远不如在线。</p>
]]></description><link>https://lcz.me/post/49</link><guid isPermaLink="true">https://lcz.me/post/49</guid><dc:creator><![CDATA[墙内人]]></dc:creator><pubDate>Sun, 03 May 2026 17:21:47 GMT</pubDate></item><item><title><![CDATA[Reply to 3090还是3090 *2+NVLink on Sun, 03 May 2026 17:10:46 GMT]]></title><description><![CDATA[<p dir="auto">我不用Deepseek V4 Pro，我都是用的Flash，跑Agent不需要那么大参数，280b都超标了，事实上Qwen3.6 27b可以完成绝大多数工作。它的问题是本地模型的工具链没有云端丰富。但是可以用V4 Flash作为fallback参数，本地不行就调用它。它执行完毕之后形成skills，本地模型再跑就可以了。你换成Qwen3.6 27b+Deepseek V4 Flash不会有多大差距。Hermes不太吃模型自身能力，它的harness做的不错。</p>
<p dir="auto">关于显卡，一张卡和两张卡+NVLink差距当然大，两张TP算力和显存都翻倍，减去框架开销也有1.8倍左右。3090单卡就够了，你多研究下Turboquant mtp dflash等技术，就一个turboquant搞定就够你玩了。</p>
<p dir="auto">现在不建议味了跑AI买5090，太贵了，你可以买个RTX Pro 4500 32G就够你用， 5000 48G， 6000 96G都是很好的选择。性能都够了，不会有啥便秘的感觉。5090烧接口，功耗太高这是基本无解的。它的溢价来自于游戏能力。</p>
]]></description><link>https://lcz.me/post/47</link><guid isPermaLink="true">https://lcz.me/post/47</guid><dc:creator><![CDATA[terry]]></dc:creator><pubDate>Sun, 03 May 2026 17:10:46 GMT</pubDate></item></channel></rss>