Shanghaojin personal page - To The Moon

Shanghaojin

No personal profile

0Follow

0Followers

0Topic

0Badge

2025-10-20

Share your opinion about this news…

Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China

Go to Tiger App to see more news

{"i18n":{"language":"en_US"},"userPageInfo":{"id":"4176773904226292","uuid":"4176773904226292","gmtCreate":1713670768659,"gmtModify":1713759560908,"name":"Shanghaojin","pinyin":"shanghaojin","introduction":"","introductionEn":"","signature":"","avatar":"https://community-static.tradeup.com/news/default-avatar.jpg","hat":null,"hatId":null,"hatName":null,"vip":1,"status":2,"fanSize":0,"headSize":0,"tweetSize":0,"questionSize":0,"limitLevel":999,"accountStatus":1,"level":{"id":0,"name":"","nameTw":"","represent":"","factor":"","iconColor":"","bgColor":""},"themeCounts":0,"badgeCounts":0,"badges":[],"moderator":false,"superModerator":false,"manageSymbols":null,"badgeLevel":null,"boolIsFan":false,"boolIsHead":false,"favoriteSize":0,"symbols":null,"coverImage":null,"realNameVerified":"init","userBadges":[],"userBadgeCount":0,"currentWearingBadge":null,"individualDisplayBadges":null,"crmLevel":1,"crmLevelSwitch":0,"location":null,"starInvestorFollowerNum":0,"starInvestorFlag":false,"starInvestorOrderShareNum":0,"subscribeStarInvestorNum":0,"ror":null,"winRationPercentage":null,"showRor":false,"investmentPhilosophy":null,"starInvestorSubscribeFlag":false},"baikeInfo":{},"tab":"post","tweets":[{"id":491301338562864,"gmtCreate":1760969270917,"gmtModify":1760971038953,"author":{"id":"4176773904226292","authorId":"4176773904226292","name":"Shanghaojin","avatar":"https://community-static.tradeup.com/news/default-avatar.jpg","crmLevel":1,"crmLevelSwitch":0,"followedFlag":false,"idStr":"4176773904226292","authorIdStr":"4176773904226292"},"themes":[],"htmlText":"Share your opinion about this news…","listText":"Share your opinion about this news…","text":"Share your opinion about this news…","images":[],"top":1,"highlighted":1,"essential":1,"paper":1,"likeSize":1,"commentSize":0,"repostSize":0,"link":"https://ttm.financial/post/491301338562864","repostId":"2576027086","repostType":2,"repost":{"id":"2576027086","kind":"highlight","weMediaInfo":{"introduction":"Stock Market Quotes, Business News, Financial News, Trading Ideas, and Stock Research by Professionals","home_visible":0,"media_name":"Benzinga","id":"1052270027","head_image":"https://static.tigerbbs.com/d08bf7808052c0ca9deb4e944cae32aa"},"pubTimestamp":1760930775,"share":"https://ttm.financial/m/news/2576027086?lang=en_US&edition=fundamental","pubTime":"2025-10-20 11:26","market":"us","language":"en","title":"Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China","url":"https://stock-news.laohu8.com/highlight/detail?id=2576027086","media":"Benzinga","summary":"Alibaba introduces Aegaeon, a computing pooling system reducing Nvidia GPU reliance by 82%.","content":"<html><head></head><body><p><strong><a href=\"https://laohu8.com/S/BABA\">Alibaba Group</a> Holding</strong> has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on <strong><a href=\"https://laohu8.com/S/NVDA\">Nvidia</a></strong> GPUs by 82% for AI models.</p><p>This innovation was tested in Alibaba Cloud&#39;s model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.</p><p>The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.</p><p>&#34;Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,&#34; the researchers stated in the paper.</p><p>Researchers from <strong>Peking University</strong> and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.</p><p>Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.</p><p>The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud&#39;s marketplace.</p><p>Cloud service providers like Alibaba Cloud and <strong>ByteDance</strong>‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.</p><p>This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company&#39;s chip sales to China.</p><p>Nvidia CEO <strong>Jensen Huang</strong> stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.</p><p>Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.</p></body></html>","collect":0,"html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\nAlibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China\n</h2>\n\n<h4 class=\"meta\">\n\n\n<div class=\"head\" \">\n\n\n<div class=\"h-thumb\" style=\"background-image:url(https://static.tigerbbs.com/d08bf7808052c0ca9deb4e944cae32aa);background-size:cover;\"></div>\n\n<div class=\"h-content\">\n<p class=\"h-name\">Benzinga </p>\n<p class=\"h-time\">2025-10-20 11:26</p>\n</div>\n\n</div>\n\n\n</h4>\n\n</header>\n<article>\n<html><head></head><body><p><strong><a href=\"https://laohu8.com/S/BABA\">Alibaba Group</a> Holding</strong> has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on <strong><a href=\"https://laohu8.com/S/NVDA\">Nvidia</a></strong> GPUs by 82% for AI models.</p><p>This innovation was tested in Alibaba Cloud&#39;s model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.</p><p>The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.</p><p>&#34;Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,&#34; the researchers stated in the paper.</p><p>Researchers from <strong>Peking University</strong> and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.</p><p>Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.</p><p>The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud&#39;s marketplace.</p><p>Cloud service providers like Alibaba Cloud and <strong>ByteDance</strong>‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.</p><p>This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company&#39;s chip sales to China.</p><p>Nvidia CEO <strong>Jensen Huang</strong> stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.</p><p>Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.</p></body></html>\n\n</article>\n</div>\n</body>\n</html>\n","type":0,"thumbnail":"","relate_stocks":{"BK4548":"巴美列捷福持仓","LU0345770993.USD":"NINETY ONE GSF GLOBAL STRATEGIC EQUITY \"A\" (USD) INC","LU0345768153.USD":"NINETY ONE GSF GLOBAL STRATEGIC MANAGED \"A\" (USD) ACC","LU2236285917.USD":"ALLIANZ GLOBAL INCOME \"AMG\" (USD) INC","LU1923623000.USD":"Natixis Thematics AI & Robotics Fund R/A USD","LU0949170772.SGD":"Blackrock Global Equity Income A6 SGD-H","LU0788109477.HKD":"BGF GLOBAL ALLOCATION \"A2\" (HKDHGD) ACC","IE00BJLML261.HKD":"HSBC GLOBAL EQUITY INDEX \"HCH\" (HKD) ACC","LU2125909247.SGD":"Natixis Thematics Meta H-R/A SGD","IE00BMPRXN33.USD":"NEUBERGER BERMAN 5G CONNECTIVITY \"A\" (USD) ACC","LU1267930730.SGD":"富兰克林美国机遇基金AS Acc SGD (CPF)","LU1244550494.USD":"FRANKLIN GLOBAL MULTI-ASSET INCOME \"A\" (USDHEDGED) ACC","LU1935042488.USD":"MANULIFE GF GLOBAL MULTI-ASSET DIVERSIFIED INCOME  \"AA\" (USD) INC","LU1674673428.USD":"HSBC GIF GLOBAL LOWER CARBON EQUITY \"AC\" (USD) ACC","LU0310799852.SGD":"FTIF - Templeton Global Equity Income A MDIS SGD","SG9999015341.SGD":"United Income Focus Trust Acc SGD-H","LU0417517546.SGD":"Allianz US Equity Cl AT Acc SGD","BABA":"阿里巴巴","LU0918141887.USD":"安联亚洲实际收益股票基金","NVDA":"英伟达","LU0957791311.USD":"THREADNEEDLE (LUX) GLOBAL FOCUS \"ZU\" (USD) ACC","LU2471134879.HKD":"INVESCO GLOBAL EQUITY INCOME ADVANTAGE \"A\" (HKD) INC","BK4526":"热门中概股","SG9999001424.SGD":"United E-Commerce Fund SGD","LU0345769128.USD":"NINETY ONE GSF GLOBAL EQUITY \"A\" (USD) ACC","LU0289961442.SGD":"SUSTAINABLE GLOBAL THEMATIC PORTFOLIO \"AX\" (SGD) ACC","LU0289960550.SGD":"AB FCP I - GLOBAL EQUITY BLEND PORTFOLIO 'A' (SGD) ACC","IE00B1BXHZ80.USD":"Legg Mason ClearBridge - US Appreciation A Acc USD","LU0267386448.USD":"FIDELITY FIRST ALL COUNTRY WORLD \"A\" (USD) INC","LU1064131342.USD":"Fullerton Lux Funds - Global Absolute Alpha A Acc USD","LU0354030511.USD":"ALLSPRING  U.S. LARGE CAP GROWTH \"I\" (USD) ACC","LU2750360997.AUD":"INVESCO GLOBAL EQUITY INCOME ADVANTAGE \"A\" (AUDHDG) INC","IE00BZ1G4Q59.USD":"LEGG MASON CLEARBRIDGE US EQUITY SUSTAINABILITY LEADER \"A\"(USD) INC (A)","LU0354030438.USD":"富国美国大盘成长基金Cl A Acc"},"source_url":"https://www.benzinga.com/markets/tech/25/10/48293177/alibaba-clouds-new-system-cuts-nvidia-gpu-usage-by-82-amid-trumps-flip-flop-on-ai-chip-ban-on-china","is_english":true,"share_image_url":"https://static.laohu8.com/e9f99090a1c2ed51c021029395664489","article_id":"2576027086","content_text":"Alibaba Group Holding has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on Nvidia GPUs by 82% for AI models.This innovation was tested in Alibaba Cloud's model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.\"Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,\" the researchers stated in the paper.Researchers from Peking University and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud's marketplace.Cloud service providers like Alibaba Cloud and ByteDance‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company's chip sales to China.Nvidia CEO Jensen Huang stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.","news_type":1,"symbols_score_info":{"NVDA":1.1,"BABA":1.1}},"isVote":1,"tweetType":1,"viewCount":294,"authorTweetTopStatus":1,"verified":2,"comments":[],"imageCount":0,"langContent":"EN","totalScore":0}],"hots":[{"id":491301338562864,"gmtCreate":1760969270917,"gmtModify":1760971038953,"author":{"id":"4176773904226292","authorId":"4176773904226292","name":"Shanghaojin","avatar":"https://community-static.tradeup.com/news/default-avatar.jpg","crmLevel":1,"crmLevelSwitch":0,"followedFlag":false,"authorIdStr":"4176773904226292","idStr":"4176773904226292"},"themes":[],"htmlText":"Share your opinion about this news…","listText":"Share your opinion about this news…","text":"Share your opinion about this news…","images":[],"top":1,"highlighted":1,"essential":1,"paper":1,"likeSize":1,"commentSize":0,"repostSize":0,"link":"https://ttm.financial/post/491301338562864","repostId":"2576027086","repostType":2,"repost":{"id":"2576027086","kind":"highlight","weMediaInfo":{"introduction":"Stock Market Quotes, Business News, Financial News, Trading Ideas, and Stock Research by Professionals","home_visible":0,"media_name":"Benzinga","id":"1052270027","head_image":"https://static.tigerbbs.com/d08bf7808052c0ca9deb4e944cae32aa"},"pubTimestamp":1760930775,"share":"https://ttm.financial/m/news/2576027086?lang=en_US&edition=fundamental","pubTime":"2025-10-20 11:26","market":"us","language":"en","title":"Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China","url":"https://stock-news.laohu8.com/highlight/detail?id=2576027086","media":"Benzinga","summary":"Alibaba introduces Aegaeon, a computing pooling system reducing Nvidia GPU reliance by 82%.","content":"<html><head></head><body><p><strong><a href=\"https://laohu8.com/S/BABA\">Alibaba Group</a> Holding</strong> has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on <strong><a href=\"https://laohu8.com/S/NVDA\">Nvidia</a></strong> GPUs by 82% for AI models.</p><p>This innovation was tested in Alibaba Cloud&#39;s model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.</p><p>The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.</p><p>&#34;Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,&#34; the researchers stated in the paper.</p><p>Researchers from <strong>Peking University</strong> and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.</p><p>Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.</p><p>The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud&#39;s marketplace.</p><p>Cloud service providers like Alibaba Cloud and <strong>ByteDance</strong>‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.</p><p>This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company&#39;s chip sales to China.</p><p>Nvidia CEO <strong>Jensen Huang</strong> stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.</p><p>Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.</p></body></html>","collect":0,"html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>Alibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\nAlibaba Cloud's New System Cuts Nvidia GPU Usage By 82%, Amid Trump's Flip Flop On AI Chip Ban On China\n</h2>\n\n<h4 class=\"meta\">\n\n\n<div class=\"head\" \">\n\n\n<div class=\"h-thumb\" style=\"background-image:url(https://static.tigerbbs.com/d08bf7808052c0ca9deb4e944cae32aa);background-size:cover;\"></div>\n\n<div class=\"h-content\">\n<p class=\"h-name\">Benzinga </p>\n<p class=\"h-time\">2025-10-20 11:26</p>\n</div>\n\n</div>\n\n\n</h4>\n\n</header>\n<article>\n<html><head></head><body><p><strong><a href=\"https://laohu8.com/S/BABA\">Alibaba Group</a> Holding</strong> has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on <strong><a href=\"https://laohu8.com/S/NVDA\">Nvidia</a></strong> GPUs by 82% for AI models.</p><p>This innovation was tested in Alibaba Cloud&#39;s model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.</p><p>The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.</p><p>&#34;Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,&#34; the researchers stated in the paper.</p><p>Researchers from <strong>Peking University</strong> and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.</p><p>Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.</p><p>The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud&#39;s marketplace.</p><p>Cloud service providers like Alibaba Cloud and <strong>ByteDance</strong>‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.</p><p>This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company&#39;s chip sales to China.</p><p>Nvidia CEO <strong>Jensen Huang</strong> stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.</p><p>Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.</p></body></html>\n\n</article>\n</div>\n</body>\n</html>\n","type":0,"thumbnail":"","relate_stocks":{"BK4548":"巴美列捷福持仓","LU0345770993.USD":"NINETY ONE GSF GLOBAL STRATEGIC EQUITY \"A\" (USD) INC","LU0345768153.USD":"NINETY ONE GSF GLOBAL STRATEGIC MANAGED \"A\" (USD) ACC","LU2236285917.USD":"ALLIANZ GLOBAL INCOME \"AMG\" (USD) INC","LU1923623000.USD":"Natixis Thematics AI & Robotics Fund R/A USD","LU0949170772.SGD":"Blackrock Global Equity Income A6 SGD-H","LU0788109477.HKD":"BGF GLOBAL ALLOCATION \"A2\" (HKDHGD) ACC","IE00BJLML261.HKD":"HSBC GLOBAL EQUITY INDEX \"HCH\" (HKD) ACC","LU2125909247.SGD":"Natixis Thematics Meta H-R/A SGD","IE00BMPRXN33.USD":"NEUBERGER BERMAN 5G CONNECTIVITY \"A\" (USD) ACC","LU1267930730.SGD":"富兰克林美国机遇基金AS Acc SGD (CPF)","LU1244550494.USD":"FRANKLIN GLOBAL MULTI-ASSET INCOME \"A\" (USDHEDGED) ACC","LU1935042488.USD":"MANULIFE GF GLOBAL MULTI-ASSET DIVERSIFIED INCOME  \"AA\" (USD) INC","LU1674673428.USD":"HSBC GIF GLOBAL LOWER CARBON EQUITY \"AC\" (USD) ACC","LU0310799852.SGD":"FTIF - Templeton Global Equity Income A MDIS SGD","SG9999015341.SGD":"United Income Focus Trust Acc SGD-H","LU0417517546.SGD":"Allianz US Equity Cl AT Acc SGD","BABA":"阿里巴巴","LU0918141887.USD":"安联亚洲实际收益股票基金","NVDA":"英伟达","LU0957791311.USD":"THREADNEEDLE (LUX) GLOBAL FOCUS \"ZU\" (USD) ACC","LU2471134879.HKD":"INVESCO GLOBAL EQUITY INCOME ADVANTAGE \"A\" (HKD) INC","BK4526":"热门中概股","SG9999001424.SGD":"United E-Commerce Fund SGD","LU0345769128.USD":"NINETY ONE GSF GLOBAL EQUITY \"A\" (USD) ACC","LU0289961442.SGD":"SUSTAINABLE GLOBAL THEMATIC PORTFOLIO \"AX\" (SGD) ACC","LU0289960550.SGD":"AB FCP I - GLOBAL EQUITY BLEND PORTFOLIO 'A' (SGD) ACC","IE00B1BXHZ80.USD":"Legg Mason ClearBridge - US Appreciation A Acc USD","LU0267386448.USD":"FIDELITY FIRST ALL COUNTRY WORLD \"A\" (USD) INC","LU1064131342.USD":"Fullerton Lux Funds - Global Absolute Alpha A Acc USD","LU0354030511.USD":"ALLSPRING  U.S. LARGE CAP GROWTH \"I\" (USD) ACC","LU2750360997.AUD":"INVESCO GLOBAL EQUITY INCOME ADVANTAGE \"A\" (AUDHDG) INC","IE00BZ1G4Q59.USD":"LEGG MASON CLEARBRIDGE US EQUITY SUSTAINABILITY LEADER \"A\"(USD) INC (A)","LU0354030438.USD":"富国美国大盘成长基金Cl A Acc"},"source_url":"https://www.benzinga.com/markets/tech/25/10/48293177/alibaba-clouds-new-system-cuts-nvidia-gpu-usage-by-82-amid-trumps-flip-flop-on-ai-chip-ban-on-china","is_english":true,"share_image_url":"https://static.laohu8.com/e9f99090a1c2ed51c021029395664489","article_id":"2576027086","content_text":"Alibaba Group Holding has introduced a new computing pooling system called Aegaeon, which dramatically reduces the reliance on Nvidia GPUs by 82% for AI models.This innovation was tested in Alibaba Cloud's model marketplace for over three months, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.The Aegaeon system has successfully decreased the number of Nvidia H20 GPUs required from 1,192 to just 213 for serving models with up to 72 billion parameters.\"Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,\" the researchers stated in the paper.Researchers from Peking University and Alibaba Cloud emphasized the high costs associated with serving concurrent large language model workloads.Alibaba Cloud, the AI and cloud services division of the Hangzhou-based Alibaba, aims to boost efficiency by pooling GPU resources, allowing a single GPU to support multiple models.The system addresses resource inefficiency, as previously, 17.7% of GPUs were allocated to serve only 1.35% of requests in Alibaba Cloud's marketplace.Cloud service providers like Alibaba Cloud and ByteDance‘s Volcano Engine manage thousands of AI models simultaneously, often leading to inefficiencies. The Aegaeon system seeks to optimize this process by reducing the number of GPUs needed.This development comes amid growing concerns over Nvidia’s presence in China. Recently, China raised security concerns about Nvidia’s H20 chips, especially regarding potential backdoor risks. As part of its deal with Nvidia, the Trump administration has struck an agreement for a 15% revenue share from the company's chip sales to China.Nvidia CEO Jensen Huang stated that Nvidia’s market share in China has plummeted from 95% to zero. He expressed concerns over the impact of U.S. policies on Nvidia’s market presence in China.Despite these challenges, Nvidia has financially insulated itself from potential escalations, as its guidance assumes zero revenue from China, according to Huang.","news_type":1,"symbols_score_info":{"NVDA":1.1,"BABA":1.1}},"isVote":1,"tweetType":1,"viewCount":294,"authorTweetTopStatus":1,"verified":2,"comments":[],"imageCount":0,"langContent":"EN","totalScore":0}],"lives":[]}