<tt id="6hsgl"><pre id="6hsgl"><pre id="6hsgl"></pre></pre></tt>
          <nav id="6hsgl"><th id="6hsgl"></th></nav>
          国产免费网站看v片元遮挡,一亚洲一区二区中文字幕,波多野结衣一区二区免费视频,天天色综网,久久综合给合久久狠狠狠,男人的天堂av一二三区,午夜福利看片在线观看,亚洲中文字幕在线无码一区二区
          Global EditionASIA 中文雙語Fran?ais
          Business
          Home / Business / Technology

          Why is Chinese AI startup DeepSeek stirring up the tech world?

          Xinhua | Updated: 2025-01-31 20:21
          Share
          Share - WeChat

          BEIJING -- The artificial intelligence (AI) community is abuzz with excitement over DeepSeek-R1, a new open-source model developed by Chinese startup DeepSeek.

          Released on Jan 20, it quickly soared to the top of Apple's app store's free charts by Monday, surpassing OpenAI's ChatGPT.

          According to DeepSeek, in tasks such as mathematics, coding and natural language reasoning, the performance of this model is comparable to the leading models from heavyweights like OpenAI, but only at a fraction of the cash and computing power of its competitors.

          Here's what DeepSeek has done and why it is taking the AI industry by surprise.

          WHAT IS DEEPSEEK?

          Officially known as DeepSeek Artificial Intelligence Fundamental Technology Research Co, Ltd, the firm was founded in July 2023. As an innovative technology startup, DeepSeek is dedicated to developing cutting-edge large language models (LLMs) and related technologies.

          Since its first model "DeepSeek LLM" released in January last year, the company has undergone multiple rounds of iteration. In December, the startup launched its open-source LLM "V3," which overtook all of Meta's open-source LLMs and rivaled OpenAI's closed-source GPT4-o, according to US media reports.

          The just-released model R1 has achieved an important technological breakthrough -- using pure deep learning methods to allow AI to spontaneously emerge with reasoning capabilities.

          Unlike traditional approaches like Chain-of-Thought (CoT) and Supervised Fine-Tuning (SFT), DeepSeek has distinguished itself in the AI industry by adopting Reinforcement Learning (RL) as a core training method.

          While CoT and SFT rely on step-by-step reasoning and huge amounts of labeled data, respectively, RL enables models to learn through interaction and reward mechanisms, making it better suited for complex and dynamic tasks.

          The adoption of RL has allowed DeepSeek to enhance its models' reasoning, adaptability and efficiency, setting it apart as a frontrunner in the field.

          When queried about the meaning of "DeepSeek," its latest R1 chatbot replied, "The name reflects the company's mission to deeply explore and advance the foundational technologies of AI, aiming to push the boundaries of AI innovation and application."

          "BIGGER IS NO LONGER ALWAYS SMARTER"

          According to its V3 model technical report, DeepSeek's manufacturing cost is approximately $5.57 million, making it the least expensive among LLMs.

          Renowned US economist Jeffrey Sachs, a professor and director of the Center for Sustainable Development at Columbia University, told Xinhua that the breakthrough made by DeepSeek shows the possibility of advanced AI at much lower costs than was widely believed in the United States.

          DeepSeek-V3 makes it "look easy today with an open weights release of a frontier-grade LLM trained on a joke of a budget (2,048 GPUs for 2 months, $6M)," posted Andrej Karpathy, a founding member of OpenAI, on X.

          Compared to other well-known models, DeepSeek achieved an order-of-magnitude reduction of cost.

          The cost is "a stark contrast to the hundreds of millions, if not billions, that US companies typically invest in similar technologies," said Marc Andreessen, a prominent tech investor, depicting DeepSeek's R1 as "one of the most amazing breakthroughs" he had ever seen.

          The AI industry development has long relied on piling up computing power. The cost-efficient DeepSeek model may upend the AI landscape.

          Praising the DeepSeek-V3 Technical Report as "very nice and detailed," Karpathy said that the report is worthy of reading through.

          US investment bank and financial service provider Morgan Stanley believed that DeepSeek demonstrates an alternative path to efficient model training than the current arm's race among hyperscalers by significantly increasing the data quality and improving the model architecture.

          "Bigger is no longer always smarter," it said.

          OPEN-SOURCE MODEL

          "To see the DeepSeek new model, it's super impressive in terms of both how they have really effectively done an open-source model that does this inference-time compute, and is super-compute efficient," said Microsoft CEO Satya Nadella.

          Open source allows researchers, developers and users to access the model's underlying code and its "weights" -- the parameters that determine how the model processes information -- enabling them to use, modify or enhance the model to suit their needs.

          DeepSeek has greatly benefited from open-source principles and, in turn, demonstrates a strong commitment to sharing knowledge and contributing to the collective advancement of technology.

          Meta's chief AI scientist Yann LeCun said: "They came up with new ideas and built them on top of other people's work. Because their work is published and open source, everyone can profit from it."

          "That is the power of open research and open source," LeCun added.

          Echoing LeCun, Sachs, the US economist, said, "DeepSeek's business and development model is open source, which is a compelling and successful model for science, technology and business."

          While OpenAI initially started as an open-source organization but later shifted to a closed-source model, DeepSeek has taken a different path.

          Highlighting the importance of fostering collaboration and innovation through open-source principles, Liang Wenfeng, the founder of DeepSeek, said that building a robust technological ecosystem is the priority.

          "We won't choose closed-source," Liang said.

          Top
          BACK TO THE TOP
          English
          Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
          License for publishing multimedia online 0108263

          Registration Number: 130349
          FOLLOW US
          CLOSE
           
          主站蜘蛛池模板: 无码精品国产VA在线观看DVD| 韩国青草无码自慰直播专区| 91毛片网| 久久er99热精品一区二区| 男人av无码天堂| 国产成人福利在线视频播放下载| 免费男人j桶进女人p无遮挡动态图 | 久久丁香五月天综合网| 麻豆精品一区二区视频在线| 精品无码国产污污污免费| 日韩国产亚洲一区二区三区| 亚洲中文字幕无码专区| 亚洲免费观看一区二区三区| 在线观看91精品国产不卡| 成人无码一区二区三区网站| 伊人天天久大香线蕉av色| 福利视频在线一区二区| 少妇久久久被弄到高潮| 91免费精品国偷自产在线在线| 午夜毛片精彩毛片| 老太大性另类xxxⅹ| 黑人巨大亚洲一区二区久| 亚洲一本二区偷拍精品| 熟妇女人妻丰满少妇中文字幕| 国产成人精品永久免费视频| 69成人免费视频无码专区| 在线免费观看毛片av| 九九热在线精品视频免费| 亚洲国产成人无码影院| 亚洲男人的天堂久久香蕉| 国产亚洲精品国产福利在线观看| 欧美性猛交xxxx乱大交极品| 国产高在线精品亚洲三区| 成 人色 网 站 欧美大片 | 婷婷狠狠综合五月天| 日韩福利片午夜免费观着| 亚洲精品无码日韩国产不卡av| 免费又黄又爽又猛的毛片| 天天躁日日躁狠狠躁| 久久精品人人做人人爽电影蜜月| 国产精品国产精品偷麻豆|