badge icon

This article was automatically translated from the original Turkish version.

Article

DeepSeek R1 is a large language model (LLM) and artificial intelligence based chat assistant released on 20 January 2025. It was developed by the China-based DeepSeek. The model has been released with open source code.


DeepSeek logo - Arbisoft


DeepSeek R1 was developed by DeepSeek, a company supported by the High-Flyer Capital Management fund. It is reported that 2,000 Nvidia chips were used during the model’s development and that its production cost was approximately $5.6 million. This situation has drawn attention in the industry as an artificial intelligence model trained with fewer chips and at a lower cost.

Key Features of the Model

DeepSeek R1 is described as a model with “reasoning” capabilities. It is designed to deliver strong performance in mathematics, coding, and logic problems while possessing the general capabilities of large language models.

Open source: The fact that the core model of DeepSeek R1 is open source enables researchers and developers to conduct work on the model opportunity.

Low cost and efficient chip usage: It is reported that fewer chips were used in the development of this model compared to its competitors and that it was trained at a lower cost.

Natural language processing (NLP) capabilities: The model can be used for various natural language processing tasks such as language understanding, text generation, translation, and summarization like.

Development Process

DeepSeek R1 was developed as DeepSeek’s first large language model (LLM). The model was released on 20 January 2025. It was published with open source code, making it accessible to researchers and developers worldwide.


A total of 2,000 Nvidia chips were used in the development of DeepSeek R1. The total cost of training the model is reported to be approximately $5.6 million. This cost is recorded as significantly lower when compared to similar large language models in the industry.

While it is stated that OpenAI’s GPT-4 model required 16,000 chips and over $100 million in funding, DeepSeek R1’s development at a much lower cost has sparked new discussions in the field of artificial intelligence regarding efficiency.


DeepSeek R1 responds using its reasoning capability. - DeepSeek


Technical Specifications

Model Architecture and Technologies Used

DeepSeek R1 is a natural language processing (NLP) model belonging to the large language models (LLM) class. Although publicly available information about the model’s architecture and technical details is limited, it is stated to have a transformer-based structure.


It is claimed that DeepSeek R1 possesses reasoning capabilities comparable to OpenAI’s GPT-4, Google’s Gemini, and Meta’s LLaMA models. DeepSeek R1 is designed to use a step step-by-step reasoning approach to generate more consistent and logical outputs when solving complex problems.


Additionally, it is noted that the model includes optimizations for memory management and computational efficiency. Its ability to achieve high accuracy rates despite lower chip usage is made possible through optimized parameter management and a reduced model size.

Training Process and Datasets Used

Official and comprehensive information regarding the datasets used to train DeepSeek R1 has not been disclosed. It is stated that the model was trained on extensive multilingual datasets and programming languages. However, user reports have identified that due to its development in Türkiye, DeepSeek R1 operates with specific censorship mechanisms to comply with the Chinese government’s content policies.

Security and Cyber Attacks

Large-Scale Cyber Attacks Faced by DeepSeek

Following the release of DeepSeek R1, large-scale cyber attacks targeting its web-based services were reported on 27 January 2025. The company stated that these attacks were organized by malicious actors and posed serious security threats capable of disrupting service continuity.


After the attacks, a notification was published on the company’s official website stating: “Due to large-scale malicious attacks on DeepSeek services, we are temporarily restricting access to ensure service continuity.” No public clarification has been provided regarding the technical details of these attacks or the identities of the perpetrators.


DeepSeek’s web platform experienced access issues and outages due to the cyber attacks. According to user reports from January 2025, service slowdowns occurred in some regions while service was completely interrupted in others.


No restrictions were imposed on the mobile applications, and user numbers continued to rise in app stores. DeepSeek’s app has maintained its popularity in the Chinese and USA markets.

Market Impact

The release of DeepSeek R1 caused fluctuations in global technology stocks. US-based technology companies began questioning their competitive advantages due to the model’s low development cost and reduced chip usage. Following these developments, the Nasdaq index lost more than 3%, and Nvidia’s shares dropped 17%, resulting in a loss exceeding $50 billion in market value. Declines were also observed in the shares of semiconductor companies such as AMD, Qualcomm, and Micron Technology.

The model received significant user interest and became one of the most downloaded AI applications in the US and Chinese app stores. Apple It surpassed ChatGPT as the most downloaded app on the App Store and reached a broad user base on Google Play Store.

Global and Political Implications

Statements by US President Donald Trump and Other Leaders

The release of DeepSeek R1 and the resulting market fluctuations in US technology stocks have brought new discussions to the forefront in Washington regarding AI competition. US President Donald Trump described the release of DeepSeek R1 as “a wake-up call for the American AI sector.” Trump emphasized the need for large-scale government-supported investments to enable US companies to maintain technological leadership.


Microsoft CEO water Satya Nadella characterized DeepSeek’s efficiency as “super impressive” and stressed that the US must take China’s achievements in AI seriously. OpenAI CEO Sam Altman found DeepSeek R1’s low-cost development impressive but noted that the model’s long long-term sustainability must be tested.

Bibliographies

Anadolu Ajansı. "Nvidia Calls DeepSeek’s R1 Model an ‘Excellent AI Advancement.’" Anadolu Ajansı, January 27, 2025. https://www.aa.com.tr/en/economy/nvidia-calls-deepseek-s-r1-model-an-excellent-ai-advancement/3464061.

Anadolu Ajansı. "Teknoloji Hisseleri DeepSeek ile Sarsıldı." Anadolu Ajansı, January 27, 2025. https://www.aa.com.tr/tr/bilim-teknoloji/teknoloji-hisseleri-deepseek-ile-sarsildi/3463775.

Anadolu Ajansı. “Ucuza Mal Edilen Çinli Yapay Zeka Modeli DeepSeek Dünyanın Gündemine Oturdu.” Anadolu Ajansı, January 27, 2025. https://www.aa.com.tr/tr/bilim-teknoloji/ucuza-mal-edilen-cinli-yapay-zeka-modeli-deepseek-dunyanin-gundemine-oturdu/3464570.

BBC News. “DeepSeek: The Chinese AI App That Has the World Talking.” BBC News, January 28, 2025. https://www.bbc.com/news/articles/c5yv5976z9po.

CNN. “China Celebrates DeepSeek’s Breakout AI Success as Tech Race Heats Up.” CNN, January 28, 2025. https://edition.cnn.com/2025/01/28/china/china-deepseek-ai-success-tech-intl-hnk/index.html.

DeepSeek. "Official Website." Accessed January 29, 2025. https://www.deepseek.com/.

Reuters. "What Is DeepSeek and Why Is It Disrupting the AI Sector?" Reuters, January 27, 2025. https://www.reuters.com/technology/artificial-intelligence/what-is-deepseek-why-is-it-disrupting-ai-sector-2025-01-27/.

The Guardian. “Chinese AI Chatbot DeepSeek Censors Itself in Real Time, Users Report.” The Guardian, January 28, 2025. https://www.theguardian.com/technology/2025/jan/28/chinese-ai-chatbot-deepseek-censors-itself-in-realtime-users-report.

Author Information

Avatar
AuthorEdanur KarakoçDecember 25, 2025 at 8:19 AM

Tags

Discussions

No Discussion Added Yet

Start discussion for "DeepSeek R1" article

View Discussions

Contents

  • Key Features of the Model

  • Development Process

    • Model Architecture and Technologies Used

    • Training Process and Datasets Used

  • Security and Cyber Attacks

    • Large-Scale Cyber Attacks Faced by DeepSeek

  • Market Impact

  • Global and Political Implications

    • Statements by US President Donald Trump and Other Leaders

Ask to Küre