American company wants to recreate DeepSeek-R1 and make it fully open source

Dmytro Dzhuhalyk, news writer at Mezha.Media. I write about what I'm actively passionate about: technology, games, and film.

29 January, 04:37 PM

Leandro von Werra, Head of Research at Hugging Face, along with several other engineers at the company, launched the Open-R1 project, which aims to create a replica of DeepSeek's R1 model. Among other things, the project aims to make public the data used for training.

DeepSeek's R1 model is technically already "open source" and anyone can use it without any restrictions. However, this model does not fall under the generally accepted definition of open source software.

Researchers from Hugging Face told TechCrunch that hidden elements make it difficult to replicate the model and study it further. In particular, the Chinese startup does not disclose the training dataset, details of experiments, or intermediate models.

The American company plans to create a replica of R1 within a few weeks. For this purpose, it will use a dedicated research server, the Science Cluster, equipped with 768 NVIDIA H100 GPUs.

Another goal of Hugging Face with Open-R1 is that, if the project succeeds, users and developers will be able to build the next generations of reasoning LLMs, including open-source ones.

