LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

LLM-jp, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata

2024-07-08

Summary

This paper introduces LLM-jp, a cross-organizational project to develop strong, open-source large language models (LLMs) for the Japanese language. More than 1,500 participants from academia and industry are working on it together.

What's the problem?

The main problem is that many existing language models support Japanese poorly: Japanese text makes up only a small percentage of the training data of popular models like GPT-3. In addition, developing these models typically requires substantial resources and expertise, which are often concentrated in a few large organizations. This can limit access and innovation in building effective Japanese language models.

What's the solution?

To address these issues, the authors established LLM-jp, which brings together researchers and industry professionals to develop Japanese LLMs collaboratively. The project emphasizes transparency and openness, sharing all aspects of its work, including models, data, and methodologies. It has formed several working groups to tackle different challenges in building these models and has already released multiple model versions for public use.

Why it matters?

This research is important because it aims to enhance the capabilities of AI in understanding and generating Japanese text. By developing open-source models, LLM-jp can foster innovation and accessibility in AI technology for the Japanese-speaking community. This could lead to better applications in education, business, and everyday communication for people who use the Japanese language.

Abstract

This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/.
