Although Google and OpenAI might seem like the frontrunners in natural language processing (NLP) and artificial intelligence (AI), the rapid growth of open-source projects is becoming an increasingly significant threat to their dominance. Open-source models have accelerated at a remarkable pace, achieving impressive results with smaller budgets and shorter timeframes than their more established counterparts.
Some notable achievements and developments in the open-source AI community include:
- The leak of LLaMA, an open-source foundation model, which led to rapid community-driven innovation in fine-tuning and applications.
- Stanford’s Alpaca project, which introduced low-rank adaptation (LoRA) for faster and more affordable fine-tuning of models on a single GPU.
- Successful deployment of open-source models on low-power devices like Raspberry Pi and MacBook CPUs, thanks to minification efforts and techniques like 4-bit quantization.
- The development of Vicuna, a 13B open-source model with performance comparable to ChatGPT at a fraction of the training cost.
- The use of small, highly curated datasets in open-source projects, leading to more efficient and effective AI models.
- Training the GPT-3 architecture from scratch by Cerebras, using Chinchilla’s optimal compute schedule and μ-parameterization, making the community independent of LLaMA.
- Development of multimodal models like LLaMA-Adapter, which can be fine-tuned for instruction tuning and multimodality within just one hour of training.
These achievements highlight the power of collective innovation and the rapid pace of progress in the open-source community. The affordability, accessibility, and customizability of open-source models make them increasingly attractive to users and developers, potentially undermining the value proposition of proprietary AI models developed by Google and OpenAI.
To adapt to this fast-growing open-source environment and stay competitive, Google and OpenAI should consider the following recommendations:
- Collaborate with and learn from open-source projects by enabling third-party integrations.
- Reevaluate their value proposition, as users may be less inclined to pay for restricted models when free, unrestricted alternatives are available.
- Focus on rapid iteration with smaller models, as they can be more quickly improved and adapted to user needs.
Instead of competing with open-source projects, both organizations should embrace them, establishing themselves as leaders in the open-source community. By cooperating with and learning from the broader conversation, Google and OpenAI can continue driving AI advancements while benefiting from the collective knowledge and creativity of the open-source community. This may involve taking some uncomfortable steps, like publishing model weights for smaller variants, but embracing community-driven innovation can ultimately prove beneficial in the long run.