Image by Author
The LLaMA project encompasses a set of foundational language models that vary in size from 7 billion to 65 billion parameters. These models were training on millions of tokens, and it was training on publicly available datasets exclusively. As a result, LLaMA-13B outperforms GPT-3 (175B), and LLaMA-65B is performing similarly to the best models like Chinchilla-70B and PaLM-540B.
Image from LLaMA
Resources:
Stanford Alpaca claims that it can compete with ChatGPT and anyone can reproduce it in less than 600$. The Alpaca 7B is finetuned from the LLaMA 7B model on 52K instruction-following demonstrations.
Training recipe | Image from Stanford CRFM
Resources:
Vicuna is finetuned from the LLaMA model on user-shared conversations collected from ShareGPT. The model Vicuna-13B has achieved more than 90%* quality of OpenAI ChatGPT and Google Bard. It has also outperformed LLaMA and Stanford Alpaca models in 90% of cases. The cost of training Vicuna was around 300$.
Image from Vicuna
Resources:
OpenChatKit: Open-Source ChatGPT Alternative is a complete tools kit for creating your chatbot. It provides instruction for training your own Instruction-tuned large language model, fine-tuning the model, extensible retrieval system for updating the bot response, and bot moderation for filtering out questions.
Image from TOGETHER
As we can see, the GPT-NeoXT-Chat-Base-20B model has outperformed base mode GPT-NoeX on question and answer, extraction, and classification tasks.
Resources:
GPT4ALL is a community-driven project and was trained on a massive curated corpus of assistant interactions, including code, stories, depictions, and multi-turn dialogue. The team has provided datasets, model weights, data curation process, and training code to promote open-source. Furthermore, they have released quantized 4-bit versions of the model that can run on your laptop. You can even use a Python client to run the model inference.
Gif from GPT4ALL
Resources:
Raven RWKV 7B is an open-source chatbot that is powered by the RWKV language model that produces similar results to ChatGPT. The model uses RNNs that can match transformers in quality and scaling while being faster and saving VRAM. The Raven was fine-tuned on Stanford Alpaca, code-alpaca, and more datasets.
Image from Raven RWKV 7B
Resources:
OPT: Open Pre-trained Transformer Language Models is not great as ChatGPT, but it has shown remarkable capabilities for zero- and few-shot learning and Stereotypical Bias analysis. You can also integrate it with Alpa, Colossal-AI, CTranslate2, and FasterTransformer to get even better results.
Note: It is on the list because of its popularity, as it has 624,710 monthly downloads in the text generation category.
Image from (arxiv.org)
Resources:
Flan-T5-XXL fine-tuned T5 models on a collection of datasets phrased as instructions. The instruction fine-tuning dramatically improves performance on a variety of model classes such as PaLM, T5, and U-PaLM. The Flan-T5-XXL model is fine-tuned on more than 1000 additional tasks covering also more languages.
Image from Flan-T5-XXL
Resources:
There are many open-source options available, and I have mentioned popular ones. The open-source chatbots and models are getting better, and in the next few months, you will see a new model that can completely overtake ChatGPT in terms of performance.
In this blog, I have provided a list of models/chatbot frameworks that can help you train and build chatbots similar to ChatGPT and GPT-4. Don’t forget to give them likes and stars.
Do let me know if you have better suggestions in the comment section. I would love to add it in the future.
Abid Ali Awan (@1abidaliawan) is a certified data scientist professional who loves building machine learning models. Currently, he is focusing on content creation and writing technical blogs on machine learning and data science technologies. Abid holds a Master’s degree in Technology Management and a bachelor’s degree in Telecommunication Engineering. His vision is to build an AI product using a graph neural network for students struggling with mental illness.
- SEO Powered Content & PR Distribution. Get Amplified Today.
- Platoblockchain. Web3 Metaverse Intelligence. Knowledge Amplified. Access Here.
- Source: https://www.kdnuggets.com/2023/04/8-opensource-alternative-chatgpt-bard.html?utm_source=rss&utm_medium=rss&utm_campaign=8-open-source-alternative-to-chatgpt-and-bard
- :is
- 7
- 8
- a
- achieved
- Additional
- AI
- alternative
- analysis
- and
- answer
- anyone
- ARE
- around
- AS
- Assistant
- available
- base
- because
- being
- BEST
- Better
- bias
- Billion
- Blog
- blogs
- Bot
- build
- Building
- by
- CAN
- capabilities
- cases
- Category
- Certified
- chatbot
- chatbots
- ChatGPT
- claims
- classes
- classification
- client
- code
- collection
- comment
- Community Driven
- compete
- complete
- completely
- content
- content-creation
- conversations
- Cost
- covering
- Creating
- creation
- curated
- curation
- Currently
- data
- data science
- data scientist
- datasets
- Degree
- dialogue
- Dont
- downloads
- dramatically
- encompasses
- Engineering
- Even
- exclusively
- extraction
- faster
- few
- filtering
- focusing
- For
- frameworks
- from
- Furthermore
- future
- generation
- get
- getting
- gif
- Give
- graph
- Graph Neural Network
- great
- Have
- help
- holds
- HTML
- HTTPS
- i
- illness
- image
- improves
- in
- Including
- instructions
- integrate
- interactions
- IT
- ITS
- jpg
- KDnuggets
- kit
- Know
- language
- Languages
- laptop
- large
- learning
- like
- List
- Llama
- love
- machine
- machine learning
- management
- many
- massive
- master
- Match
- mental
- Mental illness
- mentioned
- millions
- Mode
- model
- models
- moderation
- monthly
- months
- more
- network
- Neural
- neural network
- New
- next
- of
- on
- open
- open source
- OpenAI
- Options
- Outperforms
- own
- palm
- parameters
- performance
- performing
- plato
- Plato Data Intelligence
- PlatoData
- Popular
- popularity
- powered
- process
- Product
- professional
- project
- promote
- provided
- provides
- publicly
- Python
- quality
- question
- Questions
- recipe
- released
- remarkable
- response
- result
- Results
- Run
- s
- saving
- scaling
- Science
- Scientist
- Section
- set
- shown
- similar
- Similarly
- Size
- stanford
- Stars
- Stories
- Struggling
- Students
- such
- system
- tasks
- team
- Technical
- Technologies
- Technology
- telecommunication
- terms
- text generation
- that
- The
- The Future
- Them
- These
- to
- together
- Tokens
- tools
- Train
- trained
- Training
- transformers
- updating
- use
- variety
- VICUNA
- vision
- while
- WHO
- will
- with
- would
- writing
- Your
- zephyrnet