GPT Newsroom: AI Researchers Declare Ability to Double the Efficiency of Chatbots

The Future of AI Chatbots: Supercharging LLM Capabilities

Have you ever noticed that your AI chatbot get lost in the middle of a conversation, or it simply says it cannot handle prompts that are too long? Well, that is because each model has a limitation in its processing capabilities, and starts to suffer once it goes over that limit —pretty much like they suffered from some kind of a digital attention deficit disorder. But this could soon change thanks to a new method for supercharging LLM capabilities.

Expanding Context Capacities for LLMs

Current LLMs have limited context capacities. For example, ChatGPT taps just 8,000 tokens of context, while Claude handles 100,000. Tokens are the basic units of text or code used by an LLM AI to process and generate language. This restricts how much background information they can harness when formulating replies. Abacus AI has developed a method that allegedly doubles the usable context length for open-source LLMs like Meta’s Llama without compromising the model’s accuracy in practical application.

The Scaling Technique by Abacus AI

Their technique involves “scaling” the position embeddings that track word locations in input texts. According to their Github page, Abacus AI claims that its scaling method drastically increases the number of tokens that a model can handle.

Evaluating Scaled LlaMA Variants

The researchers evaluated two scaled LlaMA variants on tasks like substring location and open-book QA. The scale 16 model maintained accuracy on real-world examples up to 16,000-word contexts, versus only 2,000 words in baseline Llama. It even showed some coherence at 20,000+ words, something that was not possible to achieve with just fine-tuning techniques.

The Significance of Context Extension

The significance of context extension cannot be overstated. A narrow context window makes the model accurate but not really usable in complex tasks that require some background. Conversely, with an expanded context, LLMs can process and generate better responses but either take more time to do so or return sup-par results. Handling longer contexts efficiently could enable LLMs to absorb whole documents or multiple documents as background when generating text. This may lead to outputs that are more knowledge-grounded and consistent across long conversations.

Fine-Tuning Strategies and Future Work

However, the gains are not perfectly proportional to the scale factors. It’s still necessary to fine-tune strategies because scaling alone doesn’t guarantee high-quality outputs. The Abacus team is also exploring advanced position encoding schemes from recent papers to further extend context capacity.

Scaling Up Existing LLMs

Their work suggests that scaling up existing LLMs is a viable path to expanding usable context length. This could democratize access to Large Language Models capable of handling lots of context at once.

Open Source Access

Abacus AI has opened the doors of their repository “for research purposes only,” sharing code specific to their fine-tuning projects. This makes it possible to further iterate on its development and apply the fine-tuning methods on virtually any open source Large Language Model.

Next-Generation AI Assistants

With applications from personalized chatbots to creative writing aids, more memory-empowered LLMs could soon enable next-generation AI assistants that are conversant across diverse topics. For now, researchers are progressing rapidly to overcome technical constraints in pursuit of artificial general intelligence —meaning, generalized human cognitive abilities in an AI model. Maybe someday our digital friends will handle as many tabs as we humans can, but without the headache!

Editor Notes

It’s fascinating to see how advancements in AI are expanding the capabilities of chatbots and language models. Abacus AI’s method of scaling LLMs to handle longer contexts opens up new possibilities for more knowledgeable and coherent conversations. As we continue to push the boundaries of AI, we may soon witness the rise of AI assistants that can understand and engage in complex discussions across various domains. This is yet another step towards achieving artificial general intelligence. Keep up with the latest developments in the world of AI by visiting the GPT News Room.

Source link

from GPT News Room https://ift.tt/BGDRze5

GPT Newsroom

Friday, 4 August 2023

AI Researchers Declare Ability to Double the Efficiency of Chatbots

The Future of AI Chatbots: Supercharging LLM Capabilities

Expanding Context Capacities for LLMs

The Scaling Technique by Abacus AI

Evaluating Scaled LlaMA Variants

The Significance of Context Extension

Fine-Tuning Strategies and Future Work

Scaling Up Existing LLMs

Open Source Access

Next-Generation AI Assistants

Editor Notes

No comments:

Post a Comment

語言AI模型自稱為中國國籍，中研院成立風險研究小組對其進行審查【熱門話題】-20231012

Report Abuse

Labels