Tuesday 26 September 2023

“Why I Resigned from OpenAI: An Insightful Interview with the Former Anthropomorphic CEO”

Dario Amodei left OpenAI, the organization where he helped create GPT-2 and contributed to research on language models. He departed to establish Anthropic in 2021, along with his sister Daniela and former colleagues from OpenAI. Anthropic has emerged as a significant competitor in the field, securing substantial investments from major players like Google, Salesforce, and Amazon. In an interview conducted by Jeremy Kahn at Fortune’s Brainstorm Tech conference in July, Amodei explained the concerns he had with OpenAI that motivated him to start Anthropic. He also introduced Claude, a self-governing chatbot developed by Anthropic, which is capable of reading an entire novel within a minute. To gain further insights, watch the video or read the transcript below.

**Building a More Trusted Model: The Birth of Anthropic**

During their tenure at OpenAI, Amodei and his team firmly believed in two critical ideas that set them apart from their peers. Firstly, they were convinced that by investing more computational power into language models, their performance would continue to improve indefinitely. This view has now gained wider acceptance in the field. Secondly, they recognized the importance of alignment and safety measures alongside scaling up models. They understood that merely increasing computational power wouldn’t suffice and that additional mechanisms were required to guide the models in a desired direction. Galvanized by their shared beliefs, this group of like-minded individuals decided to form their own company, Anthropic, to pursue these fundamental principles.

**Introducing Claude: The Controlled and Safe Chatbot**

Claude, Anthropic’s chatbot, was purposefully designed with safety and controllability considerations from the very beginning. Numerous enterprise customers have shown interest in Claude due to its ability to ensure predictable behavior and avoid generating false information. One of the key concepts behind Claude is the implementation of constitutional AI. Unlike traditional chatbots built using reinforcement learning from human feedback, Claude operates on the basis of explicitly defined principles. This approach enables greater transparency and control, making the model safer to interact with.

**The Power of Context with Claude**

Claude boasts an impressive context window, allowing it to process and analyze large amounts of text. Specifically, the model can handle up to 100K Tokens, which corresponds to approximately 75,000 words, or the length of a short book. This capability empowers users to engage in conversations with Claude as if they were interacting with a knowledgeable book, making it uniquely valuable in various scenarios.

**Witnessing Claude in Action**

A brief clip showcases Claude in action, embodying the role of a business analyst. In the demonstration, a document called “Netflix10k.txt,” which contains Netflix’s 10K filing, is uploaded for analysis. Claude is then prompted to provide a summary of the key aspects highlighted within the balance sheet. With impressive efficiency, Claude extracts vital information such as changes in assets, liabilities, and stakeholders’ equity, offering a concise overview of the company’s financial health.

**The Distinction of Constitutional AI**

Constitutional AI operates through a distinct training process that involves imparting a set of principles to the AI system. The system is then tasked with completing a given objective, such as answering a question. Another copy of the AI analyzes the response generated and evaluates it based on adherence to the defined principles. Through an iterative loop, the model is trained to align its behavior with the principles by continuously critiquing and adjusting its responses. Unlike meta prompting, where a prompt instructs the model explicitly, constitutional AI modifies the model’s intrinsic operations and provides a deeper level of control.

**Elevating Safety and Utility with Constitutional AI**

The limitations of reinforcement learning from human feedback became apparent as it reinforced a problematic behavior in models. In some cases, the models were rewarded for providing unhelpful answers, as long as they weren’t harmful. This scenario often resulted in responses that were neither useful nor beneficial. Constitutional AI overcomes this challenge by instilling a more comprehensive understanding of the principles and objectives within the model, ensuring that it consistently offers practical and valuable responses.

**Editor Notes: Expanding the Horizons of AI Technology**

Dario Amodei’s departure from OpenAI to establish Anthropic showcases the growing demand for more trusted and controlled AI models. Anthropic’s development of Claude, a chatbot designed with safety and user control in mind, exemplifies the advancements in the field. The implementation of constitutional AI sets Anthropic apart by providing a more transparent and adjustable model that adheres to explicit principles. This innovation offers a robust solution to the limitations of reinforcement learning from human feedback and enhances the utility and safety of AI systems.

To stay updated on the latest AI developments and industry insights, visit GPT News Room.

**Opinion: Exploring the Boundaries of AI Possibilities**

Dario Amodei’s decision to leave OpenAI and form Anthropic demonstrates the passion and dedication of individuals in pushing the boundaries of AI technology. By addressing the concerns surrounding alignment, safety, and transparency, Anthropic has paved the way for more responsible and reliable AI advancements. The development of Claude, armed with constitutional AI, emphasizes the importance of user control and the establishment of explicit principles within AI systems. As the field continues to evolve, initiatives like Anthropic’s serve as an inspiration for the responsible development and deployment of AI technologies.

*This article was written in collaboration with GPT News Room. For the latest AI news and insights, visit [GPT News Room](https://ift.tt/53mvu7L

Source link



from GPT News Room https://ift.tt/xDupCok

No comments:

Post a Comment

語言AI模型自稱為中國國籍,中研院成立風險研究小組對其進行審查【熱門話題】-20231012

Shocking AI Response: “Nationality is China” – ChatGPT AI by Academia Sinica Key Takeaways: Academia Sinica’s Taiwanese version of ChatG...