Web 3.0 Ethiopia - DeFi & AI
696 subscribers
919 photos
21 videos
5 files
181 links
Bridging the Information Gap on DeFI and Artificial Intelligence for Ethiopians
Download Telegram
OpenAI's Vision for AI-Driven Automation in 2025

OpenAI’s Chief Product Officer, Kevin Weil, has outlined an ambitious vision for 2025, predicting that AI will automate coding by 99% by year’s end. This projection is supported by advancements in AI models, including enhanced pre-training and improved reasoning capabilities, as well as the development of tools like Deep Research, which offers insightful, specific information rather than generic responses. Beyond coding, OpenAI aims to democratize software access for everyone through its products and API, while also exploring the integration of AI into robotics.

This move reflects a broader goal to extend AI’s impact from digital tasks to physical applications, aligning with the company’s mission to create safe, beneficial, and universally accessible AI systems. Weil also expressed confidence in GPT-5, anticipating it will unify OpenAI’s O-series and GPT-series models, enhancing seamless AI interactions across various tasks.

@webthreeth
Mistral AI launches its new Mistral Small 3.1 model, which is able to be hosted and run locally

Mistral Small 3.1 is a cutting-edge language model developed by Mistral AI, designed to deliver exceptional performance in a compact size. With 24 billion parameters, it achieves an impressive 81% accuracy on standard benchmarks and processes 150 tokens per second. The model's efficiency is further enhanced by its ability to run locally on a single RTX 4090 GPU or a MacBook with 32GB RAM, which is particularly beneficial for hobbyists and organizations handling sensitive information.

Mistral Small 3.1 supports a wide range of languages and excels in both coding and generalist tasks, offering a cost-effective solution. Its advanced reasoning capabilities and strong adherence to system make it a versatile tool for various applications, from complex problem-solving to everyday tasks. The model's development without reinforcement learning or synthetic training data sets it apart from competitors.

@webthreeth
LG AI Research Unveils EXAONEDeep Today, Revolutionizing Agentic AI for Industry Applications

On March 18, 2025, Korean LG's AI wing, LG AI Research introduced EXAONEDeep, a groundbreaking next-generation AI model advancing "Agentic AI" with enhanced reasoning capabilities tailored for math, science, and coding tasks across professional and everyday applications.

Released as an open-source model at Nvidia’s GTC 2025, EXAONEDeep’s smaller variants—7.8B and 2.4B parameters—dominated major benchmarks, while the 32B version achieved the top spot on the AIME benchmark, outperforming a competitor at just 5% of its model size. This leap in efficiency and performance aligns with broader AI trends, such as Google’s recent focus on test-time compute scaling to improve model reasoning through self-verification, positioning EXAONEDeep as a leader in transforming real-world industry solutions.

@webthreeth
Web 3.0 Ethiopia - DeFi & AI
LG AI Research Unveils EXAONEDeep Today, Revolutionizing Agentic AI for Industry Applications On March 18, 2025, Korean LG's AI wing, LG AI Research introduced EXAONEDeep, a groundbreaking next-generation AI model advancing "Agentic AI" with enhanced reasoning…
AI intelligence is becoming more efficient and accessible over time. In AI, 'parameters' refer to the adjustable variables within a model that determine its capacity to learn, not the amount of training data itself.

LG today developed a model, EXAONEDeep, which is said to rival OpenAI’s o1 in performance while using only 4% of the parameters required for o1. This suggests that powerful AI can be achieved with fewer resources. LG’s model is also open-source, making it freely available.

For comparison, DeepSeek’s R1 model, trained on 671 billion parameters, is a leading open-source example, while LG’s EXAONE, potentially with 32 billion parameters, outperforms it. Such advancements could drive significant growth in AI, potentially paving the way for artificial general intelligence and super intelligence in the future.

Should I simplify posts like this to make the explanation better?
Gemini Introduces Audio Overview for Engaging Learning

Gemini is enhancing its platform with Audio Overview, a feature previously popular in NotebookLM, now available to Gemini and Gemini Advanced subscribers globally in English starting March 18, 2025.

This tool transforms uploaded documents, slides, and Deep Research reports into podcast-style audio discussions featuring two AI hosts who summarize content, connect topics, and provide dynamic insights. Users can upload various files like class notes or research papers, click a suggestion chip, and listen to an engaging breakdown on the go via the web or mobile app, with options to share or download.

This rollout aims to make learning fun and productive, with more languages planned soon, solidifying Gemini as a versatile collaborator at gemini.google.com.

@webthreeth
Google Unveils Canvas: A New Interactive Workspace for Gemini AI

On March 18, 2025, Google unveiled Canvas for its Gemini AI platform, introducing an interactive workspace designed to enhance productivity and creativity. This new feature allows users to draft, refine, and edit documents or code in real time with AI assistance, offering tools to adjust tone, length, and formatting, as well as generate and preview HTML/React code for web app prototypes.

Aimed at rivaling similar offerings like OpenAI’s Canvas and Anthropic’s Artifacts, Gemini’s Canvas integrates seamlessly with Google Docs for collaboration and supports a variety of use cases, from writing reports to coding interactive applications.

Launched globally for Gemini and Gemini Advanced subscribers, Canvas positions Gemini as a versatile tool for creators, developers, and students alike.

@webthreeth
The Length of AI Task Endurance Doubles Every 7 Months

The length of tasks that AI systems can successfully perform at a 50% success rate is doubling approximately every 7 months, as depicted in the METR scatter plot. Spanning from 2020 to 2024, the chart tracks various models, starting with GPT-2 and GPT-3, which could handle tasks lasting mere seconds, to more advanced models like GPT-4o and Sonnet 3.7, capable of managing tasks up to an hour by 2024.

This exponential growth highlights the accelerating capabilities of AI, with models like Sonnet 3.5, 3.6, and 3.7 showing significant leaps in performance within short timeframes, reflecting rapid advancements in AI efficiency and reliability over the years.

@webthreeth
Amharic Llama 3.2: Advancing AI for Ethiopian NLP

The Amharic Llama 3.2 model represents a significant step in natural language processing (NLP) for Amharic, one of Ethiopia’s most widely spoken languages.

Built on Meta’s Llama 3.2 transformer architecture, the model was trained from scratch using 300 million Amharic tokens and fine-tuned with high-quality datasets, including poems, stories, and Wikipedia articles. With 400 million parameters and a context length of 1024 tokens, it can generate fluent Amharic text, summarize content, and answer complex queries.

Its instruction-tuned version further enhances its capability to generate creative texts such as poems, jokes, and historical narratives, making it a valuable tool for research, education, and digital content creation.

Link to Try - https://huggingface.co/spaces/rasyosef/Llama-3.2-Amharic-Chat

Source - Yosef Worku Alemneh

@webthreeth
Perplexity AI Unveils Deep Research Update for Next Week

Perplexity AI is set to launch an enhanced version of its Deep Research feature next week, promising significant advancements in its analytical capabilities. This update will equip the tool with increased computing power, enabling it to think longer and deliver more detailed and comprehensive answers. The improved Deep Research will also incorporate code execution and the ability to render in-line charts, providing users with richer, data-driven insights.

Building on the foundation laid by its initial Deep Research launch in February 2025, this update aims to further streamline and accelerate in-depth research and analysis. The feature, designed to save users hours of work, will continue to autonomously conduct extensive searches, evaluate numerous sources, and synthesize the information into clear, actionable reports.

@webthreeth
Navigating the Future: Aravind Srinivas and Perplexity's Bold Leap into Agentic AI

Aravind Srinivas, as the CEO and co-founder of Perplexity AI, is steering the company toward an ambitious transformation by embracing the potential of agentic AI, as evidenced by his recent activities and statements on platforms like X.

In early 2025, Srinivas announced the launch of Perplexity Assistant, an agentic AI for Android devices capable of performing multi-step tasks autonomously, marking a significant shift from Perplexity’s origins as a conversational answer engine to a natively integrated assistant that can interact with apps and execute real-world actions.

@webthreeth
DeepSeek R2's Rumored to achieve 90% ARC-AGI Score

DeepSeek R2's rumored 90% score on the ARC-AGI benchmark represents a groundbreaking achievement, given that ARC-AGI—created by François Chollet—is one of the toughest tests for AI systems to demonstrate human-like reasoning and adaptability, focusing on abstract problem-solving without relying on cultural or acquired knowledge.

This score, far surpassing the 15-20% achieved by DeepSeek’s R1-Zero and R1 models (and other leading systems like OpenAI’s o1), suggests significant progress toward Artificial General Intelligence (AGI), potentially reshaping AI research, industry competition, and applications.

The implications extend beyond this technical milestone: DeepSeek’s advancements, as noted in web results, could lower barriers to AI adoption, disrupt proprietary model providers like those in the U.S., and accelerate innovation across fields.

@webthreeth
Tencent's Hunyuan-T1: A Breakthrough in AI Reasoning

Tencent's Hunyuan-T1, launched by the Hunyuan team, introduces a pioneering Hybrid-Mamba-Transformer MoE architecture, establishing it as the first ultra-large-scale reasoning model of its kind, designed for exceptional speed, accuracy, and efficiency in AI processing.

Building on the foundation of the earlier Hunyuan T1-Preview released in February 2025, and enhanced by large-scale reinforcement learning, the model outperforms or matches competitors like DeepSeek R1 and GPT-4.5 across various benchmarks, including MMLU-PRO, CEval, and AIME, demonstrating superior performance in knowledge, reasoning, math, and Chinese language tasks, as illustrated in the performance charts provided.

@webthreeth
One of the most remarkable aspects of Artificial Intelligence has been the rapid pace of its development over the past six months. I would argue that the level of disruption during this period has been significantly higher than in the preceding 24 months.

I don’t believe the pace of development will remain this intense until 2027—or perhaps until a model achieves AGI—but what we’ve witnessed in these last six months has been truly impressive. This also serves as a notable acknowledgment of China’s role in entering the competition and accelerating disruption by choosing to open-source everything.

As a result, to justify their pricing, proprietary models now need to deliver superior performance compared to open-source alternatives.

Source - McKinsey & Co. Superagency in the Workplace

@webthreeth
AI Revolution Unveiled: OpenAI and Grok Launch Image Editing Innovations

OpenAI and xAI's Grok significantly advanced their AI capabilities by introducing image editing features, marking a new era of accessibility in digital creativity. OpenAI integrated its image editing plugin with DALL-E 2 and later DALL-E 3 within the ResourceSpace platform, allowing users to regenerate specific areas of an image via text prompts, simplifying tasks like inpainting without requiring advanced technical skills.

Meanwhile, Grok, leveraging its Aurora model, rolled out a feature in March 2025 that enables users to upload images and modify them by describing changes—such as adding objects, altering backgrounds, or adjusting lighting—directly through natural language on the X platform. While OpenAI’s approach caters to structured editing within a professional environment, Grok’s implementation emphasizes ease and flexibility.

@webthreeth
AI-Powered Economic Boom: Predicting 30% to 100% Annual Growth by 2045

GATE, an AI and Automation Scenario Explorer built by Epoch AI, shows how AI automation might dramatically boost the global economy by replacing human labor with faster, more scalable computing power. It compares two scenarios: one where full automation leads to a 100% annual growth in Gross World Product (GWP), and a more conservative estimate with 30% yearly growth.

Starting around 2025, both scenarios predict an exponential rise in GWP, driven by a cycle where AI advancements increase economic output, which then funds further AI development. By 2045, this could result in a global economy 10 to 100 times larger than today, demonstrating AI's potential to transform economic growth over the coming decades.

@webthreeth
Not R2, But DeepSeek Made Stellar Improvements on the V3 Model

DeepSeek has made substantial improvements to its V3 model, a 671B parameter Mixture-of-Experts (MoE) language model, with a notable minor version update announced on March 24, 2025, as reflected in the official DeepSeek channels.

This update enhances the model’s reasoning and performance capabilities, building on its knowledge distillation pipeline that leverages reasoning patterns from the DeepSeek R1 series, incorporating advanced verification and reflection techniques to significantly boost its mathematical problem-solving and multi-task question-answering abilities.

@webthreeth
ARC-AGI-2 : New Rankings Instrument to measure the race towards AGI

The ARC-AGI-2 benchmark, launched alongside the ARC Prize 2025 competition, has introduced a new ranking landscape for artificial general intelligence (AGI) systems as of March 25, 2025.

Designed to be a more challenging iteration of the original ARC-AGI, this updated benchmark retains its core format—tasks that are easy for humans but difficult for AI—while raising the bar for machine reasoning capabilities. Current rankings reveal a stark contrast between human and AI performance: humans consistently achieve scores above 95% with minimal training, whereas leading AI models struggle significantly.

Notably, OpenAI's o3, which previously scored an impressive 75.7% on ARC-AGI-1’s semi-private evaluation set, is projected to drop below 30% on ARC-AGI-2, even with high compute resources. This new ranking underscores ARC-AGI-2’s effectiveness in exposing current AI limitations.

@webthreeth
I had an idea. So, I was thinking about uploading a ten minute discussion about the weekly updates of AI in the form of a podcast through Gemini Audio Overview. I use the Gemini Audio Overview daily to learn about new stuffs and I think people will like the idea of weekly update on AI market for ten minutes on Sunday Morning.

Should I try it out? Please like if you think this idea is cool. I will make it only once per week and it will help people digest the weekly updates easily.
👍6
Unveiling Gemini 2.5 Pro: Google DeepMind's Experimental AI Trial for Advanced Users

Gemini 2.5 Pro, an experimental AI model by Google DeepMind, has been released on a trial basis to select Gemini Advanced subscribers as of March 25, 2025. This rollout targets advanced users and includes features like a "thinking phase" for enhanced reasoning, aimed at handling complex tasks more effectively.

The model offers a 2M token context window, advanced coding capabilities, and multimodal input support, but some users have reported bugs, suggesting it's still in early testing. Access appears limited to those with a Gemini Advanced subscription, often part of the Google One AI Premium Plan, with no official confirmation from Google yet.

@webthreeth