Snowflake announced a state-of-the-art large language model uniquely designed to be the most open, enterprise-grade LLM on the market.
Snowflake
Snowflake Arctic - LLM for Enterprise AI
Introducing Snowflake Arctic, a top-tier enterprise focused LLM pushing the frontiers of cost-effective training and openness.
Nvidia acquired Run:ai for $700M
Run:ai a Tel Aviv-based company that makes it easier for developers and operations teams to manage and optimize their AI hardware infrastructure, for an undisclosed sum.
Investment philosophy: support companies that leverage its technology
2021: 14 investments
2022: 14 investments
2023: 40 investments
2024: 12 investments so far
What's going on:
- From looking at the market map, their investment strategy is solely focused on:
1. Companies building foundational models
(Cohere, Imbue, Runway, Inflection, etc).
2. Companies that help deploy these models
(Together, Replicate, Hugging Face, etc).
Run:ai a Tel Aviv-based company that makes it easier for developers and operations teams to manage and optimize their AI hardware infrastructure, for an undisclosed sum.
Investment philosophy: support companies that leverage its technology
2021: 14 investments
2022: 14 investments
2023: 40 investments
2024: 12 investments so far
What's going on:
- From looking at the market map, their investment strategy is solely focused on:
1. Companies building foundational models
(Cohere, Imbue, Runway, Inflection, etc).
2. Companies that help deploy these models
(Together, Replicate, Hugging Face, etc).
Drexel University announced a new machine-learning technology that enables accurate estimation of brain age using a low-cost EEG device.
YouTube
How Old Is Your Brain?
A team of researchers from Drexel and Stockton universities has developed a new and practical way to monitor general brain health and detect premature brain aging using a low-cost EEG headset and a machine learning algorithm, presenting a quick and easy way…
Immersive_tech_in_healthcare_1713358341.pdf
14.2 MB
This is a very interesting report focused on the use of immersive technology, like VR, in the healthcare sector.
The goal of the work is to help those in healthcare (including providers, built environment experts, and policy makers) to:
1. Advocate for the benefits of XR as a means to innovate health and social care
2. Increase debate and dialog across networks of expertise to create health-promoting environments
3. Understand the overriding priorities in making effective pathways to the implementation of XR.
This research is unique in its methodology that includes literature review but also semi-structured interviews.
The unique nature of this work really pays off in the knowledge gained.
The authors conclude that:
(a) both built environment and healthcare sectors can benefit from the various capabilities of XR through cross-sectional initiatives, evidence-based practices, and participatory approaches.
(b) a confluence of knowledge and methods of HCI and HBI can increase the interoperability and usability of XR for the patient-centered and value-based healthcare models.
(c) the XR-enabled technological regime will largely affect the new forms of value in healthcare premises by fostering more decentralized, preventive, and therapeutic characteristics in the future healthcare ecosystems.
The goal of the work is to help those in healthcare (including providers, built environment experts, and policy makers) to:
1. Advocate for the benefits of XR as a means to innovate health and social care
2. Increase debate and dialog across networks of expertise to create health-promoting environments
3. Understand the overriding priorities in making effective pathways to the implementation of XR.
This research is unique in its methodology that includes literature review but also semi-structured interviews.
The unique nature of this work really pays off in the knowledge gained.
The authors conclude that:
(a) both built environment and healthcare sectors can benefit from the various capabilities of XR through cross-sectional initiatives, evidence-based practices, and participatory approaches.
(b) a confluence of knowledge and methods of HCI and HBI can increase the interoperability and usability of XR for the patient-centered and value-based healthcare models.
(c) the XR-enabled technological regime will largely affect the new forms of value in healthcare premises by fostering more decentralized, preventive, and therapeutic characteristics in the future healthcare ecosystems.
Movement Labs raised $38 million led by Polychain Capital to build a layer-2 on Ethereum using Move, the programming language developed by Meta
Fortune Crypto
Movement Labs raises $38 million to build layer 2 blockchain on Ethereum with Facebook tech | Fortune Crypto
Polychain led the Series A funding round, which included participation from Hack VC, dao5, and Robot Ventures.
👍4
SenseTime launched SenseNova 5.0, which according to the report (translated from Chinese):
- Beats GPT-4T on nearly all benchmarks
- Has a 200k context window
- Is trained on more than 10TB tokens
- Has major advancements in knowledge, mathematics, reasoning, and coding capabilities
- Beats GPT-4T on nearly all benchmarks
- Has a 200k context window
- Is trained on more than 10TB tokens
- Has major advancements in knowledge, mathematics, reasoning, and coding capabilities
Zhidx
商汤甩出大模型豪华全家桶!秀拳皇暴打GPT-4,首晒“文生视频”,WPS小米现场助阵 - 智东西
“大模型+大算力”双轮驱动,运营算力达12000P。
xAI is raising $6 billion at a valuation of $18 billion from investors including Sequoia
Musk launched xAI in early 2023, and it released its chatbot Grok to premium subscribers on Musk’s social network X in December.
The company is currently training the second generation of Grok on 20,000 Nvidia H100s, the chips that the most advanced AI models operate on, according to Musk.
Musk launched xAI in early 2023, and it released its chatbot Grok to premium subscribers on Musk’s social network X in December.
The company is currently training the second generation of Grok on 20,000 Nvidia H100s, the chips that the most advanced AI models operate on, according to Musk.
The Information
Musk’s xAI Is Close to Raising $6 Billion from Sequoia, Others
Elon’s Musk xAI is close to having billions of dollars more in its coffers to make its chatbot Grok a more fearsome competitor to OpenAI’s ChatGPT. Musk’s year-old startup is raising $6 billion at a valuation of $18 billion not including the investment, according…
China’s most prominent pro-blockchain official, Yao Qian, is under investigation by the Chinese government for suspected violations of law.
The specific reasons are unknown.
He was the creator of China’s CBDC and served as the director of the central bank’s digital currency research institute.
On April 8, Yao published an article discussing the Bitcoin spot ETF approved by the USA, saying that the market expects Bitcoin prices to still rise, discussed the opinions of supporters and opponents of Bitcoin, and introduced the regulatory measures for cryptocurrencies in the USA.
The specific reasons are unknown.
He was the creator of China’s CBDC and served as the director of the central bank’s digital currency research institute.
On April 8, Yao published an article discussing the Bitcoin spot ETF approved by the USA, saying that the market expects Bitcoin prices to still rise, discussed the opinions of supporters and opponents of Bitcoin, and introduced the regulatory measures for cryptocurrencies in the USA.
Cnstock
姚前被查-新闻-上海证券报·中国证券网
上证报中国证券网讯 据中央纪委国家监委驻中国证监会纪检监察组、广东省纪委监委消息:中国证监会科技监管司司长、信息中心主任姚前涉嫌严重违纪违法,目前正接受中央纪委国家监委驻中国证监会纪检监察组纪律审查和广东省汕尾市监察委员会监察调查。
WebSim is such a fascinating look at what a truly generative Internet might look like.
The URL bar is a prompting engine that builds a fully interactive and customizable site based on your input.
You can instantly create websites, simulators, games, and more.
You can have fun on WebSim with no product knowledge (try typing in a URL of your own name)
Or, there's a much deeper and more complex language to learn if you want to really build on it - with some randomness thrown in 😂
It truly is the "hallucinated Internet"!
The URL bar is a prompting engine that builds a fully interactive and customizable site based on your input.
You can instantly create websites, simulators, games, and more.
You can have fun on WebSim with no product knowledge (try typing in a URL of your own name)
Or, there's a much deeper and more complex language to learn if you want to really build on it - with some randomness thrown in 😂
It truly is the "hallucinated Internet"!
❤1🤮1
Pleias published the largest dataset to date with automated OCR correction, 1 billion words in English, French, German and Italian.
OCR quality is primary concern of digitization in any large scale organization. Scans are not always on well-preserved and in many case existing OCR tools are not able to properly parse specific fonts or formats, especially in other languages than English.
Automated post-OCR correction has been made possible thanks to progress in open LLM research and several months of dedicated training and alignment by Pleias.
Results are now encouraging most of the time, on a variety of European languages, even when the text is severely degraded.
OCR quality is primary concern of digitization in any large scale organization. Scans are not always on well-preserved and in many case existing OCR tools are not able to properly parse specific fonts or formats, especially in other languages than English.
Automated post-OCR correction has been made possible thanks to progress in open LLM research and several months of dedicated training and alignment by Pleias.
Results are now encouraging most of the time, on a variety of European languages, even when the text is severely degraded.
huggingface.co
PleIAs/Post-OCR-Correction · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Google announced Med-Gemini, a family of Gemini models fine-tuned for medical tasks
Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal & long-context applications.
Surpasses GPT-4 on all benchmarks!
Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal & long-context applications.
Surpasses GPT-4 on all benchmarks!
arXiv.org
Capabilities of Gemini Models in Medicine
Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex...
Meta announced Better & Faster Large Language Models via Multi-token Prediction
Large language models such as GPT and Llama are trained with a next-token prediction loss.
Large language models such as GPT and Llama are trained with a next-token prediction loss.
huggingface.co
Paper page - Better & Faster Large Language Models via Multi-token Prediction
Join the discussion on this paper page
Another triumph for Self-Play. Self-Play Preference Optimization (SPPO) has surpassed (iterative) DPO, IPO, Self-Rewarding LMs, and others on AlpacaEval, MT-Bench, and the Open LLM Leaderboard.
Remarkably, Mistral-7B-instruct-v0.2 fine-tuned by SPPO achieves superior performance to GPT-4 0613 without relying on any GPT-4 responses.
Remarkably, Mistral-7B-instruct-v0.2 fine-tuned by SPPO achieves superior performance to GPT-4 0613 without relying on any GPT-4 responses.
All you need is Kolmogorov–Arnold Network (KAN)
Kolmogorov-Arnold network obliterates Deepmind's results with much smaller networks and much more automation.
KANs also discovered new formulas for signature and discovered new relations of knot invariants in unsupervised ways.
GitHub.
Kolmogorov-Arnold network obliterates Deepmind's results with much smaller networks and much more automation.
KANs also discovered new formulas for signature and discovered new relations of knot invariants in unsupervised ways.
GitHub.
arXiv.org
KAN: Kolmogorov-Arnold Networks
Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs). While MLPs have fixed activation...
OpenAI is about to go after Google search.
This could be the most serious threat Google has ever faced.
OpenAI's SSL certificate logs now show they created search.chatgpt.com
Microsoft Bing would allegedly power the service.
This shouldn’t be too surprising, considering:
1. OpenAI has a web crawler, GPTBot.
2. ChatGPT Plus users can also use Browse with Bing to search the web.
3. Microsoft Bing uses OpenAI’s GPT-4, customized for search.
This could be the most serious threat Google has ever faced.
OpenAI's SSL certificate logs now show they created search.chatgpt.com
Microsoft Bing would allegedly power the service.
This shouldn’t be too surprising, considering:
1. OpenAI has a web crawler, GPTBot.
2. ChatGPT Plus users can also use Browse with Bing to search the web.
3. Microsoft Bing uses OpenAI’s GPT-4, customized for search.
Academic benchmarks are losing their potency. There’re 3 types of LLM evaluations that matter:
1. Privately held test set but publicly reported scores, by a trusted 3rd party who doesn’t have their own LLM to promote.Scale’s latest GSM1k is a great example. They are an unbiased neutral party who ensures that the test data is not leaked into anyone’s training.
2. Public, comparative benchmarks like Lmsys.org Chatbot Arena, reported in ELO score. You can’t game democracy.
3. Privately curated, internal benchmarks for each company’s own use cases. You can’t game your customers.
1. Privately held test set but publicly reported scores, by a trusted 3rd party who doesn’t have their own LLM to promote.Scale’s latest GSM1k is a great example. They are an unbiased neutral party who ensures that the test data is not leaked into anyone’s training.
2. Public, comparative benchmarks like Lmsys.org Chatbot Arena, reported in ELO score. You can’t game democracy.
3. Privately curated, internal benchmarks for each company’s own use cases. You can’t game your customers.
H-GAP is a generalist model for humanoid control.
Trained on large MoCap-derived data, it can generate diverse, natural motions & transfer skills to new tasks without fine-tuning!
Paper.
Trained on large MoCap-derived data, it can generate diverse, natural motions & transfer skills to new tasks without fine-tuning!
Paper.
Yingchen Xu
Humanoid Control with a Generalist Planner
Meta and Georgia institute of technology released a dataset + SOTA AI models to help accelerate research on Direct Air Capture — a key technology to combat climate change.
OpenDAC23 is the largest dataset of Metal Organic Frameworks characterized by their ability to adsorb CO2 in the presence of water — an order of magnitude larger than any other pre-existing dataset at this precision.
OpenDAC23 is the largest dataset of Metal Organic Frameworks characterized by their ability to adsorb CO2 in the presence of water — an order of magnitude larger than any other pre-existing dataset at this precision.
⚡4
"Neuro-GPT: Towards A Foundation Model for EEG" is available
Code on GitHub.
Pre-trained model on HuggingFace.
Code on GitHub.
Pre-trained model on HuggingFace.
arXiv.org
Neuro-GPT: Towards A Foundation Model for EEG
To handle the scarcity and heterogeneity of electroencephalography (EEG) data for Brain-Computer Interface (BCI) tasks, and to harness the power of large publicly available data sets, we propose...
⚡4
Unlearn AI released a new neural network architecture for learning to create digital twins of patients.
⚡4