Technology Innovation Institute Announces Launch of NOOR, the World’s Largest Arabic NLP Model
Technology Innovation Institute (TII), a global research center and applied research pillar of Abu Dhabi’s Advanced Technology Research Council, today announced the launch of NOOR, the world’s largest Arabic natural language processing (NLP) model to date.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220411005085/en/
Noor, the world's largest Arabic NLP Model - AI Cross-Center Unit, Technology Innovation Institute (Photo: AETOSWire)
TII’s team of advanced researchers and Artificial Intelligence (AI) specialists, has joined forces with LightOn, a technology company that unlocks extreme-scale machine intelligence for businesses, to transform the Arabic NLP model. The NOOR model has the capability to carry out tasks beyond the domain of language - offering end-to-end pipeline high quality data, including crawling, filtering, and curation at scale. The model facilitates extreme-scale distributed training and serving – to deliver applications with efficient inference and model specialization.
Dr. Ray O. Johnson, CEO, TII and ASPIRE, said: “With this development, we are well on track to enhance our research capabilities and credentials as well as elevate the status of Abu Dhabi and the UAE as a serious research ecosystem. Our expert teams have demonstrated yet again that this region can achieve breakthrough R&D outcomes to impact the world.”
Dr. Ebtesam Almazrouei, Director, AI Cross-Center Unit, TII, said: “Large language models have taken the world of natural language processing by storm, and we are proud to introduce this cutting-edge model with 10 billion parameters - the world’s largest Arabic NLP model. The uniquely large Arabic dataset collected to train the model is the result of months of work that included curating, scrapping, and filtering of varied sources. A special thank you to the entire team that worked on this project to make NOOR the go-to exploration model in Arabic for academicians and businesses everywhere.”
Speaking on the launch, Prof. Mérouane Debbah, Chief Researcher, Digital Science Research Center and AI Cross-Center Unit, TII, said: “With NOOR, TII has expanded the scope of the modern standard Arabic model by leveraging know-how in large language models to build cross-disciplinary, cutting-edge expertise in this new generation of AI research.”
To curate the world’s largest high-quality cross-domain Arabic datasets, NOOR’s unique dataset of more than 30 billion words combines web data with books, poetry, news articles, and technical information to significantly widen the applicability of the model.
Dr. Ebtesam Almazrouei said the NOOR model is based on the popular Transformer architecture. As a decoder-only model, similar in structure to GPT-3, it is programmed to tackle generative tasks with architecture upgraded to reflect the latest developments in the world of machine learning, including improvements such as better positional embeddings. To help ensure quality at scale in the NOOR dataset, the TII team designed an automated filtering pipeline based on machine learning techniques. These tools identify text like quality references and safeguard the model from exposure to spam content.
Leveraging state-of-the-art 3D parallelism, NOOR was trained on a High-Performance Computing resource with 128 A100 GPUs, allowing for the distribution of computations and ensuring efficient use of the available hardware resources.
The Director of the AI Cross-Center Unit noted that this was only the first step in the Unit’s efforts to contribute to the wider UAE Strategy for Artificial Intelligence.
Named for the Arabic word "light", the model has been so called to establish the correlation of the Arabic language model to enlightening the mind.
About Technology Innovation Institute (TII)
For more information, visit www.tii.ae
To view this piece of content from cts.businesswire.com, please give your consent at the top of this page.
Technology Innovation Institute
Sneha Sivanand, firstname.lastname@example.org
About Business Wire
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
HCL and UNLEASH Partner to Develop Solutions for Aquatic Ecosystem Conservation20.5.2022 17:59:00 CEST | Press release
HCL Group and UNLEASH, a global innovation program for the UN Sustainable Development Goals (SDGs), announced a year-long collaboration to mobilize youth and develop innovative solutions to promote aquatic ecosystem conservation. These solutions will aim to tackle challenges from Source (mountains & glaciers) to Sink (oceans and seas) and their links to terrestrial ecosystems. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220520005178/en/ Nature and ocean conservation play a critical role in our survival. Terrestrial and aquatic ecosystems provide us with food, water, oxygen, energy, and medicines. They regulate our climate, provide pollination to crops, and reduce the impact of natural hazards. Despite the vital importance of our planet’s ecosystems, we are experiencing a human-caused deterioration of our natural habitats: human activity has altered almost 75% of the Earth’s terrestrial surface, squeezing wildlife and natu
Tecnotree utses till Årets förändringsskapare av Helsingforsbörsens stiftelse20.5.2022 17:24:00 CEST | Pressmeddelande
Tecnotree, en teknikleverantör från Esbo, utses till Årets förändringsskapare. Företaget har bidragit till att leverera europeisk-finsk innovation för att driva på tillväxten i framväxande marknader. Dess 5G-färdiga digitala produkter och lösningar, som har tagits väl emot globalt i Europa, Latinamerika, Mellanöstern, Afrika samt Asien och Stillahavsområdet, är avgörande för att ge telekomsektorn och dess kunder avancerad digital kapacitetstillgång till viktiga digitala tjänster över hela världen inom hälso- och sjukvård, utbildning och betalningstjänster. Detta pressmeddelande använder multimedia. Se den fullständiga versionen här: https://www.businesswire.com/news/home/20220518005762/sv/ Tecnotree Padma Ravichander and Minna Heusala of the Stock Exchange Foundation at the Stock Exchange Gala on May 17, 2022. Picture taken by Tuomas Pietinen. (Photo: Business Wire) ”Förändring och omvandling utgör själva kärnan i vår verksamhet. Under det senaste decenniet har vi infört en intern kult
PPG Showcases Innovations in Paints, Coatings, Specialty Materials That Enhance Sustainability, Efficiency, Mobility20.5.2022 14:00:00 CEST | Press release
PPG (NYSE:PPG) today showcased its latest innovations to media at its production and research and development (R&D) facility in Amsterdam. The event focused on advancements in three key areas – sustainability, efficiency and mobility – reflecting the company’s goals of helping customers lower costs, reduce their environmental footprint and support the global shift to electric vehicles (EVs). This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220520005101/en/ PPG showcased its latest innovations to media at its production and research and development facility in Amsterdam on May 20. The event focused on advancements in three key areas – sustainability, efficiency and mobility – reflecting the company’s goals of helping customers lower costs, reduce their environmental footprint and support the global shift to electric vehicles. (Photo: Business Wire) Recent PPG innovations highlighted during the event include: PPG CORACHAR® batte
Ecopia AI Partners with Snap Inc. Subsidiary to Pilot 3D Map Content Integration20.5.2022 13:00:00 CEST | Press release
Ecopia AI announced today that it was selected by a Snap Inc. subsidiary to provide high-precision vector mapping data. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220520005091/en/ Sample of the 3D Vector Map of Buildings and Vegetation Generated by Ecopia AI Leveraging Airbus Imagery (Photo: Business Wire) Ecopia leverages advanced AI-based mapping systems to mine the most up-to-date commercially-available geospatial imagery, accessed through its global partner network, outputting high-precision vector maps. For this initiative, Ecopia turned to Airbus for access to their global premium 30-50cm high-resolution imagery database, which is serving as the input imagery for large-scale map content production. “Ecopia has proven their ability to deliver highly-accurate mapping data at a large-scale with unparalleled speed,” said Snap, Inc subsidiary spokesperson. “Ecopia’s mission is to digitize the world using AI, offering hi
B2Broker Announced Annual Payments for B2Core, MarksMan, and B2Trader Products20.5.2022 10:00:00 CEST | Press release
B2Broker is excited to announce that it now offers an annual payment option for the three core products: MarksMan, B2Core, and B2Trader. With the introduction of this new plan, customers will be provided with a discount and a simpler approach to planning their budget. This change will allow the company to streamline its finances and improve cash flow. The annual plan is already applicable to all three products. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20220520005015/en/ B2Broker Announced Annual Payments for B2Core, MarksMan, and B2Trader Products (Graphic: Business Wire) MarksMan Whether you're a seasoned pro or just getting started in the world of digital assets, MarksMan is the perfect solution. With support for spot and perpetual futures liquidity, along with easy access to liquidity pools on major crypto exchanges, MarksMan has everything you need. There's no better time to test it out than now, with the basic packa
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.Visit our pressroom