Dataocean AI Launched High Quality Off-the-Shelf Datasets and Frontier Data Solutions at Interspeech 2024
19.9.2024 17:00:00 CEST | Business Wire | Press Release
In the rapidly growing AI market that especially focused on foundation models and Generative AI, the quality of datasets directly impacts the performance. In real-world applications, data is messy and improving models is not the only way to get better performance. As AI continues to transform industries, the need for high quality datasets has become critical for developing responsive, adaptable, and intelligent systems.
This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240919575026/en/
Dataocean AI at Interspeech 2024 (Photo: Business Wire)
At the Interspeech 2024, Dataocean AI, a global leader in AI data solutions, officially launched its latest offerings: high-quality off-the-shelf datasets. This exciting announcement further illustrates the company's position as a pioneer in the AI technology domain.
Dataocean AI introduced its newest corpus designed to meet the demands of various application scenarios - “Massively Multilingual Speech Corpus”. This corpus was recording from 215,891 speakers with total of 259,672 hours, covering over 100 languages. Along with this corpus, Dataocean AI also showcased its datasets in European languages. These meticulously labeled high quality datasets, covering English, French, Spanish, Turkish and Swedish, known for their diversity and accuracy, promise to enhance the performance of AI models across industries, such as smart finance, AI assistant, in-cabin, smart home, and other trendy topics related to AI.
The key strength of Dataocean AI’s datasets lies in their ability to deliver high precision across different fields.
- For data collection process, Dataocean AI leverages its extensive global network, comprising native speakers who professionally record in over 200+ languages. The company owns a team of native and professional speakers for these recordings and employs high-fidelity equipment within professional recording studios including indoor, outdoor, and in-cabin environments.
- For data labeling process, the company offer datasets that are labeled with their advanced self-developed platform with human in the loop. The expert team consist of scholars and specialists that covering multiple scenarios, and they have successfully build over 1100 speech datasets that match top quality standards, fulfilling the evolving needs of the AI industry.
In addition to speech datasets, Dataocean AI also owns over 1600 high-quality training datasets with proprietary intellectual property rights, covering a wide range of fields including foundation models, autonomous driving, finance, healthcare, and law. At the same time, its self-developed data processing platform, DOTS, equipped with more than 200 algorithms and hundreds of data processing tools, can achieve powerful functions such as automated labeling and assisted labeling, better helping customers reduce costs and increase efficiency. Additionally, they have earned data security regulations such as the European GDPR, and obtained certifications for ISO 9001, ISO 27001, and ISO 27001, ensuring safety and compliance.
Along with the high-quality datasets, Dataocean AI also empower LLMs through world-class live data collection for pre-trained and SFT/RLHF/red teaming for fine-tuning, as well as model evaluation.
Dataocean AI’s goal is to deliver one-stop data solution that ensuring their partners and clients can build reliable, adaptable AI models. This commitment to excellence is central to the company's mission of driving innovation in AI.
For more information about Dataocean AI’s latest datasets and their innovative data solutions, visit their official website at www.dataoceanai.com.
About Dataocean AI
With nearly 20 years project experience, Dataocean AI empower more than 1000 internet companies, AI enterprises and academic institutes with data total solutions. We offer over 1600 high quality off-the-shelf datasets and frontier data services, including data collection and data labeling serving for deep learning technology and enable clients’ AI models leading in the market.
View source version on businesswire.com: https://www.businesswire.com/news/home/20240919575026/en/
Contacts
contact@dataoceanai.com
(c) 2024 Business Wire, Inc., All rights reserved.
Business Wire, a Berkshire Hathaway company, is the global leader in multiplatform press release distribution.
Subscribe to releases from Business Wire
Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.
Latest releases from Business Wire
Trimontium Launches with $1.5 billion in AUM, Redefining Flexible Capital Solutions16.6.2026 01:01:00 CEST | Press Release
Trimontium (the “Firm”), an institutionally backed alternative asset manager specialising in flexible capital solutions, today announced its launch with $1.5 billion in assets under management. The Firm’s investment approach is rooted in credit and special-situations expertise, with the flexibility to originate and execute tailored financing solutions across the full capital structure for a wide range of corporate needs. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615892895/en/ Trimontium Founder and CIO, Vlado Spasov Founded by former Blackstone executive Vlado Spasov, Trimontium is one of the largest first-time alternative asset managers based in Europe focused on flexible capital solutions to launch, according to available market data. The Firm is backed by leading institutional partners in the United States, Canada, Asia, and Australia, who collectively manage over $15 trillion in assets. Trimontium has sourced all
Newmont Announces Key Executive Appointments for the Next Phase of Delivery15.6.2026 23:20:00 CEST | Press Release
Newmont Corporation (NYSE: NEM, ASX: NEM, PNGX: NEM) (“Newmont”) today announced leadership appointments that further shape its go-forward Executive Leadership Team under President and Chief Executive Officer Natascha Viljoen and reflect the depth of leadership talent within the company. Effective July 1, 2026, Brian Tabolt has been appointed Chief Financial Officer, Mark Rodgers has been appointed Chief Operating Officer, and David Thornton has been appointed Chief Technical Officer. In addition, David Fry has been promoted to Executive Vice President, Project Development, reflecting the importance of disciplined project development and execution as Newmont advances its highest-return growth opportunities. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615487768/en/ Mark Rodgers - COO “These appointments bring together respected leaders with deep industry experience and a strong understanding of our operational, financia
Westlake Expands Global Chlorovinyls Manufacturing Capacity With Acquisition of PVC and VCM Plants in Wilhelmshaven, Germany15.6.2026 20:18:00 CEST | Press Release
Westlake Corporation (NYSE: WLK) (“Westlake”) announced today that its German subsidiary, Westlake Vinnolit GmbH & Co. KG, has completed the previously-announced acquisition of a polyvinyl chloride and vinyl chloride monomer production site located in Wilhelmshaven, Germany (the “Wilhelmshaven plant”). The Wilhelmshaven plant, which was previously in insolvency administration, has the capacity to produce 380,000 metric tons of PVC per year. “This acquisition strengthens our Performance & Essential Materials business by expanding our global chlorovinyls manufacturing footprint and complements our existing chlorovinyl production facilities in Europe and North America,” said Jean-Marc Gilson, President and Chief Executive Officer of Westlake. “The Wilhelmshaven plant, which is located in Lower Saxony on Germany’s North Sea coast, benefits from advantageous logistical infrastructure, including a deep-water dock that enables efficient raw-materials supply. We look forward to welcoming the s
Onera Announces Integration of the Onera hPSG® Solution With Somnoware15.6.2026 19:35:00 CEST | Press Release
Onera Health, a leader in transforming sleep medicine, announces that its end-to-end home polysomnography solution, the Onera hPSG® solution, now integrates with Somnoware by ResMed sleep lab management software. This integration enables clinicians to conduct Polysomnography tests (PSGs) where patients sleep most comfortably, in their own home, while managing the entire workflow in Somnoware. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615106079/en/ Onera hPSG®, an end-to-end home polysomnography solution from Onera Health, is now integrated into Somnoware, enabling their shared customers to conduct Polysomnography tests (PSGs) in the patient's home while managing the entire workflow in Somnoware. “The integration with Somnoware is a welcomed enhancement that broadens access to the Onera hPSG® solution,” states Ruben de Francisco, Founder and CEO of Onera Health. “Many sleep centers are customers of both Onera and Somn
Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation15.6.2026 18:18:00 CEST | Press Release
The Digital Cooperation Organization (DCO), the world's first standalone international organization dedicated to inclusive and sustainable digital economy growth, today announced the launch of the Global Expert Community (GEC) — a new platform designed to mobilize expertise and advance international collaboration in support of high-impact digital initiatives across DCO Member States and beyond. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20260615565781/en/ Digital Cooperation Organization Launches Global Expert Community to Accelerate International Digital Cooperation (Graphic: AETOSWire) The GEC reflects the DCO's continued commitment to turning digital cooperation into action by expanding access to specialized expertise and strengthening collaboration across sectors and borders. As digital transformation reshapes economies and societies worldwide, the Community is designed to convert global perspectives and practical expe
In our pressroom you can read all our latest releases, find our press contacts, images, documents and other relevant information about us.
Visit our pressroom