71 to 80 of 107 Results
Sep 18, 2025 - Language Technologies Laboratory
De Luca Fornaciari, Francesca; Mash, Audrey; Melero, Maite; Villegas, Marta, 2025, "CA-GL_Parallel_Corpus", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/VUTENU, BSC Dataverse, V1
The CA-GL Parallel Corpus is a Catalan-Galician synthetic dataset of parallel sentences created to support the use of co-official languages from Spain, such as Catalan and Galician, in NLP tasks, specifically Machine Translation. The dataset can be used to train Bilingual Machine Translation models between Galician and Catalan in any direction, as... |
Sep 18, 2025 - Language Technologies Laboratory
De Luca Fornaciari, Francesca; Mash, Audrey; Melero, Maite; Villegas, Marta, 2025, "CA-EU_Parallel_Corpus", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/A9UJA9, BSC Dataverse, V1
The CA-EU Parallel Corpus is a Catalan-Basque synthetic dataset of parallel sentences created to support the use of co-official languages from Spain, such as Catalan and Basque, in NLP tasks, specifically Machine Translation. The dataset can be used to train Bilingual Machine Translation models between Basque and Catalan in any direction, as well a... |
Sep 18, 2025 - Earth Sciences
Bowdalo, Dene, 2025, "GHOST: A globally harmonised dataset of surface atmospheric composition measurements", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/1YNJTT, BSC Dataverse, V1
GHOST: Globally Harmonised Observations in Space and Time, represents one of the biggest collection of harmonised measurements of atmospheric composition at the surface. In total, 7,275,148,646 measurements from 1970-2023, of 227 different components, from 38 reporting networks, are compiled, parsed, and standardised. Components processed include g... |
Sep 17, 2025 - Earth Sciences
Di Tomaso, Enza, 2025, "MONARCH high-resolution reanalysis data set of desert dust aerosol over Northern Africa, the Middle East and Europe", https://doi.org/10.82201/1APRWJ, BSC Dataverse, V1
This repository contains a high resolution regional reanalysis data set of desert dust aerosols. It covers Northern Africa, the Middle East and Europe along with the Mediterranean sea and parts of Central Asia, and the Atlantic and Indian Oceans between 2007 and 2016 at the horizontal resolution of 0.1° latitude × 0.1° longitude in rotated grid, an... |
Sep 16, 2025 - Language Technologies Laboratory
Saiz Antón, José Javier; Palomar-Giner, Jorge; Villegas, Marta, 2025, "CATalog", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/FAFYBH, BSC Dataverse, V2
CATalog is a diverse, open-source Catalan corpus for language modelling. It consists of text documents from 26 different sources, including web crawling, news, forums, digital libraries and public institutions, totaling in 17.45 billion words. |
Sep 15, 2025
|
Sep 15, 2025 - Earth Sciences
Bretonnière, Pierre-Antoine, 2025, "CORDEX data", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/EDVBJN, BSC Dataverse, V1
This is a partial replica of the "CORDEX" data (Coordinated Regional Climate Downscaling Experiment: https://cordex.org/) hosted and downloadable from https://esgf-metagrid.cloud.dkrz.de/search?project=CORDEX They include multiple models and experiments from different downscaling domains, and at different frequencies and for different variables. |
Sep 15, 2025 - Earth Sciences
Bretonnière, Pierre-Antoine, 2025, "Climate Data Store seasonal forecasts", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/PDC68P, BSC Dataverse, V1
This is a subset of the seasonal forecasts from the Climate Data Store, originally hosted at https://cds.climate.copernicus.eu/datasets/seasonal-original-single-levels?tab=overview. They offer several multi-member models operationally available, from surface and pressure levels, covering range from 1993 to present, from 6hourly to monthly means of... |
Aug 27, 2025Computational Social Sciences & Humanities Dataverse
Advance archaeological research through AI, HPC, remote sensing, and geospatial analysis. Automate the detection, mapping, and protection of archaeological sites and features, towards the reconstruction of ancient societies by examining aspects such as urban development, economic networks, human mobility, and ancient agricultural practices. |
