Metrics
33,620 Downloads
The BSC Dataverse is the institutional research data repository of the Barcelona Supercomputing Center - Centro Nacional de Supercomputación (BSC-CNS). It aims to enable the storage, sharing, and discovery of research data from BSC researchers, collaborators, and affiliated projects.
71 to 80 of 107 Results
Sep 18, 2025 - Language Technologies Laboratory
De Luca Fornaciari, Francesca; Mash, Audrey; Melero, Maite; Villegas, Marta, 2025, "CA-GL_Parallel_Corpus", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/VUTENU, BSC Dataverse, V1
The CA-GL Parallel Corpus is a Catalan-Galician synthetic dataset of parallel sentences created to support the use of co-official languages from Spain, such as Catalan and Galician, in NLP tasks, specifically Machine Translation. The dataset can be used to train Bilingual Machine Translation models between Galician and Catalan in any direction, as...
Sep 18, 2025 - Language Technologies Laboratory
De Luca Fornaciari, Francesca; Mash, Audrey; Melero, Maite; Villegas, Marta, 2025, "CA-EU_Parallel_Corpus", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/A9UJA9, BSC Dataverse, V1
The CA-EU Parallel Corpus is a Catalan-Basque synthetic dataset of parallel sentences created to support the use of co-official languages from Spain, such as Catalan and Basque, in NLP tasks, specifically Machine Translation. The dataset can be used to train Bilingual Machine Translation models between Basque and Catalan in any direction, as well a...
Sep 18, 2025 - Earth Sciences
Bowdalo, Dene, 2025, "GHOST: A globally harmonised dataset of surface atmospheric composition measurements", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/1YNJTT, BSC Dataverse, V1
GHOST (Globally Harmonised Observations in Space and Time) represents one of the largest collections of harmonised measurements of atmospheric composition at the surface. In total, 7,275,148,646 measurements from 1970-2023, of 227 different components, from 38 reporting networks, are compiled, parsed, and standardised. Components processed include g...
Sep 17, 2025 - Earth Sciences
Di Tomaso, Enza, 2025, "MONARCH high-resolution reanalysis data set of desert dust aerosol over Northern Africa, the Middle East and Europe", https://doi.org/10.82201/1APRWJ, BSC Dataverse, V1
This repository contains a high-resolution regional reanalysis data set of desert dust aerosols. It covers Northern Africa, the Middle East, and Europe, along with the Mediterranean Sea and parts of Central Asia and the Atlantic and Indian Oceans, from 2007 to 2016 at a horizontal resolution of 0.1° latitude × 0.1° longitude on a rotated grid, an...
Sep 16, 2025 - Language Technologies Laboratory
Saiz Antón, José Javier; Palomar-Giner, Jorge; Villegas, Marta, 2025, "CATalog", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/FAFYBH, BSC Dataverse, V2
CATalog is a diverse, open-source Catalan corpus for language modelling. It consists of text documents from 26 different sources, including web crawling, news, forums, digital libraries and public institutions, totaling 17.45 billion words.
Computer Applications in Science and Engineering (Barcelona Supercomputing Center)
Sep 15, 2025
Sep 15, 2025 - Earth Sciences
Bretonnière, Pierre-Antoine, 2025, "CORDEX data", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/EDVBJN, BSC Dataverse, V1
This is a partial replica of the "CORDEX" data (Coordinated Regional Climate Downscaling Experiment: https://cordex.org/), hosted at and downloadable from https://esgf-metagrid.cloud.dkrz.de/search?project=CORDEX. It includes multiple models and experiments from different downscaling domains, at different frequencies and for different variables.
Sep 15, 2025 - Earth Sciences
Bretonnière, Pierre-Antoine, 2025, "Climate Data Store seasonal forecasts", https://dataverse.bsc.es/dataset.xhtml?persistentId=perma:BSC/PDC68P, BSC Dataverse, V1
This is a subset of the seasonal forecasts from the Climate Data Store, originally hosted at https://cds.climate.copernicus.eu/datasets/seasonal-original-single-levels?tab=overview. They offer several operationally available multi-member models, at surface and pressure levels, covering the period from 1993 to present, from 6-hourly to monthly means of...
Computational Archaeology (Barcelona Supercomputing Center)
Aug 27, 2025 - Computational Social Sciences & Humanities Dataverse
Advance archaeological research through AI, HPC, remote sensing, and geospatial analysis. Automate the detection, mapping, and protection of archaeological sites and features, towards the reconstruction of ancient societies by examining aspects such as urban development, economic networks, human mobility, and ancient agricultural practices.