PerSCiDO facilitates the exploration of research datasets.

Share your research datasets using PerSCiDO!

Datasets: 28
Downloaded: 311

Explore PerSCiDO research data collections and related publications

Recent datasets

  • Open
  • Experimental data
  • F-TRACT, ATLAS June 2017
  • Olivier David
  • Dataset containing connectivity probability and peak latency estimated from CCEP data recorded in 174 patients, provided only in the MarsAtlas parcellation scheme.
  • Restricted
  • Web data
  • Bull-IA on Osirim platform
  • THOMAZEAU Jacques
  • Archives (2006-2016) of the GDR I3 mailing list. The list carries news about upcoming conferences, calls for papers, and funding and job announcements (postdoc positions, other posts, etc.). Subscribers to the GDR I3 BULLetin include all members (industry practitioners, researchers, teaching researchers, PhD students, etc.) of the Information, Intelligence and Interaction communities who are concerned with the issues at the core of these research fields, through its various working groups.
  • Open
  • Image data
  • Laurent Besacier
  • SPEECH-COCO is an augmentation of the MS-COCO dataset in which speech is added to image and text. Speech captions were generated using text-to-speech (TTS) synthesis, resulting in 616,767 spoken captions (>600h) paired with images. Disfluencies and speed perturbation were added to the signal to make the speech sound more natural. Each speech signal (WAV) is paired with a JSON file containing the exact timecode of each word/syllable/phoneme in the spoken caption. Such a corpus could be used for Language and Vision (LaVi) tasks that take speech, rather than text, as input or output.
  • Open
  • Web data
  • A sample of the INA RDF data
  • Manuel Atencia
  • This is a sample of the RDF data owned by INA (Institut national de l'audiovisuel), a repository of all French radio and television audiovisual archives, which has made it publicly available for scientific purposes. The whole INA RDF data (around 6 million RDF facts) was used in experiments evaluating a novel import-by-query algorithm for data interlinking (see the related publication). These experiments led to the discovery of person homonyms in the INA dataset (see the related dataset "A sample of owl:sameAs links within the INA RDF dataset").
  • Open
  • Web data
  • A sample of owl:sameAs links within the INA RDF dataset
  • Manuel Atencia
  • This is a sample of the owl:sameAs links discovered by the import-by-query algorithm (see the related publication) within the INA RDF dataset. The sample refers to person homonyms. The algorithm used DBpedia as an external source, together with a set of 35 rules encoding semantic constraints associated with the RDF datasets, domain knowledge, vocabulary mappings, and owl:sameAs transitivity. In total, 4,884 owl:sameAs links and 9,764 owl:differentFrom links were discovered. A sample of the corresponding INA RDF data can be found in the related dataset "A sample of the INA RDF data", also available on the PerSCiDO platform.