PerSCiDO facilitates the exploration of research datasets.

Share your research datasets using PerSCiDO!

Numbers
Datasets: 27
Downloaded: 248

Explore PerSCiDO research data collections and related publications

Recent datasets


  • Restricted
  • Web data
  • Bull-IA on Osirim platform
  • THOMAZEAU Jacques
  • Archives 2006-2016 of the GDR I3 mailing list. The list carries news about upcoming conferences, calls for papers, and funding and job announcements (postdocs, positions, etc.). Subscribers to the GDR I3 BULLetin include all members (industry practitioners, researchers, faculty, doctoral students, etc.) of the Information, Intelligence and Interaction communities who are concerned with the issues at the core of these research areas, across its various working groups.
  • Open
  • Image data
  • SPEECH-COCO
  • Laurent Besacier
  • SPEECH-COCO is an augmentation of the MS-COCO dataset in which speech is added to image and text. Speech captions were generated using text-to-speech (TTS) synthesis, resulting in 616,767 spoken captions (>600h) paired with images. Disfluencies and speed perturbation were added to the signal to make it sound more natural. Each speech signal (WAV) is paired with a JSON file containing the exact timecode for each word/syllable/phoneme in the spoken caption. Such a corpus could be used for Language and Vision (LaVi) tasks that take speech, rather than text, as input or output.
  • Open
  • Web data
  • A sample of the INA RDF data
  • Manuel Atencia
  • This is a sample of the RDF data owned by INA (Institut national de l'audiovisuel), a repository of all French radio and television audiovisual archives, which has made it publicly available for scientific purposes. The whole INA RDF data (around 6 million RDF facts) was used in experiments evaluating a novel import-by-query algorithm for data interlinking (see the related publication). These experiments made it possible to discover person homonyms in the INA dataset (see the related dataset "A sample of owl:sameAs links within the INA RDF dataset").
  • Open
  • Web data
  • A sample of owl:sameAs links within the INA RDF dataset
  • Manuel Atencia
  • This is a sample of the owl:sameAs links discovered by the import-by-query algorithm (see the related publication) within the INA RDF dataset. The sample refers to person homonyms. The algorithm used DBpedia as an external source and a set of 35 rules translating semantic constraints associated with the RDF datasets, domain knowledge, vocabulary mappings, and owl:sameAs transitivity. In total, 4,884 owl:sameAs links and 9,764 owl:differentFrom links were discovered. A sample of the corresponding INA RDF data may be found in the related dataset "A sample of the INA RDF data", also available on the PerSCiDO platform.
  • Restricted
  • Video data
  • MobileRGBD
  • Dominique Vaufreydaz
  • MobileRGBD is a corpus dedicated to benchmarking low-level RGB-D algorithms on mobile platforms. We reversed the usual corpus recording paradigm. Our goal is to facilitate ground truth annotation and the reproducibility of recordings across variations in speed, trajectory, and environment. To eliminate unpredictable human movement, we used dummies to play static users in the environment. The advantage of dummies is that they do not move between two recordings. It is therefore possible to record the same robot movement repeatedly in order to evaluate the performance of detection algorithms at varying speeds. This benchmark corpus is intended for "low-level" RGB-D algorithms such as 3D-SLAM, body/skeleton tracking, or face tracking using a mobile robot.