RoadText-1K
Commit eb25185 · Jun 22, 2021 11:07 AM · 1 commit

Overview

Perceiving text is crucial for understanding the semantics of outdoor scenes and is therefore a critical requirement for building intelligent systems for driver assistance and self-driving. Most existing datasets for text detection and recognition comprise still images and were compiled with text in mind. This paper introduces a new "RoadText-1K" dataset for text in driving videos. The dataset is 20 times larger than the largest existing dataset for text in videos. It comprises 1000 video clips of driving, collected without any bias towards text, with annotations for text bounding boxes and transcriptions in every frame. State-of-the-art methods for text detection, recognition and tracking are evaluated on the new dataset, and the results highlight the challenges of unconstrained driving videos compared to existing datasets. This suggests that RoadText-1K is suited for research and development of reading systems robust enough to be incorporated into more complex downstream tasks such as driver assistance and self-driving.

Citation

Please use the following citation when referencing the dataset:

@article{reddy2020roadtext,
  title={RoadText-1K: Text Detection \& Recognition Dataset for Driving Videos},
  author={Reddy, Sangeeth and Mathew, Minesh and Gomez, Lluis and Rusinol, Mar{\c{c}}al and Jawahar, CV and others},
  journal={arXiv preprint arXiv:2005.09496},
  year={2020}
}
Basic Information
Application Scenarios: Not Available
Annotations: Not Available
Tasks: Not Available
License: Unknown
Updated on: 2021-01-20 05:07:02
Metadata
Data Type: Not Available
Data Volume: 0
Annotation Amount: 0
File Size: 0 B
Copyright Owner: CVIT (Centre for Visual Information Technology)
Annotator: Unknown