Databases or Datasets for Computer Vision Applications and Testing
Generally, to avoid confusion, in this bibliography, the word database
is used for
database systems or research and would apply to image database query techniques
rather than a database containing images for use in specific applications.
I have chosen to use dataset to describe
collections of images used by researchers in some domain.
In the past test data was difficult, but the advent of modern digital cameras
has simplified acquiring data. But in order to test and especially compare
algorithms, a common dataset is essential.
Test data is available in bits and pieces and in several larger repositories,
These listed datasets are selected from the references in the
Computer Vision Bibliography. There are other datasets and often
older ones get removed from web sites.
The links on the Author and Journal references in the list point to
entries in that database.
Current research and applications are highlighted in various
Computer Vision and Image Processing Conferences.
Some of these have evaluation sessions with related datasets.
Computer Vision resources include:
For more information on the topics, contact information, etc.
see the annotated
Computer Vision Bibliography or
the Complete
Conference Listing for Computer Vision and Image Analysis
Detailed Entries for Dataset
Khosla, A.[Aditya],
Raju, A.S.[Akhil S.],
Torralba, A.B.[Antonio B.],
Oliva, A.[Aude],
Understanding and Predicting Image Memorability at a Large Scale,
ICCV15(2390-2398)
IEEE DOI
Dataset, Memorability.
WWW Link. Benchmark testing
Berga, D.,
Vidal, X.R.F.,
Otazu, X.,
Pardo, X.M.,
SID4VAM: A Benchmark Dataset With Synthetic Images for Visual
Attention Modeling,
ICCV19(8788-8797)
IEEE DOI
Dataset, Gaze Tracking. gaze tracking, learning (artificial intelligence), neural nets,
SID4VAM, visual attention modeling, saliency metrics, Benchmark testing
Barnard, K.[Kobus], and
Funt, B.V.[Brian V.],
Camera characterization for color research,
ColorRes(27), No. 3, 2002, pp. 153-164.
PDF File.
Dataset, Color Calibration.
WWW Link.
Huang, X.Y.[Xin-Yu],
Wang, P.[Peng],
Cheng, X.J.[Xin-Jing],
Zhou, D.F.[Ding-Fu],
Geng, Q.C.[Qi-Chuan],
Yang, R.G.[Rui-Gang],
The ApolloScape Open Dataset for Autonomous Driving and Its
Application,
PAMI(42), No. 10, October 2020, pp. 2702-2719.
IEEE DOI
Dataset, Autonomous Driving. Semantics, Task analysis, Videos,
Labeling, Image segmentation,
3D understanding
Yu, F.[Fisher],
Chen, H.F.[Hao-Feng],
Wang, X.[Xin],
Xian, W.Q.[Wen-Qi],
Chen, Y.Y.[Ying-Ying],
Liu, F.C.[Fang-Chen],
Madhavan, V.[Vashisht],
Darrell, T.J.[Trevor J.],
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask
Learning,
CVPR20(2633-2642)
IEEE DOI
WWW Link.
Dataset, Road Scenes. Task analysis, Visualization, Roads, Image segmentation, Meteorology,
Training, Benchmark testing
DDD17: End-To-End DAVIS Driving Dataset,
2017
WWW Link.
Dataset, Road Scenes. Over 12 h of a 346x260 pixel DAVIS sensor recording highway and city
driving in daytime, evening, night, dry and wet weather.
Waymo Open Dataset,
2020
WWW Link.
Dataset, Road Scenes. high-resolution sensor data collected by autonomous vehicles operated
by the Waymo Driver in a wide variety of situations.
UZH FPV Drone Racing Dataset 2.0,
2024
WWW Link.
Dataset, Visual Odometry.
Dataset, SLAM. The dataset comprises dozens of real-world sequences where a quadrotor
controlled in first-person view (FPV) by a professional pilot has been
flown both indoors and outdoors. Each sequence contains images, IMU,
and events (from an event-based camera) recorded on-board, as well as
ground truth from a robotic total station or motion capture system.
The ROad event Awareness Dataset for Autonomous Driving (ROAD),
2021
WWW Link.
Dataset, Autonomous Driving. It contains 22 long-duration videos (ca 8 minutes each), ideal for
continual learning research, annotated in terms of road
events, defined as triplets E = (Agent, Action, Location) and
represented as tubes, i.e., a series of frame-wise bounding
box detections.
ROAD is a large, high-quality multi-label benchmark, with 122K
labelled video frames comprising 560K detection bounding boxes
associated with 1.7M unique individual labels (560K agent labels, 640K
action labels and 499K location labels).
DSEC: A Stereo Event Camera Dataset for Driving Scenarios,
2021.
HTML Version. CVPR 2021 competition dataset.
Dataset, Stereo.
Dataset, Driving.
Stereo Event Camera large-scale dataset for challenging driving
scenarios! DSEC features over 400GB of data including stereo VGA
Prophesee event cameras, stereo RGB cameras, Velodyne lidar, and
RTK-GPS, recorded in challenging high-dynamic-range, day and night,
sunrise and sunset, urban and Swiss-mountain driving scenarios.
Singh, G.[Gurkirt],
Akrigg, S.[Stephen],
di Maio, M.[Manuele],
Fontana, V.[Valentina],
Alitappeh, R.J.[Reza Javanmard],
Khan, S.[Salman],
Saha, S.[Suman],
Jeddisaravi, K.[Kossar],
Yousefi, F.[Farzad],
Culley, J.[Jacob],
Nicholson, T.[Tom],
Omokeowa, J.[Jordan],
Grazioso, S.[Stanislao],
Bradley, A.[Andrew],
di Gironimo, G.[Giuseppe],
Cuzzolin, F.[Fabio],
ROAD: The Road Event Awareness Dataset for Autonomous Driving,
PAMI(45), No. 1, January 2023, pp. 1036-1054.
IEEE DOI
Dataset, Autonomous Driving. Roads, Autonomous vehicles, Task analysis, Videos, Benchmark testing,
Decision making, Vehicle dynamics, Autonomous driving,
decision making
Li, L.[Li],
Ismail, K.N.[Khalid N.],
Shum, H.P.H.[Hubert P. H.],
Breckon, T.P.[Toby P.],
DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic
Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving
Applications,
3DV21(1227-1237)
IEEE DOI
Dataset, Autonomous Driving. Reflectivity, Laser radar, Image resolution, Supervised learning,
Estimation, Benchmark testing, autonomous driving, dataset,
three dimensional
Zendel, O.[Oliver],
Honauer, K.[Katrin],
Murschitz, M.[Markus],
Steininger, D.[Daniel],
Domínguez, G.F.[Gustavo Fernández],
WildDash: Creating Hazard-Aware Benchmarks,
ECCV18(VI: 407-421).
Springer DOI
Dataset, Highway Hazards. Driving hazards.
Sakaridis, C.[Christos],
Dai, D.X.[Deng-Xin],
Van Gool, L.J.[Luc J.],
Semantic Foggy Scene Understanding with Synthetic Data,
IJCV(126), No. 9, September 2018, pp. 973-992.
Springer DOI
And:
ACDC: The Adverse Conditions Dataset with Correspondences for
Semantic Driving Scene Understanding,
ICCV21(10745-10755)
IEEE DOI
Dataset, Haze. Not just dehazing, actually understand the scene.
Training, Image segmentation, Visualization, Rain, Snow, Semantics,
Datasets and evaluation, Scene analysis and understanding,
Vision for robotics and autonomous vehicles
Dev Roy, S.,
Kanti Bhowmik, M.,
Oakley, J.,
A Ground Truth Annotated Video Dataset for Moving Object Detection in
Degraded Atmospheric Outdoor Scenes,
ICIP18(1318-1322)
IEEE DOI
Dataset, Object Detection. Object detection, Lighting, Meteorology, Cameras, Image restoration,
Streaming media, Atmospheric measurements, Image Enhancement
Zhang, Y.J.[Yu-Jun],
Zhu, L.[Lei],
Feng, W.[Wei],
Fu, H.Z.[Hua-Zhu],
Wang, M.Q.[Ming-Qian],
Li, Q.X.[Qing-Xia],
Li, C.[Cheng],
Wang, S.[Song],
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane
Detection,
ICCV21(15661-15670)
IEEE DOI
Dataset, Lane Detection. Performance evaluation, Codes, Lane detection, Annotations,
Object segmentation, Streaming media,
grouping and shape
Codevilla, F.,
Santana, E.,
Lopez, A.,
Gaidon, A.,
Exploring the Limitations of Behavior Cloning for Autonomous Driving,
ICCV19(9328-9337)
IEEE DOI
Dataset, Driver Behavior.
WWW Link. behavioural sciences computing,
learning (artificial intelligence), neural nets, Vehicle dynamics
Lee, G.H.[Gim Hee],
Achtelik, M.,
Fraundorfer, F.,
Pollefeys, M.,
Siegwart, R.,
A benchmarking tool for MAV visual pose estimation,
ICARCV10(1541-1546).
IEEE DOI
Dataset, SLAM. Large scale SLAM dataset with more sensors. For UAV algorithm evaluations.
Ji, R.R.[Rong-Rong],
Duan, L.Y.[Ling-Yu],
Chen, J.[Jie],
Yang, S.[Shuang],
Huang, T.J.[Tie-Jun],
Yao, H.X.[Hong-Xun],
Gao, W.[Wen],
PKUBench: A context rich mobile visual search benchmark,
ICIP11(2545-2548).
IEEE DOI
Dataset, Landmarks. Landmark search aided by GPS.
Li, N.[Ning],
Zhao, Y.Q.[Yong-Qiang],
Pan, Q.[Quan],
Kong, S.G.[Seong G.],
Chan, J.C.W.[Jonathan Cheung-Wai],
Full-time Monocular Road Detection Using Zero-distribution Prior of
Angle of Polarization,
ECCV20(XXV:457-473).
Springer DOI
Dataset, Road Detection.
WWW Link.
Winkens, C.,
Sattler, F.,
Adams, V.,
Paulus, D.,
HyKo: A Spectral Dataset for Scene Understanding,
CVRoads17(254-261)
IEEE DOI
Dataset, Roads. Autonomous vehicles, Cameras, Hypercubes, Hyperspectral imaging,
Image color analysis, Sensors
Schmidt, A.[Adam],
Fularz, M.[Michal],
Kraft, M.[Marek],
Kasinski, A.[Andrzej],
Nowicki, M.[Michal],
An Indoor RGB-D Dataset for the Evaluation of Robot Navigation
Algorithms,
ACIVS13(321-329).
Springer DOI
Dataset, Navigation.
Swedish Trafic Signs,
Online2010
WWW Link.
Dataset, Traffic Signs.
Challenging Unreal and Real Environments for Traffic Sign Detection and Recognition,
Online2017
CURE-TSD and CURE-TSR
WWW Link.
WWW Link.
Dataset, Traffic Signs.
Dataset, CURE-TSR.
Dataset, CURE-TSD. Real-world and synthesized video sequences with challenging
conditions. In total, there are 5,733 video sequences and around 1.72
million frames.
CMU VASC Image Database,
Online1997
WWW Link.
Dataset, Motion. CMU has a collection of image datasets available. These include a number of
motion sequences, stereo (with and without ground truth),
faces and expressions, and cars.
PEIPA Computer Vision Software,
Online2004.
HTML Version.
Code, Computer Vision.
Dataset. Pilot European Image Processing Archive.
This lists a number of sources for various alogrithms.
They also include pointers to the usual set of image databases.
BBC Motion Gallery,
Video data.
Online2004
WWW Link. Video clips, including rights managed and
production ready royalty-free footage. Available to preview,
purchase and download.
Dataset, Retrieval.
Dataset, Video.
Large Scale Dataset for Cross-Model Multimedia Analysis,
2013.
HTML Version.
Dataset, Image Retrieval.
Dataset, Text Retrieval.
See also
Large Scale Video Database.
Shirahatti, N.V.[Nikhil V.],
Barnard, K.[Kobus],
Evaluating Image Retrieval,
CVPR05(I: 955-961).
IEEE DOI HTML Version.
Code, Image Retrieval.
Dataset, Image Retrieval.
Duygulu, P.,
Barnard, K.,
de Freitas, J.F.G.,
Forsyth, D.A.,
Object Recognition as Machine Translation:
Learning a Lexicon for a Fixed Image Vocabulary,
ECCV02(IV: 97 ff.).
Award, ECCV, Cognitive Vision.
Springer DOI HTML Version.
Dataset, Object Recognition.
Murray, N.[Naila],
Marchesotti, L.[Luca],
Perronnin, F.[Florent],
Learning to rank images using semantic and aesthetic labels,
BMVC12(110).
DOI Link
Earlier:
AVA: A large-scale database for aesthetic visual analysis,
CVPR12(2408-2415).
IEEE DOI
Dataset, Aesthetic Analysis.
Johnson, J.[Justin],
Hariharan, B.[Bharath],
van der Maaten, L.[Laurens],
Hoffman, J.,
Fei-Fei, L.[Li],
Zitnick, C.L.[C. Lawrence],
Girshick, R.[Ross],
Inferring and Executing Programs for Visual Reasoning,
ICCV17(3008-3017)
IEEE DOI
Earlier: A1, A2, A3, A5, A6, A7, Only:
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary
Visual Reasoning,
CVPR17(1988-1997)
IEEE DOI
Dataset, Visual Reasoning.
WWW Link.
backpropagation, image matching,
learning (artificial intelligence), neural nets,
Visualization.
Cognition, Image color analysis, Metals, Semantics, Shape.
Visual7W visual question answering,
Large-scale visual question answering (QA) dataset, with object-level
groundings and multimodal answers.
WWW Link.
Dataset, Visual Question Answering.
Visual Genome,
Visual Genome is a dataset, a knowledge base, an ongoing effort to
connect structured image concepts to language.
WWW Link.
WWW Link.
Dataset, Visual Question Answering.
Mathew, M.[Minesh],
Karatzas, D.[Dimosthenis],
Jawahar, C.V.,
DocVQA: A Dataset for VQA on Document Images,
WACV21(2199-2208)
IEEE DOI WWW Link.
Dataset, Visual Q-A. Visualization, Text analysis, Image recognition,
Image analysis, Layout
UT Zappos50K,
Dataset, Shoes.
WWW Link.
University of Texas shoe dataset. 50,025 images.
Xiao, J.X.[Jian-Xiong],
Hays, J.[James],
Ehinger, K.A.[Krista A.],
Oliva, A.[Aude],
Torralba, A.B.[Antonio B.],
SUN database: Large-scale scene recognition from abbey to zoo,
JEP:HPP(36), No. 6, 2010, pp. 1430-1442.
And:
CVPR10(3485-3492).
IEEE DOI
Dataset, Recognition.
WWW Link. 131067 images, 908 categories, objects and object categories.
Xiao, J.X.[Jian-Xiong],
Ehinger, K.A.[Krista A.],
Hays, J.[James],
Torralba, A.B.[Antonio B.],
Oliva, A.[Aude],
SUN Database: Exploring a Large Collection of Scene Categories,
IJCV(119), No. 1, August 2016, pp. 3-22.
Springer DOI
Dataset, Object Recognition.
WWW Link.
Le Cun, Y.L.[Yann L.],
Huang, F.J.[Fu Jie],
Bottou, L.[Leon],
Learning methods for generic object recognition with invariance to pose
and lighting,
CVPR04(II: 97-104).
IEEE DOI And:
PDF File.
WWW Link.
Dataset, Objects. Real time implementation. Find generic objects.
Blandfort, P.[Philipp],
Karayil, T.[Tushar],
Hees, J.[Jörn],
Dengel, A.[Andreas],
The Focus-Aspect-Value model for predicting subjective visual
attributes,
MultInfoRetr(9), No. 1, March 2020, pp. 47-60.
Springer DOI
Dataset, Retrieval.
WWW Link.
Philbin, J.[James],
Chum, O.[Ondrej],
Isard, M.[Michael],
Sivic, J.[Josef],
Zisserman, A.[Andrew],
Lost in quantization: Improving particular object retrieval in large
scale image databases,
CVPR08(1-8).
IEEE DOI HTML Version.
Dataset, Objects.
Chum, O.[Ondrej],
Philbin, J.[James],
Sivic, J.[Josef],
Isard, M.[Michael],
Zisserman, A.[Andrew],
Total Recall: Automatic Query Expansion with a Generative Feature Model
for Object Retrieval,
ICCV07(1-8).
IEEE DOI
And: A2, A1, A4, A3, A5:
Object retrieval with large vocabularies and fast spatial matching,
CVPR07(1-8).
IEEE DOI HTML Version.
Dataset, Buildings.
Award, Longuet-Higgins. (after 10 years).
Bell, S.[Sean],
Upchurch, P.[Paul],
Snavely, N.[Noah],
Bala, K.[Kavita],
Material recognition in the wild with the Materials in Context
Database,
CVPR15(3479-3487)
IEEE DOI
And:
MINC Dataset,
WWW Link.
Dataset, Materials.
Large Scale Video Database,
2012.
WWW Link.
Dataset, Video Database.
This database consists of 156,823 videos sequences
(2,907,447 keyframes), which were crawled from YouTube during the
period of July 2010 to September 2010. We provide the features as well
as the ground truth.
See also
Multiple feature hashing for real-time large scale near-duplicate video retrieval.
See also
Large Scale Dataset for Cross-Model Multimedia Analysis.
MA14KD: Movie Attraction 14K Dataset,
WWW Link.
Dataset, Visual Attractiveness. MA14KD provides a set of "Attractiveness" features extracted from
14000 movie and TV series trailers. The movie IDs are in agreement
with the movie IDs provided by a rating dataset, that contains
millions of ratings and thousands of tags.
Xu, J.[Jun],
Mei, T.[Tao],
Yao, T.[Ting],
Rui, Y.[Yong],
MSR-VTT:
A Large Video Description Dataset for Bridging Video and Language,
CVPR16(5288-5296)
IEEE DOI
Dataset, Video Analysis.
See also
MSR VTT Dataset.
Li, Y.C.[Yun-Cheng],
Song, Y.[Yale],
Cao, L.L.[Liang-Liang],
Tetreault, J.[Joel],
Goldberg, L.[Larry],
Jaimes, A.[Alejandro],
Luo, J.B.[Jie-Bo],
TGIF: A New Dataset and Benchmark on Animated GIF Description,
CVPR16(4641-4650)
IEEE DOI
WWW Link.
Dataset, Animations.
Liu, J.Z.[Jing-Zhou],
Chen, W.[Wenhu],
Cheng, Y.[Yu],
Gan, Z.[Zhe],
Yu, L.C.[Li-Cheng],
Yang, Y.M.[Yi-Ming],
Liu, J.J.[Jing-Jing],
Violin: A Large-Scale Dataset for Video-and-Language Inference,
CVPR20(10897-10907)
IEEE DOI
Dataset, Video. Task analysis, Visualization, Cognition, Natural languages, TV,
Motion pictures, Benchmark testing
Huang, Q.Q.[Qing-Qiu],
Xiong, Y.[Yu],
Rao, A.[Anyi],
Wang, J.Z.[Jia-Ze],
Lin, D.H.[Da-Hua],
Movienet: A Holistic Dataset for Movie Understanding,
ECCV20(IV:709-727).
Springer DOI
Dataset, Movie Understanding.
WWW Link.
Deep Video Understanding Dataset,
2020, used for workshops, and challenges.
WWW Link.
Dataset, Video Understanding.
Bakker, E.M.[Erwin M.],
Open and free datasets for multimedia retrieval,
MultInfoRetr(5), No. 3, September 2016, pp. 135-136.
WWW Link.
Dataset, Multimedia Retrieval.
Khan, M.,
Chakareski, J.,
NJIT 6DOF VR Navigation Dataset,
2020.
WWW Link.
Dataset, Virtual Reality. 6DOF (six degrees of freedom) virtual reality (VR) navigation data
comprising spatial position (x,y,z) and head orientation (rotation
angles yaw, pitch, and roll) of mobile VR users navigating a VR
environment in an indoor arena.
Animals with Attributes 2 Dataset,
2017
Dataset, Animals.
WWW Link.
Reference:
See also
Zero-Shot Learning: The Good, the Bad and the Ugly. Note the earlier AWA dataset has been removed due to copyright issues and
replaces with this version.
Cat Dataset,
2013
Dataset, Cats.
WWW Link. 9000 cat images with annotations.
Nilsback, M.E.[Maria-Elena],
Zisserman, A.[Andrew],
Automated Flower Classification over a Large Number of Classes,
ICCVGIP08(722-729).
IEEE DOI HTML Version.
Dataset, Flowers.
HTML Version.
CitDet: Comprehensive Citrus Fruit Detection and Classification Dataset,
2024
Dataset, Citrus Fruit.
WWW Link. CitDet consists of over 32,000 bounding box annotations for fruit
instances contained in 579 high-resolution images. Especially related
to Huanglongbing (HLB) disease in trees.
Plant Phenotyping Datasets for Computer Vision,
2016
WWW Link.
Dataset, Plants. We present a collection of benchmark datasets in the context of plant
phenotyping. We provide annotated imaging data and suggest suitable
evaluation criteria for plant/leaf segmentation, detection, tracking
as well as classification and regression problems. The figure
symbolically depicts the data available together with ground truth
segmentations and further annotations and metadata.
Article in press.
See also
Finely-grained annotated datasets for image-based plant phenotyping.
Wood image database,
2000.
WWW Link.
WWW Link. For information also see:
HTML Version.
Dataset, Lumber.
Beery, S.[Sara],
van Horn, G.[Grant],
Perona, P.[Pietro],
Recognition in Terra Incognita,
ECCV18(XVI: 472-489).
Springer DOI
Dataset, Animals.
WWW Link.
Tropical Coral Reef Fish Detection, Tracking And Classification,
Fish4Knowledge project datasets.
Online2014
WWW Link.
Dataset, Fish.
See also
University of Edinburgh.
See also
Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data.
Swanson, A.[Alexandra],
Kosmala, M.[Margaret],
Lintott, C.[Chris],
Simpson, R.[Robert],
Smith, A.[Arfon],
Packer, C.[Craig],
Snapshot Serengeti, high-frequency annotated camera trap
images of 40 mammalian species in an African savanna,
ScientificData(2), June 2015, Article 150026.
DOI Link
Dataset, Animals. Covered by many news outlets. Thousands of pictures of animals from motion
activated cameras planted in the Serengeti. Includes interface for people to
identify, etc. A great resource for automated detection and identification.
Brookes, O.[Otto],
Mirmehdi, M.[Majid],
Stephens, C.[Colleen],
Angedakin, S.[Samuel],
Corogenes, K.[Katherine],
Dowd, D.[Dervla],
Dieguez, P.[Paula],
Hicks, T.C.[Thurston C.],
Jones, S.[Sorrel],
Lee, K.[Kevin],
Leinert, V.[Vera],
Lapuente, J.[Juan],
McCarthy, M.S.[Maureen S.],
Meier, A.[Amelia],
Murai, M.[Mizuki],
Normand, E.[Emmanuelle],
Vergnes, V.[Virginie],
Wessling, E.G.[Erin G.],
Wittig, R.M.[Roman M.],
Langergraber, K.[Kevin],
Maldonado, N.[Nuria],
Yang, X.Y.[Xin-Yu],
Zuberbühler, K.[Klaus],
Boesch, C.[Christophe],
Arandjelovic, M.[Mimi],
Kühl, H.[Hjalmar],
Burghardt, T.[Tilo],
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour
Recognition,
IJCV(132), No. 8, August 2024, pp. 3086-3102.
Springer DOI
Dataset, Apes.
Pre-Corrective Optics Space Telescope Axial Replacement Hubble Space Telescope star-cluster dataset,
Astronomy dataset.
Dataset, Astronomy.
WWW Link.
Ramanathan, S.[Subramanian],
Katti, H.[Harish],
Sebe, N.[Nicu],
Kankanhalli, M.[Mohan],
Chua, T.S.[Tat-Seng],
An Eye Fixation Database for Saliency Detection in Images,
ECCV10(IV: 30-43).
Springer DOI
Dataset, Eye Fixation.
Rivera-Rubio, J.[Jose],
Idrees, S.[Saad],
Alexiou, I.[Ioannis],
Hadjilucas, L.[Lucas],
Bharath, A.A.[Anil A.],
A dataset for Hand-Held Object Recognition,
ICIP14(5881-5885)
IEEE DOI
Dataset, Object Recognition.
And:
Small Hand-Held Object Recognition Test (SHORT),
WACV14(524-531)
IEEE DOI
Earlier:
Mobile Visual Assistive Apps:
Benchmarks of Vision Algorithm Performance,
ACVR13(30-40).
Springer DOI
Computer vision
Cameras
Spacenet,
2020.
Research Group, Europe.
WWW Link. Accelerating Geospatial Machine Learning
Dataset, Mapping.
WWW Link.
Koch, T.[Tobias],
d'Angelo, P.[Pablo],
Kurz, F.[Franz],
Fraundorfer, F.[Friedrich],
Reinartz, P.[Peter],
Körner, M.[Marco],
The TUM-DLR Multimodal Earth Observation Evaluation Benchmark,
SatStreet16(698-705)
IEEE DOI
Dataset, Remote Sensing.
WWW Link. Same scene, satellite, air, UAV, smartphone.
ISPRS Benchmarks,
Online2021
WWW Link.
Dataset, Urban Data.
Dataset, Building Detection.
Dataset, Object Detection.
Dataset, Point Cloud Segmentation. Multiple datasets. Some with associated benchmarks and challenges.
Includes: VAihingen/Enz, Toronto, Potsdam, UAVid, Gaofen,
EuroSDR, Urban classification.
See also
ISPRS: International Society for Photogrammetry and Remote Sensing.
Hong, D.F.[Dan-Feng],
Hu, J.L.[Jing-Liang],
Yao, J.[Jing],
Chanussot, J.[Jocelyn],
Zhu, X.X.[Xiao Xiang],
Multimodal remote sensing benchmark datasets for land cover
classification with a shared and specific feature learning model,
PandRS(178), 2021, pp. 68-80.
Elsevier DOI
Dataset, Remote Sensing. Benchmark datasets, Classification, Feature learning,
Hyperspectral, Land cover mapping, DSM, Multimodal, Specific features
Boguszewski, A.[Adrian],
Batorski, D.[Dominik],
Ziemba-Jankowska, N.[Natalia],
Dziedzic, T.[Tomasz],
Zambrzycka, A.[Anna],
LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands,
Water and Roads from Aerial Imagery,
EarthVision21(1102-1110)
IEEE DOI
Dataset, Aerial Mapping. Deep learning, Image segmentation,
Image resolution, Satellites, Roads, Buildings
Shermeyer, J.,
Hogan, D.,
Brown, J.,
van Etten, A.,
Weir, N.,
Pacifici, F.,
Hänsch, R.,
Bastidas, A.,
Soenen, S.,
Bacastow, T.,
Lewis, R.,
SpaceNet 6: Multi-Sensor All Weather Mapping Dataset,
EarthVision20(768-777)
IEEE DOI
Dataset, Mapping. Synthetic aperture radar, Optical sensors, Optical imaging,
Adaptive optics, Optical polarization, Buildings
Chen, H.[Hao],
Shi, Z.W.[Zhen-Wei],
A Spatial-Temporal Attention-Based Method and a New Dataset for
Remote Sensing Image Change Detection,
RS(12), No. 10, 2020, pp. xx-yy.
DOI Link
WWW Link.
Dataset, Building Changes. LEVIR-CD Dataset.
Verma, S.[Sagar],
Panigrahi, A.[Akash],
Gupta, S.[Siddharth],
QFabric: Multi-Task Change Detection Dataset,
EarthVision21(1052-1061)
IEEE DOI
Dataset, Change Detection. Deep learning, Urban areas, Predictive models, Benchmark testing,
Metadata
Zhou, D.B.[Dong-Bo],
Liu, S.J.[Shuang-Jian],
Yu, J.[Jie],
Li, H.[Hao],
A High-Resolution Spatial and Time-Series Labeled Unmanned Aerial
Vehicle Image Dataset for Middle-Season Rice,
IJGI(9), No. 12, 2020, pp. xx-yy.
DOI Link
Dataset, Rice.
AerialWaste: a professionally curated dataset for waste detection in aerial images,
2023.
WWW Link.
Dataset, Garbage. AerialWaste is a dataset for landfill detection featuring airborne,
WorldView-3, and GoogleEarth images annotated by professional photo
interpreters. AerialWaste contains 10,434 images generated from tiles
of three different sources: AGEA Orthophotos (20 cm GSD),
WorldView-3 (30 cm GSD) and GoogleEarth (50 cm GSD).
Tan, W.,
Qin, N.,
Ma, L.,
Li, Y.,
Du, J.,
Cai, G.,
Yang, K.,
Li, J.,
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic
Segmentation of Urban Roadways,
EarthVision20(797-806)
IEEE DOI
Dataset, LiDAR. Semantics, Roads, Laser radar, Sensors,
Machine learning, Automobiles
Hudson, W.H.,
Nadadur, D.C.,
Thornton, K.B.,
Liu, X.,
Haralick, R.M.,
The Radius CDROM Ground Truthed Data Set,
ARPA96(511-519).
Dataset. Ground truth buildings for other users.
ISPRS benchmark on urban object detection and 3D building reconstruction,
2013
HTML Version.
Dataset, Building Detection. Provide state-of-the-art data sets which can be used by interested
researchers in order to test own methods and algorithms on urban
object classification and building reconstruction.
Teller, S.[Seth],
Antone, M.[Matthew],
Bodnar, Z.[Zachary],
Bosse, M.[Michael],
Coorg, S.[Satyan],
Jethwa, M.[Manish],
Master, N.[Neel],
Calibrated, Registered Images of an Extended Urban Area,
IJCV(53), No. 1, June 2003, pp. 93-107.
DOI Link
Dataset, Buildings.
Earlier:
CVPR01(I:813-820).
IEEE DOI
More the dataset than how to analyze the data.
WWW Link.
See also
Spherical Mosaics with Quaternions and Dense Correlation.
Meinel, G.[Gotthard],
Burckhardt, M.[Manuel],
The Digital Basic Geodata Sets Hausumringe and Hauskoordinaten:
Characterization and Pre-processing for Building Stock Analysis,
PFG(2013), No. 6, 2013, pp. 575-588.
DOI Link
Dataset, Buildings.
Weber, E.[Ethan],
Papadopoulos, D.P.[Dim P.],
Lapedriza, A.[Agata],
Ofli, F.[Ferda],
Imran, M.[Muhammad],
Torralba, A.[Antonio],
Incidents1M: A Large-Scale Dataset of Images With Natural Disasters,
Damage, and Incidents,
PAMI(45), No. 4, April 2023, pp. 4768-4781.
IEEE DOI
Dataset, Disasters. Social networking (online), Task analysis, Satellites,
Computational modeling, Data models, Visualization, Training,
incident detection
ISPRS Test Project on Urban Classification and 3D Building Reconstruction,
LIDAR data for building descrtiptions.
WWW Link.
Dataset, Building Extraction. Used for ISPRS 3D Labeling contest.
Ye, Z.[Zhen],
Xu, Y.S.[Yu-Sheng],
Huang, R.[Rong],
Tong, X.H.[Xiao-Hua],
Li, X.[Xin],
Liu, X.F.[Xiang-Feng],
Luan, K.F.[Kui-Feng],
Hoegner, L.[Ludwig],
Stilla, U.[Uwe],
LASDU: A Large-Scale Aerial LiDAR Dataset for Semantic Labeling in
Dense Urban Areas,
IJGI(9), No. 7, 2020, pp. xx-yy.
DOI Link
Dataset, LiDAR.
Gao, W.X.[Wei-Xiao],
Nan, L.L.[Liang-Liang],
Boom, B.J.[Bas J.],
Ledoux, H.[Hugo],
SUM: A benchmark dataset of Semantic Urban Meshes,
PandRS(179), 2021, pp. 108-120.
Elsevier DOI
Dataset, Urban Data. Texture meshes, Urban scene understanding, Mesh annotation,
Semantic segmentation, Over-segmentation, Benchmark dataset
Helsinki.
Cruz, S.[Steve],
Hutchcroft, W.[Will],
Li, Y.G.[Yu-Guang],
Khosravan, N.[Naji],
Boyadzhiev, I.[Ivaylo],
Kang, S.B.[Sing Bing],
Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and
3D Room Layouts,
CVPR21(2133-2143)
IEEE DOI
Dataset, Floor Plans. Annotations, Layout, Urban areas, Semantics, Estimation
NYU Depth Dataset V2,
HTML Version.
Dataset, RGBD.
Dataset, Indoor Scenes.
See also
Indoor Segmentation and Support Inference from RGBD Images.
Hu, Q.Y.[Qing-Yong],
Yang, B.[Bo],
Khalid, S.[Sheikh],
Xiao, W.[Wen],
Trigoni, N.[Niki],
Markham, A.[Andrew],
SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point
Clouds,
IJCV(130), No. 2, February 2022, pp. 316-343.
Springer DOI
WWW Link.
Dataset, Urban. Urban scale point cloud
Cordts, M.,
Omran, M.,
Ramos, S.,
Rehfeld, T.,
Enzweiler, M.,
Benenson, R.,
Franke, U.,
Roth, S.,
Schiele, B.,
The Cityscapes Dataset for Semantic Urban Scene Understanding,
CVPR16(3213-3223)
IEEE DOI WWW Link.
WWW Link.
WWW Link.
WWW Link.
Dataset, City Models.
Ros, G.,
Sellart, L.,
Materzynska, J.,
Vazquez, D.,
Lopez, A.M.,
The SYNTHIA Dataset: A Large Collection of Synthetic Images for
Semantic Segmentation of Urban Scenes,
CVPR16(3234-3243)
IEEE DOI
Dataset, City Models.
Ma, Y.C.[Yan-Chun],
Liu, Y.J.[Yong-Jian],
Xie, Q.[Qing],
Xiong, S.W.[Sheng-Wu],
Bai, L.H.[Li-Hua],
Hu, A.[Anshu],
A Tibetan Thangka data set and relative tasks,
IVC(108), 2021, pp. 104125.
Elsevier DOI
Dataset, Tibetan Culture. Chomo Yarlung Tibet version 1.
Image data set, Thangka data set, Tibetan culture,
Semantic content analysis, Image processing
CyArk,
3-D, Laser data collection and archiving.
WWW Link.
Vendor, Cultural Heritage.
Dataset, Cultural Heritage. Digiatal Archive of the world's heritage sites for preservation and
education.
Not a vendor as such, but an archive and group that will collect the data.
Some are small, some huge. Used for sites being destroyed, or for
reconstruction. Visualizations, etc.
Weir, N.,
Lindenbaum, D.,
Bastidas, A.,
Etten, A.,
Kumar, V.,
Mcpherson, S.,
Shermeyer, J.,
Tang, H.,
SpaceNet MVOI: A Multi-View Overhead Imagery Dataset,
ICCV19(992-1001)
IEEE DOI
Dataset, Stereo. feature extraction, image classification,
image colour analysis, image resolution, image segmentation
UCF-ARG,
Online2012
WWW Link.
Dataset, Surveillance.
Earlier:
UCF Aerial Action Dataset,
WWW Link. A:Aerial Camera, R: Roof top camera, G: Ground camera.
3 views of different actions.
The aerial subset
Jha, S.S.[Sudhanshu Shekhar],
Nidamanuri, R.R.[Rama Rao],
Gudalur Spectral Target Detection (GST-D): A New Benchmark Dataset
and Engineered Material Target Detection in Multi-Platform Remote
Sensing Data,
RS(12), No. 13, 2020, pp. xx-yy.
DOI Link
Dataset, Targets. Target detection, or sparsely distributed materials.
Xia, G.S.[Gui-Song],
Bai, X.[Xiang],
Ding, J.[Jian],
Zhu, Z.[Zhen],
Belongie, S.[Serge],
Luo, J.B.[Jie-Bo],
Datcu, M.[Mihai],
Pelillo, M.[Marcello],
Zhang, L.P.[Liang-Pei],
DOTA: A Large-Scale Dataset for Object Detection in Aerial Images,
CVPR18(3974-3983)
IEEE DOI Dataset, Vehicle Detection.
WWW Link. Object detection, Earth, Sports, Sensors,
Marine vehicles, Image sensors
Matzen, K.[Kevin],
Snavely, N.[Noah],
NYC3DCars: A Dataset of 3D Vehicles in Geographic Context,
ICCV13(761-768)
IEEE DOI
Dataset, Vehicles. 3D models; geography; object detection; structure from motion
Zhang, T.W.[Tian-Wen],
Zhang, X.L.[Xiao-Ling],
Li, J.W.[Jian-Wei],
Xu, X.W.[Xiao-Wo],
Wang, B.Y.[Bao-You],
Zhan, X.[Xu],
Xu, Y.Q.[Yan-Qin],
Ke, X.[Xiao],
Zeng, T.J.[Tian-Jiao],
Su, H.[Hao],
Ahmad, I.[Israr],
Pan, D.[Dece],
Liu, C.[Chang],
Zhou, Y.[Yue],
Shi, J.[Jun],
Wei, S.[Shunjun],
SAR Ship Detection Dataset (SSDD): Official Release and Comprehensive
Data Analysis,
RS(13), No. 18, 2021, pp. xx-yy.
DOI Link
WWW Link.
Dataset, Ships.
Lei, S.L.[Song-Lin],
Lu, D.D.[Dong-Dong],
Qiu, X.L.[Xiao-Lan],
Ding, C.[Chibiao],
SRSDD-v1.0: A High-Resolution SAR Rotation Ship Detection Dataset,
RS(13), No. 24, 2021, pp. xx-yy.
DOI Link
Dataset, Ship Detection.
Boat Detection,
Online2019
HTML Version.
Dataset, Ships.
WWW Link.
Public video dataset for boat detection/tracking from UAV video footage
See also
MULTIDRONE.
See also
Racing Bicycle Detection/Tracking from UAV Footage, UAV Detection.
Di, Y.H.[Yang-Hua],
Jiang, Z.G.[Zhi-Guo],
Zhang, H.[Haopeng],
A Public Dataset for Fine-Grained Ship Classification in Optical
Remote Sensing Images,
RS(13), No. 4, 2021, pp. xx-yy.
DOI Link
Dataset, Ships.
Liu, Z.Y.[Zhao-Ying],
Waqas, M.[Muhammad],
Yang, J.[Jia],
Rashid, A.[Ahmar],
Han, Z.[Zhu],
A Multi-Task CNN for Maritime Target Detection,
SPLetters(28), 2021, pp. 434-438.
IEEE DOI
Dataset, Ship Detection. MaRine ShiP (MRSP-13) Dataset.
Marine vehicles, Task analysis, Object detection,
Image segmentation, Boats, Feature extraction, Annotations,
cross-layer connections
He, B.[Boyong],
Li, X.J.[Xian-Jiang],
Huang, B.[Bo],
Gu, E.[Enhui],
Guo, W.J.[Wei-Jie],
Wu, L.[Liaoni],
UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in
Aerial Images,
RS(13), No. 24, 2021, pp. xx-yy.
DOI Link
Dataset, Ship Detection.
Gundogdu, E.[Erhan],
Solmaz, B.[Berkan],
Yücesoy, V.[Veysel],
Koç, A.[Aykut],
MARVEL: A Large-Scale Image Dataset for Maritime Vessels,
ACCV16(V: 165-180).
Springer DOI
Dataset, Ships.
Melzi, P.[Pietro],
Rodriguez-Albala, J.M.[Juan Manuel],
Morales, A.[Aythami],
Tolosana, R.[Ruben],
Fierrez, J.[Julian],
Vera-Rodriguez, R.[Ruben],
Fishing Gear Classification from Vessel Trajectories and Velocity
Profiles: Database and Benchmark,
IbPRIA23(629-638).
Springer DOI
Dataset, Ship Tracking.
WWW Link. To detect illegal fishing.
Mostajabi, M.[Mohammadreza],
Wang, C.M.[Ching Ming],
Ranjan, D.[Darsh],
Hsyu, G.[Gilbert],
High Resolution Radar Dataset for Semi-Supervised Learning of Dynamic
Objects,
PBVS20(450-457)
IEEE DOI
Dataset, Radar. Synthetic aperture radar, Radar imaging, Spaceborne radar,
Image resolution, Apertures, Azimuth
Japanese Character Image Database,
Cedar (Buffalo) database.
Dataset, OCR.
WWW Link. Test data for Japanese OCR.
Wang, D.H.[Da-Han],
Liu, C.L.[Cheng-Lin],
Yu, J.L.[Jin-Lun],
Zhou, X.D.[Xiang-Dong],
CASIA-OLHWDB1: A Database of Online Handwritten Chinese Characters,
ICDAR09(1206-1210).
IEEE DOI
Dataset, OCR.
Zhou, S.[Shusen],
Chen, Q.C.[Qing-Cai],
Wang, X.L.[Xiao-Long],
Guo, X.[Xinyi],
Li, H.[Hui],
An Empirical Evaluation on HIT-OR3C Database,
ICDAR11(1150-1154).
IEEE DOI
Dataset, OCR. Handwriting Chinese character database (HIT-OR3C)
Yan, H.Y.[Han-Yu],
Jin, L.W.[Lian-Wen],
Viard-Gaudin, C.[Christian],
Mouchere, H.[Harold],
SCUT-COUCH Textline_NU:
An Unconstrained Online Handwritten Chinese Text Lines Dataset,
FHR10(581-586).
IEEE DOI
Dataset, Chinese Characters.
Zhang, H.G.[Hong-Gang],
Guo, J.[Jun],
Chen, G.[Guang],
Li, C.G.[Chun-Guang],
HCL2000: A Large-scale Handwritten Chinese Character Database for
Handwritten Character Recognition,
ICDAR09(286-290).
IEEE DOI
Dataset, OCR.
Liu, C.L.[Cheng-Lin],
Yin, F.[Fei],
Wang, D.H.[Da-Han],
Wang, Q.F.[Qiu-Feng],
Online and offline handwritten Chinese character recognition:
Benchmarking on new databases,
PR(46), No. 1, January 2013, pp. 155-162.
Elsevier DOI
Earlier:
CASIA Online and Offline Chinese Handwriting Databases,
ICDAR11(37-41).
IEEE DOI
Dataset, OCR. Handwritten Chinese character recognition; Online; Offline; Databases;
Benchmarking
See also
Touching Character Database from Chinese Handwriting for Assessing Segmentation Algorithms, A.
Hull, J.J.,
A Database for Handwritten Text Recognition Research,
PAMI(16), No. 5, May 1994, pp. 550-554.
IEEE DOI
Dataset, Handwriting.
Handwriting Database.
Ground Truthed Handwritten Word Images,
Cambridge University dataset.
Dataset, Handwriting.
HTML Version.
On-line Handwriting Database,
Tokyo Univ. of Agri. & Tech., Nakagawa Laboratory.
Dataset, Handwriting.
WWW Link.
Shivram, A.,
Ramaiah, C.,
Setlur, S.,
Govindaraju, V.,
IBM_UB_1: A Dual Mode Unconstrained English Handwriting Dataset,
ICDAR13(13-17)
IEEE DOI
Dataset, OCR. handwriting recognition
Ben Abdelghani, I.A.[Imen Abroug],
Ben Amara, N.E.[Najoua Essoukri],
SID Signature Database:
A Tunisian Off-line Handwritten Signature Database,
EAHSP13(131-139).
Springer DOI
Dataset, Signatures.
Kleber, F.,
Fiel, S.,
Diem, M.,
Sablatnig, R.,
CVL-DataBase: An Off-Line Database for Writer Retrieval, Writer
Identification and Word Spotting,
ICDAR13(560-564)
IEEE DOI
Dataset, Writer Identification. XML
The Street View House Numbers (SVHN) Dataset ,
2011
WWW Link.
Dataset, House Numbers. 600,000 digit images.
USPS Office of Advanced Technology Database of Handwritten Cities,
States, ZIP Codes, Digits, and Alphabetic Characters,
Cedar (Buffalo) database.
Dataset, Handwriting.
WWW Link. Database for mail processing.
Dimauro, G.,
Impedovo, S.,
Modugno, R.,
Pirlo, G.,
A new database for research on bank-check processing,
FHR02(524-528).
IEEE Top Reference.
Dataset, Checks.
Ma, L.L.,
Liu, J.[Ji],
Wu, J.,
A new database for online handwritten Mongolian word recognition,
ICPR16(1131-1136)
IEEE DOI
Dataset, Mongolian Characters. Character recognition, Databases, Handwriting recognition, Layout,
Sampling methods, Training, Writing, CNN, MRG-OHMW, annotation,
evaluation, online, handwritten, Mongolian, word, recognition
Ali, H.[Hazrat],
UHaT: Urdu handwritten text dataset,
2020
WWW Link.
Dataset, Urdu. Urdu handwritten characters and digits.
Das, N.[Nibaran],
Acharya, K.[Kallol],
Sarkar, R.[Ram],
Basu, S.[Subhadip],
Kundu, M.[Mahantapas],
Nasipuri, M.[Mita],
A benchmark image database of isolated Bangla handwritten compound
characters,
IJDAR(17), No. 4, December 2014, pp. 413-431.
Springer DOI WWW Link.
Dataset, Bangla.
Sarkar, R.[Ram],
Das, N.[Nibaran],
Basu, S.[Subhadip],
Kundu, M.[Mahantapas],
Nasipuri, M.[Mita],
Basu, D.K.[Dipak Kumar],
CMATERdb1: A database of unconstrained handwritten Bangla and
Bangla-English mixed script document image,
IJDAR(15), No. 1, March 2012, pp. 71-83.
WWW Link.
Dataset, Bangla.
Nethravathi, B.,
Archana, C.P.,
Shashikiran, K.,
Ramakrishnan, A.G.,
Kumar, V.[Vijay],
Creation of a Huge Annotated Database for Tamil and Kannada OHR,
FHR10(415-420).
IEEE DOI
Dataset, OCR.
Sagheer, M.W.[Malik Waqas],
He, C.L.[Chun Lei],
Nobile, N.[Nicola],
Suen, C.Y.[Ching Y.],
A New Large Urdu Database for Off-Line Handwriting Recognition,
CIAP09(538-546).
Springer DOI
Dataset, Urdu Handwriting.
ERIM Arabic Document Database,
Machine printed Arabic documents.
Dataset, OCR.
Dataset, Arabic.
HTML Version.
Mahmoud, S.A.[Sabri A.],
Ahmad, I.[Irfan],
Al-Khatib, W.G.[Wasfi G.],
Alshayeb, M.[Mohammad],
Parvez, M.T.[Mohammad Tanvir],
Märgner, V.[Volker],
Fink, G.A.[Gernot A.],
KHATT: An open Arabic offline handwritten text database,
PR(47), No. 3, 2014, pp. 1096-1112.
Elsevier DOI
Dataset, Arabic Text. Arabic handwritten text database
Mahmoud, S.A.[Sabri A.],
Ahmad, I.[Irfan],
Alshayeb, M.[Mohammad],
Al-Khatib, W.G.[Wasfi G.],
Parvez, M.T.[Mohammad Tanvir],
Fink, G.A.[Gernot A.],
Margner, V.[Volker],
El Abed, H.[Haikal],
KHATT: Arabic Offline Handwritten Text Database,
FHR12(449-454).
IEEE DOI
Dataset, Handwritting, Arabic.
Lamghari, N.[Nidal],
Raghay, S.[Said],
DBAHCL: database for Arabic handwritten characters and ligatures,
MultInfoRetr(6), No. 3, September 2017, pp. 263-269.
Springer DOI
Dataset, Arabic Characters.
Al Maadeed, S.[Somaya],
Ayouby, W.[Wael],
Hassaine, A.[Abdelaali],
Aljaam, J.M.[Jihad Mohamad],
QUWI: An Arabic and English Handwriting Dataset for Offline Writer
Identification,
FHR12(746-751).
IEEE DOI
Dataset, Arabic.
Soleimani, A.,
Fouladi, K.,
Araabi, B.N.,
UTSig: A Persian offline signature dataset,
IET-Bio(6), No. 1, 2017, pp. 1-8.
DOI Link
Dataset, Persian. handwriting recognition
Ziaratban, M.[Majid],
Faez, K.[Karim],
Bagheri, F.[Fatemeh],
FHT: An Unconstraint Farsi Handwritten Text Database,
ICDAR09(281-285).
IEEE DOI
Dataset, OCR.
Haghighi, P.J.[Puntis Jifroodian],
Nobile, N.[Nicola],
He, C.L.[Chun Lei],
Suen, C.Y.[Ching Y.],
A New Large-Scale Multi-purpose Handwritten Farsi Database,
ICIAR09(278-286).
Springer DOI
Dataset, Farsi Handwriting.
NIST OCR Databases,
2005.
WWW Link.
Dataset, OCR.
Dataset, Documents. A series of datasets for OCR and document analysis.
Sauvola, J.,
Kauniskangas, H.,
Media Team Document Database II,
Online1999.
WWW Link.
Dataset, Document Analysis.
Todoran, L.[Leon],
Worring, M.[Marcel],
Smeulders, A.W.M.[Arnold W. M.],
The UvA color document dataset,
IJDAR(7), No. 4, September 2005, pp. 228-240.
Springer DOI
Dataset, Documents.
Earlier:
Data GroundTruth, Complexity, and Evaluation Measures for Color
Document Analysis,
DAS02(519 ff.).
Springer DOI
Bukhari, S.S.[Syed Saqib],
Shafait, F.[Faisal],
Breuel, T.M.[Thomas M.],
The IUPR Dataset of Camera-Captured Document Images,
CBDAR11(164-171).
Springer DOI
Dataset, Document Images.
Nagy, R.[Robert],
Dicker, A.[Anders],
Meyer-Wegener, K.[Klaus],
NEOCR: A Configurable Dataset for Natural Image Text Recognition,
CBDAR11(150-163).
Springer DOI
Dataset, Natural Image Text.
Ibrahim, A.[Ahmed],
Abbott, A.L.[A. Lynn],
Hussein, M.E.[Mohamed E.],
An Image Dataset of Text Patches in Everyday Scenes,
ISVC16(II: 291-300).
Springer DOI
Dataset, Scene Text.
Ikica, A.[Andrej],
Peer, P.[Peter],
Computer Vision Lab OCR DataBase: CVL OCR DB,
2011. A public annotated image database of text in natural scenes
WWW Link.
Dataset, Text in Images.
Guerin, C.,
Rigaud, C.,
Mercier, A.,
Ammar-Boudjelal, F.,
Bertet, K.,
Bouju, A.,
Burie, J.C.,
Louis, G.,
Ogier, J.M.,
Revel, A.,
eBDtheque: A Representative Database of Comics,
ICDAR13(1145-1149)
IEEE DOI
Dataset, Comics. entertainment
Schölch, L.[Lukas],
Steinhäuser, J.[Jonas],
Beichter, M.[Maximilian],
Seibold, C.[Constantin],
Yang, K.L.[Kai-Lun],
Knaeble, M.[Merlin],
Schwarz, T.[Thorsten],
Maedche, A.[Alexander],
Stiefelhagen, R.[Rainer],
Towards Automatic Parsing of Structured Visual Content through the
Use of Synthetic Data,
ICPR22(1607-1613)
IEEE DOI
Dataset, Graphics.
WWW Link. Training, Measurement, Visualization, Annotations,
Supervised learning, Static VAr compensators, Solids
Quiniou, S.[Solen],
Mouchere, H.[Harold],
Saldarriaga, S.P.[Sebastián Pen],
Viard-Gaudin, C.[Christian],
Morin, E.[Emmanuel],
Petitrenaud, S.[Simon],
Medjkoune, S.[Sofiane],
HAMEX: A Handwritten and Audio Dataset of Mathematical Expressions,
ICDAR11(452-456).
IEEE DOI
Dataset, OCR.
Stria, J.[Jan],
Bresler, M.[Martin],
Prua, D.[Daniel],
Hlavác, V.[Vaclav],
MfrDB: Database of Annotated On-Line Mathematical Formulae,
FHR12(542-547).
IEEE DOI
Dataset, Formula.
FlickrLogos-32,
2013
WWW Link.
Dataset, Logos. 32 logo classes, various orientations, surface shapes, etc.
UMD Logo Database,
Univ. Maryland database of 106 corportate logos.
Dataset, Logos.
HTML Version.
CrisisMMD Dataset,
2017.
WWW Link.
Dataset, Disasters.
CrisisMMD Dataset. Multimodal Twitter dataset consists of several thousands of manually
annotated tweets and images collected during seven major natural
disasters including earthquakes, hurricanes, wildfires, and floods
that happened in the year 2017.
Yang, Z.L.[Zhong-Liang],
Wang, K.[Ke],
Ma, S.[Sai],
Huang, Y.F.[Yong-Feng],
Kang, X.G.[Xian-Gui],
Zhao, X.F.[Xian-Feng],
ISTEGO100K: Large-scale Image Steganalysis Dataset,
IWDW19(352-364).
Springer DOI
Dataset, Setganalysis.
Rocha, A.[Anderson],
Goldenstein, S.K.[Siome K.],
Scheirer, W.J.[Walter J.],
Boult, T.E.[Terrance E.],
The Unseen Challenge data sets,
WVU08(1-8).
IEEE DOI
Dataset, Steganalysis.
Bai, W.M.[Wei-Ming],
Zhang, Z.P.[Zhi-Peng],
Li, B.[Bing],
Wang, P.[Pei],
Li, Y.X.[Yang-Xi],
Zhang, C.X.[Cong-Xuan],
Hu, W.M.[Wei-Ming],
Robust Texture-Aware Computer-Generated Image Forensic:
Benchmark and Algorithm,
IP(30), 2021, pp. 8439-8453.
IEEE DOI
Dataset, Image Forensics.
WWW Link. Ddistinguish computer generated from photographic images.
Benchmark testing, Feature extraction, Image forensics, Task analysis,
computer-generated images forensic
Unipen Project,
Online1994.
Dataset, Handwriting.
WWW Link. This is a working group organized through IAPR to maintain and protect (ensure
available to researchers) various databases of handwriting data.
Njah, S.[Sourour],
Ben Nouma, B.[Badreddine],
Bezine, H.[Hala],
Alimi, A.M.[Adel M.],
MAYASTROUN: A Multilanguage Handwriting Database,
FHR12(308-312).
IEEE DOI
Dataset, Handwriting.
Pérez, D.[Daniel],
Tarazón, L.[Lionel],
Serrano, N.[Nicolás],
Castro, F.[Francisco],
Terrades, O.R.[Oriol Ramos],
Juan, A.[Alfons],
The GERMANA Database,
ICDAR09(301-305).
IEEE DOI
Dataset, OCR. Handwritten Spanish manuscript from 1891.
Shi, Z.H.[Zheng-Hao],
Sand Dust Image DAta,
March 26, 2020.
Pictures in sandstorms.
DOI Link
WWW Link.
Dataset, Sandstorm.
Aytekin, Ç.,
Nikkanen, J.,
Gabbouj, M.,
A Data Set for Camera-Independent Color Constancy,
IP(27), No. 2, February 2018, pp. 530-544.
IEEE DOI
Dataset, Color Constancy. Cameras, Image color analysis, Lighting, Reflectivity, Robustness,
Sensitivity, Training, Color constancy, color shading,
platform independence
Barnard, K.[Kobus],
Martin, L.[Lindsay],
Funt, B.V.[Brian V.], and
Coath, A.[Adam],
A Data Set for Colour Research,
ColorRes(27), No 3, 2002, pp. 147-151.
HTML Version.
Dataset, Color Constancy.
HTML Version.
Soundararajan, P.[Padmanabhan],
Sarkar, S.[Sudeep],
An in-depth study of graph partitioning measures for perceptual
organization,
PAMI(25), No. 6, June 2003, pp. 642-660.
IEEE Abstract.
Evaluation, Segmentation.
WWW Link.
Code, Perceptual Grouping.
Dataset, Perceptual Grouping.
Earlier:
Empirical evaluation of graph partitioning measures for perceptual
organization,
EEMCV01(xx-yy).
Quality of groups generated by
minimum (
See also
Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation, An. ) or
average (
See also
Supervised Learning of Large Perceptual Organization: Graph Spectral Partitioning and Learning Automata. ) or
normalized (
See also
Normalized Cuts and Image Segmentation. ) cuts
are equivalent for recognition.
Wang, T.C.[Ting-Chun],
Zhu, J.Y.[Jun-Yan],
Hiroaki, E.[Ebi],
Chandraker, M.[Manmohan],
Efros, A.A.[Alexei A.],
Ramamoorthi, R.[Ravi],
A 4D Light-Field Dataset and CNN Architectures for Material Recognition,
ECCV16(III: 121-138).
Springer DOI
Dataset, Material Recognition.
Large Geometric Models Archive,
2008
WWW Link.
Dataset, 3-D Models. Detailed 3-D models from Georgia Institute of Technology.
Especially for graphics.
See also
Georgia Tech.
Digne, J.[Julie],
Audfray, N.[Nicolas],
Lartigue, C.[Claire],
Mehdi-Souzani, C.[Charyar],
Morel, J.M.[Jean-Michel],
Farman Institute 3D Point Sets: High Precision 3D Data Sets,
IPOL(2011), No. 1, 2011, pp. xx-yy.
DOI Link
Dataset, 3D Data.
ISPRS Terrestrial laser scanning and 3D imaging Datasets,
2008.
HTML Version.
Dataset, 3-D Data. 3-D datasets for large scale objects.
Sanmarina Byzantine church and Golden Buddha.
NaturePix: Visual Cognitive Modeling Research,
2007.
WWW Link.
Dataset, 3-D Data. ASU 3-D datasets.
Replaces former ASU dataset?
The Stanford 3D Scanning Repository,
2007.
WWW Link.
Dataset, 3-D Data. Stanford graphics databases
The Beazley Archive of Classical Art Pottery Database,
July 2013
WWW Link.
Dataset, Pottery.
Oliva, A.[Aude],
Torralba, A.B.[Antonio B.],
Modeling the Shape of the Scene:
A Holistic Representation of the Spatial Envelope,
IJCV(42), No. 3, May-June 2001, pp. 145-175.
DOI Link WWW Link.
Dataset, Outdoor Secens.
Earlier:
Scene-Centered Description from Spatial Envelope Properties,
BMCV02(263 ff.).
Springer DOI
Otherwise known as OSR dataset.
Spatial envelope: low dimensional representation of the secen.
Perceptual dimensions to represent the dominat satial structure.
Memotion Dataset 7k,
2019.
WWW Link.
Dataset, Sentinment.
Memotion Dataset. Dataset for sentiment classification of memes.
Patro, B.N.[Badri N.],
Lunayach, M.[Mayank],
Srivastava, D.[Deepankar],
Sarvesh, S.[Sarvesh],
Singh, H.[Hunar],
Namboodiri, V.P.[Vinay P.],
Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms,
WACV21(576-585)
IEEE DOI WWW Link.
Dataset, Humor. Annotations, Semantics, Bit error rate,
Manuals, Task analysis
Uy, M.A.,
Pham, Q.,
Hua, B.,
Nguyen, T.,
Yeung, S.,
Revisiting Point Cloud Classification: A New Benchmark Dataset and
Classification Model on Real-World Data,
ICCV19(1588-1597)
IEEE DOI
Dataset, Point Cloud.
WWW Link. CAD, feature extraction,
learning (artificial intelligence), neural nets
Hodan, T.[Tomáš],
Haluza, P.[Pavel],
Obdržálek, Š.[Štepán],
Matas, J.G.[Jirí G.],
Lourakis, M.[Manolis],
Zabulis, X.[Xenophon],
T-LESS:
An RGB-D Dataset for 6D Pose Estimation of Texture-Less Objects,
WACV17(880-888)
IEEE DOI
Dataset, RBG-D.
WWW Link. (Slow response)
Image color analysis, Image sensors, Pose estimation, Sensors,
Solid modeling, Training
Bellmann, A.[Anke],
Hellwich, O.[Olaf],
Rodehorst, V.[Volker],
Yilmaz, U.[Ulas],
A Benchmarking Dataset for Performance Evaluation of Automatic Surface
Reconstruction Algorithms,
BenCOS07(1-8).
IEEE DOI
Dataset, Surface Reconstruction.
Lee, S.K.[Seung-Kyu],
Liu, Y.X.[Yan-Xi],
Curved Glide-Reflection Symmetry Detection,
PAMI(34), No. 2, February 2012, pp. 266-278.
IEEE DOI
Earlier:
CVPR09(1046-1053).
IEEE DOI
Generalize Bilateral reflection symmetry to curved glide-reflection.
Leaf images.
Dataset, Symmetry Images.
Alpha Matting Evaluation Website,
2009.
WWW Link.
Dataset, Image Matting.
See also
perceptually motivated online benchmark for image matting, A.
Peng, B.[Bo],
Zhang, M.L.[Ming-Liang],
Lei, J.J.[Jian-Jun],
Fu, H.Z.[Hua-Zhu],
Shen, H.F.[Hai-Feng],
Huang, Q.M.[Qing-Ming],
RGB-D Human Matting:
A Real-World Benchmark Dataset and a Baseline Method,
CirSysVideo(33), No. 8, August 2023, pp. 4041-4053.
IEEE DOI
Dataset, Matting. Task analysis, Image color analysis, Semantics, Manuals,
Benchmark testing, Feature extraction, Semantic segmentation,
baseline
How2 Dataset,
2019
WWW Link. Instructional videos
Used in How2 Challenge at ICML 2009
Dataset, Instructional Video.
YouCook2,
2018
WWW Link. Cooking videos
Dataset, Instructional Video.
Alayrac, J.B.[Jean-Baptiste],
Bojanowski, P.[Piotr],
Agrawal, N.[Nishant],
Sivic, J.[Josef],
Laptev, I.[Ivan],
Lacoste-Julien, S.[Simon],
Learning from Narrated Instruction Videos,
PAMI(40), No. 9, September 2018, pp. 2194-2208.
IEEE DOI
Dataset, Instructional Video.
WWW Link.
Earlier:
Unsupervised Learning from Narrated Instruction Videos,
CVPR16(4575-4583)
IEEE DOI
Videos, Automobiles, Visualization, Tires, YouTube, Internet, Pragmatics,
Step discovery, narrated instruction videos, unsupervised learning.
Text and images from video for learning the steps.
Tang, Y.S.[Yan-Song],
Ding, D.J.[Da-Jun],
Rao, Y.M.[Yong-Ming],
Zheng, Y.[Yu],
Zhang, D.Y.[Dan-Yang],
Zhao, L.[Lili],
Lu, J.W.[Ji-Wen],
Zhou, J.[Jie],
COIN: A Large-Scale Dataset for Comprehensive Instructional Video
Analysis,
CVPR19(1207-1216).
IEEE DOI
Dataset, Instructional Video.
WWW Link.
Miech, A.,
Zhukov, D.,
Alayrac, J.,
Tapaswi, M.,
Laptev, I.,
Alayrac, J.B.[Jean-Baptiste],
HowTo100M: Learning a Text-Video Embedding by Watching Hundred
Million Narrated Video Clips,
ICCV19(2630-2640)
IEEE DOI
WWW Link.
Dataset, Instructional Video. Internet, learning (artificial intelligence),
natural language processing, social networking (online),
Computational modeling
Zhukov, D.[Dimitri],
Alayrac, J.B.[Jean-Baptiste],
Cinbis, R.G.[Ramazan Gokberk],
Fouhey, D.[David],
Laptev, I.[Ivan],
Sivic, J.[Josef],
Cross-Task Weakly Supervised Learning From Instructional Videos,
CVPR19(3532-3540).
IEEE DOI
Dataset, Instructional Video.
WWW Link.
STVD-FC: Large-Scale TV Dataset - Fact Checking',
2023
WWW Link.
Dataset, Content Analysis. Public dataset on the political content analysis and fact-checking
tasks. It consists of more than 1,200 fact-checked claims that have
been scraped from a fact-checking service with associated metadata.
Xu, Q.Z.[Qing-Zheng],
Chen, H.Q.[Hui-Qiang],
Du, H.M.[He-Ming],
Zhang, H.[Hu],
Lukasik, S.[Szymon],
Zhu, T.Q.[Tian-Qing],
Yu, X.[Xin],
M3A: A multimodal misinformation dataset for media authenticity
analysis,
CVIU(249), 2024, pp. 104205.
Elsevier DOI
Dataset, Misinformation. Misinformation detection, Media authenticity, Multimodal dataset
Vidal, R.G.[Rosaura G.],
Banerjee, S.[Sreya],
Grm, K.[Klemen],
Struc, V.[Vitomir],
Scheirer, W.J.[Walter J.],
UG^2: a Video Benchmark for Assessing the Impact of Image Restoration
and Enhancement on Automatic Visual Recognition,
WWW Link.
Earlier:
WACV18(1597-1606)
IEEE DOI
Dataset, Image Restoration. Used for restoration challenges at CVPR.
image classification, image enhancement, image restoration,
learning (artificial intelligence), object detection,
Visualization
Liu, X.W.[Xin-Wei],
Pedersen, M.[Marius],
Hardeberg, J.Y.[Jon Yngve],
CID:IQ: A New Image Quality Database,
ICISP14(193-202).
Springer DOI
Dataset, Image Quality.
Sun, W.[Wen],
Zhou, F.[Fei],
Liao, Q.M.[Qing-Min],
MDID: A Multiply Distorted Image Database for Image Quality
Assessment,
PR(61), No. 1, 2017, pp. 153-168.
Elsevier DOI
Dataset, Image Quality. Image database
Virtanen, T.,
Nuutinen, M.,
Vaahteranoksa, M.,
Oittinen, P.,
Hakkinen, J.,
CID2013: A Database for Evaluating No-Reference Image Quality
Assessment Algorithms,
IP(24), No. 1, January 2015, pp. 390-402.
IEEE DOI
Dataset, Image Quality. cameras
Gao, W.[Wei],
Yuan, H.[Hang],
Liao, G.[Guibiao],
Guo, Z.X.[Zi-Xuan],
Chen, J.N.[Jia-Ning],
PP8K: A New Dataset for 8K UHD Video Compression and Processing,
MultMedMag(30), No. 3, July 2023, pp. 100-109.
IEEE DOI WWW Link.
Dataset, Video Compression.
Lin, J.Y.[Joe Yuchieh],
Song, R.[Rui],
Wu, C.H.[Chi-Hao],
Liu, T.J.[Tsung-Jung],
Wang, H.[Haiqiang],
Kuo, C.C.J.[C.C. Jay],
MCL-V: A streaming video quality assessment database,
JVCIR(30), No. 1, 2015, pp. 1-9.
Elsevier DOI
Dataset, Video Streaming. Video quality
SAVAM, Visual Salience Dataset,
Saliency dataset.
WWW Link.
Dataset, Saliency.
41 scenes, eyetracker, high res, left and right stereo views.
Paper reference:
See also
Semiautomatic visual-attention modeling and its application to video compression.
Anaya, J.[Josue],
Barbu, A.[Adrian],
RENOIR-A dataset for real low-light image noise reduction,
JVCIR(51), 2018, pp. 144-154.
Elsevier DOI
Dataset, Noise Reduction. Image denoising, Denoising dataset, Low light noise,
Poisson-Gaussian noise model
The Chinese University of Hong Kong,
Computer Vision Laboratory
WWW Link.
Research Group, Hong Kong.
PETA: Pedestrian Attribute Recognition At Far Distance,
Dataset, Pedestrians.
HTML Version.
19,000 images.
Large-scale Fashion (DeepFashion) Dataset,
2016.
HTML Version.
Dataset, Fashion. 800,000 fashion images.
In-Shop Clothes Retrieval Database.
Lotus Hill Institute,
Imageparsing
WWW Link.
Research Group, China.
Dataset, Segmentation.
Code, Viewing. The Imageparsing site is devoted to providing ground truth datasets and
Matlab code for annotation and viewing.
See also
LHI Object Datasets.
See also
LHI Sports Activity Dataset.
See also
LHI Segmentation Dataset.
See also
LHI Surveillance Dataset.
Oxford,
Robotics.
WWW Link.
Visual Geometry Group
WWW Link.
Research Group, UK. Active vision, visual geometry, medical imaging, manufacturing systems,
sonar, robotics.
Oxford Image Examples,
Dataset.
HTML Version.
See also
Oxford Town Center.
Swiss Federal Institute of Technology in Zurich,
ETHComputer Vision Lab:
WWW Link.
Research Group, Switzerland. Interpretation of 2D and 3D image data sets from conventional
and non-conventional image sources.
Photogrammetry group:
WWW Link.
Aerial Image Dataset,
Dataset, Aerial Images.
WWW Link.
University Jaume I,
Institute of New Imaging Technologies
WWW Link.
Computer Vision Group
WWW Link.
Research Group, Spain. Spectral Imaging.
Spectral Imaging Data Base,
Dataset, Spectral Imaging.
WWW Link.
University of Toronto,
RBCV-TRAnd
Toronto
WWW Link.
eyeTap Personal Imaging Lab:
WWW Link.
Research Group, Canada. Open Vidia code.
See also
OpenVidia. Large group.
CIFAR-10 and CIFAR-100 Datasets,
Dataset, Tiny Images.
HTML Version.
10 classes, 10000 images per class. Or 100 classes t00 images each.
Abel Stock,
Commercial image database
WWW Link.
Dataset, Images.
California Institute of Technology,
Computational Vision Group
WWW Link.
Research Group, US. Computational foundations of vision. A number of datasets are available
online.
CalTech 101 Objects Categories,
Dataset, Objects.
HTML Version.
CalTech 256 Objects Categories,
Dataset, Objects.
WWW Link.
30607 images, 256 categories.
CalTech 100 Natural Scenes,
Dataset, Natural Scenes.
WWW Link.
CalTech 10000 Web Faces,
Dataset, Faces.
WWW Link.
CalTech Turntable Images,
Dataset, 3D Data.
WWW Link.
144 calibrated viewpoints, 3 lighting variations.
CalTech Archived Images,
Dataset, Images.
HTML Version.
CalTech-UCSD Birds 200 2011,
CUB-200-2011
Dataset, Images.
HTML Version.
Dataset, Birds. Extension of the CUB-200 dataset.
Massachusetts Institute of Technology, AI Lab,
Computer Science and Artificial Intelligence Lab
CSAILAI group memo
MIT AI Memoor
MIT AIor
MIT AIMAI Memos are shorter reports.
MIT AI-TRor
MIT AI TRAI Tech Reports are longer (often the thesis). Also Project MAC
Technical Reports
MAC-TRMost are available through:
AI TR and Memo series go to 2004, then the CSAIL series.
WWW Link.
CS & AI Lab Vision Research:
WWW Link.
Activity, learning, medical vision, and vision interfaces.
Perceptual Science Group:
WWW Link.
Sensing Perception Autonomy and Robot Kinetics
WWW Link.
Motion Magnification
WWW Link.
Research Group, US.
MIT Places Database for Scene Recognition,
Dataset, Recognition.
WWW Link. 205 scene categories, 2.5Million images.
SUN 397 Database,
Ohio State University,
Signal Analysis and Machine Perception Laboratory (SAMPL)
WWW Link.
Research Group, US. Broad research areas, hyper and multi-spectral, aerial images,
medical images, range processing, human motion, inspection.
Various datasets for 2-D and 3-D data.
OSU Datasets,
Dataset, Images.
HTML Version.
Princeton,
PrincetonComputer Science Department. Computer vision group.
WWW Link.
Research Group, US. Human action classification.
Dataset.
WWW Link.
SUNRGBD: A RGB-D Scene Understanding Benchmark Suite,
Dataset, RGBD.
WWW Link.
Indoor Scenes.
University of Illinois,
Urbana-Champaign
Various Departments,
UIUCOr
IllinoisVision Lab page:
WWW Link.
Quantitative Light Imaging (QLI) Laboratory
WWW Link.
Research Group, US. Robotics, Textures, 3-D recognition and representation, cameras, rendering,
HCI.
University of Illinois Datasets,
Dataset, Texture. 25 textures, 40 samples.
Dataset, Natural Scenes. 15 Categories.
Dataset, Stereo Data. 9 objects, 80 images
Dataset, Multi-View Data. 10 datasets, 24 images of a single object each.
Dataset, Visual Hull.
Dataset, Object Recognition. Birds, Butterflys, etc.
Dataset, Video.
WWW Link.
University of Southern California, Signal and Image Processing,
USC_SIPI
WWW Link.
Research Group, US.
Dataset, Images. Image processing. Some of the old standard image datasets (texture,
vehicles, compression).
Marszalek, M.[Marcin],
Schmid, C.[Cordelia],
Accurate Object Recognition with Shape Masks,
IJCV(97), No. 2, April 2012, pp. 191-209.
WWW Link.
Earlier:
Accurate Object Localization with Shape Masks,
CVPR07(1-8).
IEEE DOI
Dataset, People.
WWW Link. The dataset includes annotations. Derived from Graz dataset.
WWW Link.
PhotoTourism, Matching Challenge Dataset,
2020.
Dataset, Matching.
WWW Link. PhotoTourism dataset.
Large baseline matching.
Yang, G.[Gehua],
Stewart, C.V.[Charles V.],
Sofka, M.[Michal],
Tsai, C.L.[Chia-Ling],
Registration of Challenging Image Pairs:
Initialization, Estimation, and Decision,
PAMI(29), No. 11, November 2007, pp. 1973-1989.
IEEE DOI
Dataset, Matching.
HTML Version.
Earlier:
Automatic robust image registration system:
Initialization, estimation, and decision,
CVS06(23).
IEEE DOI
STVD-PVCD: Large-Scale TV Dataset,
2022.
WWW Link.
Dataset, Video Copy Detection.
Dataset, Copy Detection.
STVD is a public dataset on the Partial Video Copy Detection (PVCD) task.
It was
constituted with about 83,000 videos of more
than 10,000 hours duration and including more than 420,000
video copy pairs. It offers different test sets for
fine performance characterization (frame degradation, global
transformation, video speeding, etc.) with a frame level annotation
for real-time detection and video alignment. Baseline comparisons
are reported to show a room for improvement.
See also
Large-scale TV Dataset for Partial Video Copy Detection, A.
See also
University of Tours.
Zhang, J.C.[Jun-Cheng],
Liao, Q.M.[Qing-Min],
Liu, S.J.[Shao-Jun],
Ma, H.Y.[Hao-Yu],
Yang, W.M.[Wen-Ming],
Xue, J.H.[Jing-Hao],
Real-MFF: A large realistic multi-focus image dataset with ground
truth,
PRL(138), 2020, pp. 370-377.
Elsevier DOI
Dataset, Multi-Focus. Image fusion, Multi-focus images, Multi-focus dataset, Deep learning
Caye-Daudt, R.[Rodrigo],
Le Saux, B.[Bertrand],
Boulch, A.[Alexandre],
Gousseau, Y.[Yann],
Onera Satellite Change Detection (OSCD) Database,
2018
Dataset, Change Detection.
WWW Link.
WWW Link.
See also
Fully Convolutional Siamese Networks for Change Detection.
Goyette, N.,
Jodoin, P.M.,
Porikli, F.M.,
Konrad, J.,
Ishwar, P.,
A Novel Video Dataset for Change Detection Benchmarking,
IP(23), No. 11, November 2014, pp. 4663-4679.
IEEE DOI
Dataset, Change Detection. Adaptive optics
Wang, Y.[Yi],
Jodoin, P.M.[Pierre-Marc],
Porikli, F.M.[Fatih M.],
Konrad, J.[Janusz],
Benezeth, Y.[Yannick],
Ishwar, P.[Prakash],
CDnet 2014: An Expanded Change Detection Benchmark Dataset,
CDW14(393-400)
IEEE DOI
Dataset, Change Detection.
Goyette, N.[Nil],
Jodoin, P.M.[Pierre-Marc],
Porikli, F.M.[Fatih M.],
Konrad, J.[Janusz],
Ishwar, P.[Prakash],
Changedetection.net: A new change detection benchmark dataset,
CDW12(1-8).
IEEE DOI
Dataset, Change Detection.
Walas, K.[Krzysztof],
Leonardis, A.[Aleš],
UoB highly occluded object challenge (UoB-HOOC),
2016
WWW Link.
Dataset, Object Detection.
Wang, Y.M.[Ya-Ming],
Tan, X.[Xiao],
Yang, Y.[Yi],
Li, Z.,
Liu, X.,
Zhou, F.,
Davis, L.S.,
A Refined 3D Pose Dataset for Fine-Grained Object Categories,
R6D19(2797-2806)
IEEE DOI
Dataset, Object Recognition.
HTML Version. image segmentation, object recognition, pose estimation,
statistical analysis, image segmentation networks, IoU,
Fine grained objects
YCB-Video,
A large-scale video dataset for 6D object pose estimation. provides
accurate 6D poses of 21 objects from the YCB dataset observed in 92
videos with 133,827 frames.
WWW Link.
Dataset, Pose Estimation.
Drost, B.[Bertram],
Ulrich, M.[Markus],
Bergmann, P.,
Härtinger, P.,
Steger, C.T.[Carsten T.],
Introducing MVTec ITODD:
A Dataset for 3D Object Recognition in Industry,
6DPose17(2200-2208)
IEEE DOI
Dataset, Object Recognition. Cameras, Engines, Gray-scale, Object detection,
Sensor phenomena and characterization.
Hodan, T.[Tomáš],
Michel, F.[Frank],
Brachmann, E.[Eric],
Kehl, W.[Wadim],
Buch, A.G.[Anders Glent],
Kraft, D.[Dirk],
Drost, B.[Bertram],
Vidal, J.[Joel],
Ihrke, S.[Stephan],
Zabulis, X.[Xenophon],
Sahin, C.[Caner],
Manhardt, F.[Fabian],
Tombari, F.[Federico],
Kim, T.K.[Tae-Kyun],
Matas, J.G.[Jirí G.],
BOP: Benchmark for 6D Object Pose Estimation,
ECCV18(X: 19-35).
Springer DOI
Dataset, Object Pose.
Peters, G.[Gabriele],
Zitova, B.[Barbara],
von der Malsburg, C.[Christoph],
How to measure the pose robustness of object views,
IVC(20), No. 5-6, 15 April 2002, pp. 341-348.
Elsevier DOI
BMVC issue
And:
IVC(20), No. 4, April 2002, pp. 249-256.
Elsevier DOI HTML Version.
Dataset, 3-D Data.
Stegmann, M.B.[Mikkel B.],
Active Appearance Models,
Online2007.
WWW Link.
Code, Active Appearance Model.
Dataset, Active Appearance Model. AAM code and information.
See also
Technical University of Denmark.
Luo, C.[Cai],
Yu, L.J.[Lei-Jian],
Yang, E.[Erfu],
Zhou, H.Y.[Hui-Yu],
Ren, P.[Peng],
A benchmark image dataset for industrial tools,
PRL(125), 2019, pp. 341-348.
Elsevier DOI
Dataset, Tools. Benchmark, Industrial tools, Image dataset
MIT 67 Indoor Dataset,
Dataset, Indoor Images.
HTML Version.
See also
Recognizing indoor scenes.
Yang, K.,
Russakovsky, O.,
Deng, J.,
SpatialSense:
An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition,
ICCV19(2051-2060)
IEEE DOI
Dataset, Spatial Relations.
WWW Link. crowdsourcing, image capture, image recognition,
image sampling, object recognition, SpatialSense benchmark, Genomics
Section, Multiple Entries: 13.4.6 Object Recognition, Retrieval Datasets
Chapter Contents (Back)
Evaluation, Recognition.
Dataset, Objects.
Dataset, Retrieval.
See also
Visual Question Answering, Query, VQA.
See also
Object Recognition Evaluation.
The PASCAL Object Recognition Database Collection,
2006.
Dataset, Objects.
HTML Version. Various datasets for object recognition. Pointers to some of the
others.
Video Objects: A Test Database for Video Object Recognition,
2006.
Dataset, Objects.
HTML Version. 180 videos of 15 objects.
Animals with Attributes: A dataset for Attribute Based Classification,
2006.
Dataset, Objects.
WWW Link. 30,000+ images, 40 animal classes.
Image Net, ImageNet Dataset,
2014.
WWW Link.
Dataset, Objects. Large set of images (or sets of datasets) for recognition.
Related to ImageNet Challanges for recognition.
14Million+ images.
Links to Stanford
See also
Stanford University, Computer Science Departent. and Princeton.
See also
Princeton.
Washington Ground Truth Image Database,
CBIR dataset.
Online2004
WWW Link.
Dataset.
Dataset, Retrieval.
LHI Object Datasets,
Includes hand segmentations, and annotations.
Online2004
HTML Version.
Dataset.
Dataset, Object Recognition.
Transportation images, Animals, Aerial Images, Objects,
Dataset also includes other data.
See also
Lotus Hill Institute.
NEC Animal Dataset,
Online2009
WWW Link.
Dataset.
Dataset, Object Recognition. It consists of about 5000 high quality images
from 60 toy animals taken at different poses against a plain
background.
Xcavator.Net,
Online2007
WWW Link.
Dataset, Object Recognition. Photo search for professional use. Searches stock databases, you then
purchase the image for use.
Part of CogniSign LLC.
The ETH-80 Dataset,
2017
Dataset, Objects.
WWW Link.
The ETH-80 dataset contains visual object images from 8 different
categories including apples, cars, cows,cups, dogs, horses, pears and
tomatoes.
See also
Covariance descriptors on a Gaussian manifold and their application to image set classification.
See also
Swiss Federal Institute of Technology in Zurich.
15 Scene Dataset,
Dataset, Objects.
HTML Version. The 15 scene categories are office, kitchen, living room, bedroom,
store, industrial, tall building, inside cite, street, highway, coast,
open country, mountain, forest, and suburb. Images in the dataset are
about 250*300 resolution, with 210 to 410 images per class.
Video Dataset Overview,
2021
WWW Link.
Dataset, Overview. A good collection of Video datasets for various uses, activity, instruction,
sports, etc..
Multi-Weather 4Seasons Dataset,
2021
Dataset, Driving.
WWW Link.
Vasiljevic, I.,
Kolkin, N.,
Zhang, S.,
Luo, R.,
Wang, H.,
DIODE: A Dense Indoor and Outdoor Depth Dataset,
2019
Dataset, Object Extraction.
WWW Link.
Blanco, J.L.[Jose-Luis],
Moreno, F.A.[Francisco-Angel],
Gonzalez, J.[Javier],
A collection of outdoor robotic datasets with centimeter-accuracy
ground truth,
AutRob(27), No. 4, 2009, pp. 327.
Springer DOI
WWW Link.
Dataset, SLAM. Malaga Parking
Geusebroek, J.M.[Jan-Mark],
Burghouts, G.J.[Gertjan J.],
Smeulders, A.W.M.[Arnold W.M.],
The Amsterdam Library of Object Images,
IJCV(61), No. 1, January 2005, pp. 103-112.
DOI Link
WWW Link.
Dataset, Objects. 1000 objects over 100 images per object.
Torralba, A.B.[Antonio B.],
Fergus, R.[Rob],
Freeman, W.T.[William T.],
80 Million Tiny Images: A Large Data Set for Nonparametric Object and
Scene Recognition,
PAMI(30), No. 11, November 2008, pp. 1958-1970.
IEEE DOI WWW Link.
And:
CSAIL-TR-2007-024, 2007.
Dataset, Retrieval. Images from the WWW, associated with a noun. Large comprehensive dataset.
Dataset with segmentations.
Russell, B.[Bryan],
Torralba, A.B.[Antonio B.],
Freeman, W.T.[William T.],
LableMe: The Open Annotation Tool,
Online2010.
WWW Link.
Dataset, Retrieval.
Code, Annotation. The site for the annotation tool, also the video version.
Zhou, B.[Bolei],
Lapedriza, A.[Agata],
Khosla, A.[Aditya],
Oliva, A.[Aude],
Torralba, A.B.[Antonio B.],
Places: A 10 Million Image Database for Scene Recognition,
PAMI(40), No. 6, June 2018, pp. 1452-1464.
IEEE DOI
Dataset, Retrieval. Context, Databases, Image recognition, Semantics, Sun, Training,
Visualization, Scene classification, deep feature, deep learning,
visual recognition
Escalante, H.J.[Hugo Jair],
Hernandez, C.A.[Carlos A.],
Gonzalez, J.A.[Jesus A.],
Lopez-Lopez, A.,
Montes-y-Gomez, M.[Manuel],
Morales, E.F.[Eduardo F.],
Sucar, L.E.[L. Enrique],
Villasenor, L.[Luis],
Grubinger, M.[Michael],
The segmented and annotated IAPR TC-12 benchmark,
CVIU(114), No. 4, April 2010, pp. 419-428.
Elsevier DOI
Dataset, Retrieval. Data set creation; Ground truth collection; Evaluation metrics;
Automatic image annotation; Image retrieval
Russakovsky, O.[Olga],
Deng, J.[Jia],
Su, H.[Hao],
Krause, J.[Jonathan],
Satheesh, S.[Sanjeev],
Ma, S.[Sean],
Huang, Z.H.[Zhi-Heng],
Karpathy, A.[Andrej],
Khosla, A.[Aditya],
Bernstein, M.[Michael],
Berg, A.C.[Alexander C.],
Fei-Fei, L.[Li],
ImageNet Large Scale Visual Recognition Challenge,
IJCV(115), No. 3, December 2015, pp. 211-252.
Springer DOI
Dataset, Object Category. Object category classification and detection on hundreds of object
categories and millions of images.
Loh, Y.P.[Yuen Peng],
Chan, C.S.[Chee Seng],
Getting to know low-light images with the Exclusively Dark dataset,
CVIU(178), 2019, pp. 30-42.
Elsevier DOI
Dataset, Low Light.
Aizawa, K.,
Fujimoto, A.,
Otsubo, A.,
Ogawa, T.,
Matsui, Y.,
Tsubota, K.,
Ikuta, H.,
Building a Manga Dataset 'Manga109' With Annotations for Multimedia
Applications,
MultMedMag(27), No. 2, April 2020, pp. 8-18.
IEEE DOI
Dataset, Manga. Machine learning, Visualization, Character recognition, Art,
Machine learning algorithms, Task analysis
Kuznetsova, A.[Alina],
Rom, H.[Hassan],
Alldrin, N.[Neil],
Uijlings, J.[Jasper],
Krasin, I.[Ivan],
Pont-Tuset, J.[Jordi],
Kamali, S.[Shahab],
Popov, S.[Stefan],
Malloci, M.[Matteo],
Kolesnikov, A.[Alexander],
Duerig, T.[Tom],
Ferrari, V.[Vittorio],
The Open Images Dataset V4,
IJCV(128), No. 7, July 2020, pp. 1956-1981.
Springer DOI
Dataset, Object Detection. 9.2M images with unified annotations.
HTML Version.
SynthCity:
A Large-Scale Synthetic Point Cloud,
2019.
WWW Link.
Dataset, Point Clouds. Synthetic point clouds and RGB data from a detailed city model.
WHU Datasets,
2020.
WWW Link.
Dataset, Buildings. Several datasets.
See also
Whuan University.
VisDrone Datasets,
2019.
WWW Link.
Dataset, Drone Images. Several datasets related to annual challenges..
Song, D.[Dan],
Nie, W.Z.[Wei-Zhi],
Li, W.H.[Wen-Hui],
Kankanhalli, M.[Mohan],
Liu, A.A.[An-An],
Monocular Image-Based 3-D Model Retrieval: A Benchmark,
Cyber(53), To be published.
IEEE DOI
Dataset, MI3DOR.
WWW Link.
Dataset, 3D Objects. Monocular image based 3D object retrieval
Tan, X.[Xin],
Xu, K.[Ke],
Cao, Y.[Ying],
Zhang, Y.H.[Yi-Heng],
Ma, L.Z.[Li-Zhuang],
Lau, R.W.H.[Rynson W. H.],
Night-Time Scene Parsing With a Large Real Dataset,
IP(30), 2021, pp. 9085-9098.
IEEE DOI
Dataset, NightCity. Streaming media, Urban areas, Image segmentation, Annotations,
Semantics, Computer science, Automobiles, Autonomous driving,
adverse conditions
Deschaud, J.E.[Jean-Emmanuel],
Duque, D.[David],
Richa, J.P.[Jean Pierre],
Velasco-Forero, S.[Santiago],
Marcotegui, B.[Beatriz],
Goulette, F.[François],
Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for
Challenging Tasks in 3D Mapping,
RS(13), No. 22, 2021, pp. xx-yy.
DOI Link
Dataset, Point Cloud.
Pham, K.[Khoi],
Kafle, K.[Kushal],
Lin, Z.[Zhe],
Ding, Z.H.[Zhi-Hong],
Cohen, S.[Scott],
Tran, Q.[Quan],
Shrivastava, A.[Abhinav],
Learning to Predict Visual Attributes in the Wild,
CVPR21(13013-13023)
IEEE DOI
WWW Link.
Dataset, VAW. Geometry, Visualization, Shape,
Image color analysis, Annotations, Prediction algorithms
Zhou, Q.[Qiang],
Wang, S.Y.[Shi-Yin],
Wang, Y.T.[Yi-Tong],
Huang, Z.L.[Zi-Long],
Wang, X.G.[Xing-Gang],
Human De-occlusion: Invisible Perception and Recovery for Humans,
CVPR21(3690-3700)
IEEE DOI
WWW Link.
Dataset, Human Occlusion. Annotations, Aggregates, Refining,
Predictive models, Task analysis
Changpinyo, S.[Soravit],
Sharma, P.[Piyush],
Ding, N.[Nan],
Soricut, R.[Radu],
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To
Recognize Long-Tail Visual Concepts,
CVPR21(3557-3567)
IEEE DOI
Dataset, Image Captioning. Conceptual 12M (CC12M), a dataset with 12 million image-text pairs.
Visualization, Image recognition, Pipelines,
Benchmark testing, Data collection, Knowledge discovery
Anderson, C.[Connor],
Teuscher, A.[Adam],
Anderson, E.[Elizabeth],
Larsen, A.[Alysia],
Shirley, J.[Josh],
Farrell, R.[Ryan],
Have Fun Storming the Castle(s)!,
WACV21(3702-3711)
IEEE DOI WWW Link.
Dataset, Castles. 2400 individual castles, palaces and fortresses from more than 90
countries, contains more than 770K images.
Visualization, Image recognition, Geology,
Computational modeling, Image retrieval
Figueiredo, A.[Augusto],
Brayan, J.[Johnata],
Reis, R.O.[Renan Oliveira],
Prates, R.[Raphael],
Schwartz, W.R.[William Robson],
MoRe: A Large-Scale Motorcycle Re-Identification Dataset,
WACV21(4033-4042)
IEEE DOI WWW Link.
Dataset, Vehicles. Training, Deep learning, Computational modeling,
Surveillance, Motorcycles, Traffic control
Le, H.A.[Hoang-An],
Mensink, T.[Thomas],
Das, P.[Partha],
Karaoglu, S.[Sezer],
Gevers, T.[Theo],
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes,
WACV21(1578-1588)
IEEE DOI WWW Link.
Dataset, Outdoor Scenes. Deep learning, Image segmentation,
Image color analysis, Computational modeling, Semantics
Scheck, T.[Tobias],
Seidel, R.[Roman],
Hirtz, G.[Gangolf],
Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor
Dataset for Deep Transfer Learning,
WACV20(932-941)
IEEE DOI
Dataset, Fisheye Images. Cameras, Image segmentation, Object detection,
Semantics, Solid modeling, Rendering (computer graphics)
Behley, J.,
Garbade, M.,
Milioto, A.,
Quenzel, J.,
Behnke, S.,
Stachniss, C.,
Gall, J.,
SemanticKITTI:
A Dataset for Semantic Scene Understanding of LiDAR Sequences,
ICCV19(9296-9306)
IEEE DOI
Dataset, LiDAR. distance measurement, image segmentation,
optical radar, stereo image processing, LiDAR sequences, Lasers
Wang, X.,
Wu, J.,
Chen, J.,
Li, L.,
Wang, Y.,
Wang, W.Y.,
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for
Video-and-Language Research,
ICCV19(4580-4590)
IEEE DOI WWW Link.
Dataset, . language translation, linguistics, natural language processing,
video signal processing, unified multilingual model, Social network services
Gu, S.,
Lugmayr, A.,
Danelljan, M.,
Fritsche, M.,
Lamour, J.,
Timofte, R.,
DIV8K: DIVerse 8K Resolution Image Dataset,
AIM19(3512-3516)
IEEE DOI
Dataset, High Resolution. convolutional neural nets, image resolution,
learning (artificial intelligence), CNN, image processing
Mauceri, C.[Cecilia],
Palmer, M.[Martha],
Heckman, C.[Christoffer],
SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,
CLVL19(1883-1886)
IEEE DOI
Dataset, Recognition. image colour analysis, object detection, SLAM (robots),
spatial referring expressions, SUN-Spot, objects localization,
multimodal
Sølund, T.[Thomas],
Buch, A.G.[Anders Glent],
Krüger, N.[Norbert],
Aanæs, H.[Henrik],
A Large-Scale 3D Object Recognition Dataset,
3DV16(73-82)
IEEE DOI
Dataset, Object Recognition.
WWW Link. object recognition
Hua, B.S.[Binh-Son],
Pham, Q.H.[Quang-Hieu],
Nguyen, D.T.[Duc Thanh],
Tran, M.K.[Minh-Khoi],
Yu, L.F.[Lap-Fai],
Yeung, S.K.[Sai-Kit],
SceneNN: A Scene Meshes Dataset with aNNotations,
3DV16(92-101)
IEEE DOI
Dataset, RGB-D.
WWW Link. Cameras
Rotman, D.[Daniel],
Gilboa, G.[Guy],
A Depth Restoration Occlusionless Temporal Dataset,
3DV16(176-184)
IEEE DOI
Dataset, RGB-D.
Zhang, J.J.[Jun-Jie],
Zhang, J.[Jian],
Lu, J.F.[Jian-Feng],
Shen, C.H.[Chun-Hua],
Curr, K.[Kate],
Phua, R.[Robin],
Neville, R.[Richard],
Edmonds, E.[Elise],
SLNSW-UTS:
A Historical Image Dataset for Image Multi-Labeling and Retrieval,
DICTA16(1-6)
IEEE DOI
Dataset, Object Recognition. 29713 images, 119 labels.
Xiang, Y.[Yu],
Kim, W.[Wonhui],
Chen, W.[Wei],
Ji, J.W.[Jing-Wei],
Choy, C.[Christopher],
Su, H.[Hao],
Mottaghi, R.[Roozbeh],
Guibas, L.J.[Leonidas J.],
Savarese, S.[Silvio],
ObjectNet3D: A Large Scale Database for 3D Object Recognition,
ECCV16(VIII: 160-176).
Springer DOI
Dataset, Object Recognition.
WWW Link.
Lin, T.Y.[Tsung-Yi],
Maire, M.[Michael],
Belongie, S.J.[Serge J.],
Hays, J.[James],
Perona, P.[Pietro],
Ramanan, D.[Deva],
Dollár, P.[Piotr],
Zitnick, C.L.[C. Lawrence],
Microsoft COCO: Common Objects in Context,
ECCV14(V: 740-755).
Springer DOI
Dataset, Objects.
WWW Link.
Flickr30k Dataset,
From image descriptions to visual denotations.
WWW Link.
Dataset, Visual Question Answering. Extension of Flickr 8k dataset.
Plummer, B.A.[Bryan A.],
Wang, L.W.[Li-Wei],
Cervantes, C.M.[Chris M.],
Caicedo, J.C.[Juan C.],
Hockenmaier, J.[Julia],
Lazebnik, S.[Svetlana],
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for
Richer Image-to-Sentence Models,
IJCV(123), No. 1, May 2017, pp. 74-93.
Springer DOI
Earlier:
ICCV15(2641-2649)
IEEE DOI
Dataset, Object Recognition. Benchmark testing
Fanello, S.R.[Sean Ryan],
Ciliberto, C.[Carlo],
Santoro, M.[Matteo],
Natale, L.[Lorenzo],
Metta, G.[Giorgio],
Rosasco, L.[Lorenzo],
Odone, F.[Francesca],
iCub World: Friendly Robots Help Building Good Vision Data-Sets,
GT13(700-705)
IEEE DOI
Dataset, Object Recognition. Human Robot Interaction; Object Categorization Dataset; iCub
Ponomarenko, N.[Nikolay],
Ieremeiev, O.[Oleg],
Lukin, V.[Vladimir],
Jin, L.[Lina],
Egiazarian, K.O.[Karen O.],
A New Color Image Database TID2013: Innovations and Results,
ACIVS13(402-413).
Springer DOI
Dataset, Color Images.
Ponce, J.,
Berg, T.L.,
Everingham, M.R.,
Forsyth, D.A.,
Hebert, M.,
Lazebnik, S.[Svetlana],
Marszalek, M.,
Schmid, C.,
Russell, B.C.,
Torralba, A.,
Williams, C.K.I.,
Zhang, J.,
Zisserman, A.,
Dataset Issues in Object Recognition,
CLOR06(29-48).
Springer DOI
Dataset, Discussion.
Campbell, R., and
Flynn, P.J.,
A WWW-Accessible 3D Image and Model Database for
Computer Vision Research,
EEMCV98(148-154).
And:
EEMTV98(xx)
Dataset, 3-D Data.
HTML Version.
Nene, S.A.,
Nayar, S.K.[Shree K.],
Murase, H.[Hiroshi],
Columbia Object Image Library (COIL-100),
ColumbiaTechnical Report CUCS-006-96, February 1996.
PS File. Also:
WWW Link. Also the COIL-20 database.
WWW Link.
Dataset, Objects.
Section, Multiple Entries: 13.6.3.3 Dataset Distillation, Dataset Summary, Dataset Quantization
Chapter Contents (Back)
Dataset Distillation.
Dataset Summarization.
Zhai, W.[Wei],
Luo, H.C.[Hong-Chen],
Zhang, J.[Jing],
Cao, Y.[Yang],
Tao, D.C.[Da-Cheng],
One-Shot Object Affordance Detection in the Wild,
IJCV(130), No. 10, October 2022, pp. 2472-2500.
Springer DOI
Dataset, Affordance.
WWW Link. Affordance: potential action possibilities of objects in the scene.
Bileschi, S.M.[Stanley M.],
CBCL StreetScenes Challenge Framework,
Online2007.
WWW Link.
Dataset, Object Detection. Primarily for Cars, people, and street scenes.
Data is labeled.
Hoiem, D.[Derek],
Efros, A.A.[Alexei A.],
Hebert, M.[Martial],
Recovering Surface Layout from an Image,
IJCV(75), No. 1, October 2007, pp. 151-172.
Springer DOI
Earlier:
Geometric Context from a Single Image,
ICCV05(I: 654-661).
IEEE DOI
Dataset, Recognition. The example data is available:
HTML Version. Kanade issue.
Coarse properties (ground plane, sky, planar regions) from one
image.
Probabilistic approach to estimate 3D geometry so that not every
possible view is needed.
Fu, H.[Huan],
Jia, R.F.[Rong-Fei],
Gao, L.[Lin],
Gong, M.M.[Ming-Ming],
Zhao, B.Q.[Bin-Qiang],
Maybank, S.J.[Steve J.],
Tao, D.C.[Da-Cheng],
3D-FUTURE: 3D Furniture Shape with TextURE,
IJCV(129), No. 12, December 2021, pp. 3313-3337.
Springer DOI
Dataset Furniture.
WWW Link.
Medical Dataset Archive,
2006.
Dataset, Medical Images.
WWW Link. Variety of medical data. CT dataset available from related web site.
Visible Human Project,
1994.
Dataset, Medical Images.
HTML Version. Complete data in MRI, CT, slices.
MOTA Object Tracking Benchmark,
2021 for workshop.
WWW Link.
Dataset, Cell Tracking.
CR Chisto Labeled Nuclei Dataset,
Online2016
WWW Link.
Dataset, Nuclei.
Dataset of colorectal cancer histology images consisting of nearly
30,000 dotted nuclei with over 22,000 labeled with the type of cell
they belong to.
FIRE Fundus Image Registration Dataset,
2016
WWW Link.
Dataset, Retinal.
Dataset, Registration.
134 retinal image pairs and ground truth for registration.
Kauppi, T.,
Kalesnykiene, V.,
Kamarainen, J.K.,
Lensu, L.,
Sorri, I.,
Raninen, A.,
Voutilainen, R.,
Uusitalo, H.,
Kalviainen, H.,
Pietila, J.,
The DIARETDB1 diabetic retinopathy database and evaluation protocol,
BMVC07(xx-yy).
PDF File.
Dataset, Retina.
MiniMammographic Database,
1995
WWW Link.
Dataset, Mammography.
DDSM: Digital Database for Screening Mammography,
2000, USF.
HTML Version.
Dataset, Mammography.
Developing Human Connectome Project (dHCP),
2017
WWW Link.
Dataset, fMRI. The imaging data includes structural imaging, structural connectivity
data (diffusion MRI) and functional connectivity data (resting-state
fMRI).
Andreopoulos, A.[Alexander],
Tsotsos, J.K.[John K.],
Cardiac MRI dataset,
Online2008.
WWW Link.
Dataset, Cardiac MRI.
CoronARe: A Coronary Artery Reconstruction Challenge,
2017.
Dataset, Angiography.
WWW Link. 3D Reconstrucion challange dataset.
Zimmermann, K.[Karel],
Matas, J.G.[Jirí G.],
Svoboda, T.[Thomáš],
Tracking by an Optimal Sequence of Linear Predictors,
PAMI(31), No. 4, April 2009, pp. 677-692.
IEEE DOI
Code, Tracking.
Dataset, Tracking.
Earlier: A1, A3, A2:
Simultaneous learning of motion and appearance,
MLMotion08(xx-yy).
Earlier: A1, A3, A2:
Adaptive Parameter Optimization for Real-time Tracking,
NRTL07(1-8).
IEEE DOI
Earlier: A1, A3, A2:
Multiview 3D Tracking with an Incrementally Constructed 3D Model,
3DPVT06(488-495).
IEEE DOI
Learning approach to tracking. Estimation of the pose given the pose of
the previous frame.
Matlab implementation available.
WWW Link.
Huang, Y.[Yan],
Essa, I.A.[Irfan A.],
Tracking Multiple Objects through Occlusions,
CVPR05(II: 1051-1058).
IEEE DOI WWW Link.
Dataset, Actions.
And:
CVPR05(II: 1182).
IEEE DOI
See also
Georgia Tech.
Hopkins 155,
Motion Dataset
Online2007.
WWW Link.
Dataset, Motion. Testing feature based motion segmentation algorithms.
See also
Johns Hopkins University.
Tracking Any Object, TAO, Dataset,
Motion Dataset
Online
WWW Link.
Dataset, Tracking. 2,907 high resolution videos, captured in diverse environments.
OTCBVS Benchmark Dataset Collection,
2001
WWW Link.
Dataset, Tracking.
Dataset, Face Recognition. Collection of datasets for benchmarking realted to the related
conferences. Includes face dataset.
UCF Parking Lot Tracking,
2012
WWW Link.
Dataset, Tracking.
Tracking multiple people in parking lot.
See also
Part-based multiple-person tracking with partial occlusion handling.
See also
GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs.
Dubuisson, S.[Séverine],
Gonzales, C.[Christophe],
A survey of datasets for visual tracking,
MVA(27), No. 1, January 2016, pp. 23-52.
WWW Link.
Survey, Tracking.
Dataset, Tracking.
Huang, L.H.[Liang-Hua],
Zhao, X.[Xin],
Huang, K.Q.[Kai-Qi],
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking
in the Wild,
PAMI(43), No. 5, May 2021, pp. 1562-1577.
IEEE DOI
WWW Link.
Dataset, Tracking. Training, Object tracking, Databases, Protocols, Benchmark testing,
Servers, Object tracking, benchmark dataset, performance evaluation
Bondi, E.,
Jain, R.,
Aggrawal, P.,
Anand, S.,
Hannaford, R.,
Kapoor, A.,
Piavis, J.,
Shah, S.,
Joppa, L.,
Dilkina, B.,
Tambe, M.,
BIRDSAI: A Dataset for Detection and Tracking in Aerial Thermal
Infrared Videos,
WACV20(1736-1745)
IEEE DOI
Dataset, Tracking.
WWW Link. Videos, Cameras, Surveillance, Animals, Task analysis, Benchmark testing
Dave, A.[Achal],
Khurana, T.[Tarasha],
Tokmakov, P.[Pavel],
Schmid, C.[Cordelia],
Ramanan, D.[Deva],
TAO: A Large-scale Benchmark for Tracking Any Object,
ECCV20(V:436-454).
Springer DOI
Dataset, Tracking.
Lukezic, A.,
Kart, U.,
Käpylä, J.,
Durmush, A.,
Kamarainen, J.,
Matas, J.G.,
Kristan, M.,
CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark,
ICCV19(10012-10021)
IEEE DOI
Dataset, Tracking. image colour analysis, image sequences, object detection,
object tracking, pose estimation, most diverse dataset,
Robot sensing systems
Müller, M.[Matthias],
Bibi, A.[Adel],
Giancola, S.[Silvio],
Alsubaihi, S.[Salman],
Ghanem, B.[Bernard],
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in
the Wild,
ECCV18(I: 310-327).
Springer DOI
Dataset, Tracking.
Valmadre, J.[Jack],
Bertinetto, L.[Luca],
Henriques, J.F.[João F.],
Tao, R.[Ran],
Vedaldi, A.[Andrea],
Smeulders, A.W.M.[Arnold W. M.],
Torr, P.H.S.[Philip H. S.],
Gavves, E.[Efstratios],
Long-Term Tracking in the Wild: A Benchmark,
ECCV18(III: 692-707).
Springer DOI
Dataset, Tracking.
Zhang, S.[Shu],
Staudt, E.[Elliot],
Faltemier, T.[Tim],
Roy-Chowdhury, A.K.[Amit K.],
A Camera Network Tracking (CamNeT) Dataset and Performance Baseline,
WACV15(365-372)
IEEE DOI
Dataset, Camera Tracking.
WWW Link. Cameras; Legged locomotion; Lighting; Target tracking; Trajectory; Videos
Jaynes, C.,
Kale, A.,
Sanders, N.,
Grossmann, E.,
The Terrascope Dataset:
Scripted Multi-Camera Indoor Video Surveillance with Ground-truth,
PETS05(309-316).
IEEE DOI WWW Link.
Dataset, Surveillance.
Visual Object Tracking Challenges, VOT,
Tracking Challenges and datasets.
Online
HTML Version.
Dataset, Tracking. Various VOT workshop datasets.
See also
Visual Object Tracking Challenge.
Li, A.,
Lin, M.,
Wu, Y.,
Yang, M.,
Yan, S.,
NUS-PRO: A New Visual Tracking Challenge,
PAMI(38), No. 2, February 2016, pp. 335-349.
IEEE DOI
Dataset, Tracking. Airplanes
Dendorfer, P.[Patrick],
Osep, A.[Aljosa],
Milan, A.[Anton],
Schindler, K.[Konrad],
Cremers, D.[Daniel],
Reid, I.D.[Ian D.],
Roth, S.[Stefan],
Leal-Taixé, L.[Laura],
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking,
IJCV(129), No. 4, April 2021, pp. 845-881.
Springer DOI
WWW Link.
Dataset, Motion Tracking. There are a series of related datasets for annual challenges.
PETS Benchmark Datasets,
Online2006 Dataset:
HTML Version.
Dataset, Surveillance.
2014 Dataset:
HTML Version. 2015 Dataset:
HTML Version. 2016 Dataset:
HTML Version.
The KITTI Vision Benchmark Suite,
Online2013
WWW Link.
Dataset, Road Scenes.
Award, Everingham Prize. Stereo, Lidar, GPS, etc.
See also
Vision meets robotics: The KITTI dataset.
Per, J.[Janez],
Kenk, V.S.[Vildana Sulic],
Mandeljc, R.[Rok],
Kristan, M.[Matej],
Kovacic, S.[Stanislav],
Dana36: A Multi-camera Image Dataset for Object Identification in
Surveillance Scenarios,
AVSS12(64-69).
IEEE DOI
Dataset,Surveillance.
LHI Surveillance Dataset,
Annotated surveillance images.
Online2008
HTML Version.
Dataset, Segmentation.
Subset of larger dataset.
See also
Lotus Hill Institute.
CLEAR: Classification of Events, Activities and Relationships,
MTPH07 WWW Link.
Dataset, Activity Recogniton.
i-LIDS: Bag and vehicle detection challenge,
Online2007
AVSBS07 HTML Version.
Dataset, Activity Recogniton. Data used at Advanced Video and Signal Based Surveillance, 2007.
Multimedia Event Detection,
Series of Event and Activity Detection evaluations.
WWW Link.
WWW Link.
WWW Link.
Dataset, Activity Recogniton. MED13, MED12, MED11.
Multiview Extended Video with Activities,
MEVA Test 3:
WWW Link. Information also:
WWW Link.
Dataset, Activity Recogniton.
Dataset, MEVA. 333 hours of ground-camera and UAV videos and 28 hours of
MEVA training Annotations dataset.
See also
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection.
PETS 2006 Benchmark Data,
Online2006
PETS06 HTML Version.
Dataset, Activity Recogniton. Data used at International Workshop on
Performance Evaluation of Tracking and Surveillance
2006.
PETS 2001 Benchmark Data,
Online2001
PETS01 WWW Link.
Dataset, Activity Recogniton. Data used at International Workshop on
Performance Evaluation of Tracking and Surveillance 2001.
OTCBVS Benchmark Dataset Collection,
OTCBVS072007
WWW Link.
Dataset, Activity Recogniton. Beyound the Visual Spectrum (IR especially).
Data for various OTCBVS workshops.
YouTube-8M Dataset,
Labed video dataset.
WWW Link.
WWW Link.
Dataset, Video Database. 4700+ visual entities.
Introduced in:
See also
YouTube-8M: A Large-Scale Video Classification Benchmark.
Fisher, R.B.[Robert B.],
CAVIAR Test Case Scenarios,
Online BookOctober 2004.
WWW Link.
Dataset, Video. From the EC funded CAVIAR project
(Context Aware Vision using Image-based Active Recognition).
The sequences are labelled (in XML) with both the tracked persons and
a semantic description of their activities.
81 video sequences comprising about 90K frames.
These sequences include indoor plaza and shopping center
observations of individuals and small groups of people walking, browsing,
window shopping, fighting, meeting, leaving packages behind, collapsing,
entering and exiting shops, etc.
Optic Flow Data,
Edinburgh2007.
Smoothed flow sequences for the Waverly train station scene.
WWW Link.
Dataset, Video. Behavior, pedestrian analysis.
BEHAVE Interactions Test Case Scenarios,
Edinburgh2007.
Two views of various scenarios of people acting out various interactions.
WWW Link.
Dataset, Video. Behavior, pedestrian analysis.
Includes ground truth bounding boxes for much of the data.
Sigal, L.[Leonid],
Balan, A.O.[Alexandru O.],
Black, M.J.[Michael J.],
HumanEva: Synchronized Video and Motion Capture Dataset and Baseline
Algorithm for Evaluation of Articulated Human Motion,
IJCV(87), No. 1-2, March 2010, pp. xx-yy.
Springer DOI
Dataset, Human Motion.
Earlier: A1, A3, Only:
HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion,
BrownTechnical Report CS-06-08, September 2006.
HTML Version. For the dataset:
HTML Version. Calibrated video sequences synchronized with motion capture data.
Bolten, T.[Tobias],
Pohle-Fröhlich, R.[Regina],
Tönnies, K.D.[Klaus D.],
DVS-OUTLAB: A Neuromorphic Event-Based Long Time Monitoring Dataset
for Real-World Outdoor Scenarios,
EventVision21(1348-1357)
IEEE DOI
Dataset, Surveilance. Privacy, Rain, Power demand, Neuromorphics, Noise reduction,
Pipelines, Vision sensors
Li, L.Z.[Long-Zhen],
Nawaz, T.[Tahir],
Ferryman, J.M.,
PETS 2015: Datasets and challenge,
AVSS15(1-6)
IEEE DOI
Dataset, PETS 2015. object detection
Oh, S.M.[Sang-Min],
Hoogs, A.J.[Anthony J.],
Perera, A.[Amitha],
Cuntoor, N.[Naresh],
Chen, C.C.[Chia-Chih],
Lee, J.T.[Jong Taek],
Mukherjee, S.[Saurajit],
Aggarwal, J.K.,
Lee, H.T.[Hyung-Tae],
Davis, L.S.[Larry S.],
Swears, E.[Eran],
Wang, X.Y.[Xiao-Yang],
Ji, Q.A.[Qi-Ang],
Reddy, K.K.[Kishore K.],
Shah, M.[Mubarak],
Vondrick, C.[Carl],
Pirsiavash, H.[Hamed],
Ramanan, D.[Deva],
Yuen, J.[Jenny],
Torralba, A.B.[Antonio B.],
Song, B.[Bi],
Fong, A.[Anesco],
Roy-Chowdhury, A.K.[Amit K.],
Desai, M.[Mita],
A large-scale benchmark dataset for event recognition in surveillance
video,
CVPR11(3153-3160).
IEEE DOI
And:
AVSBS11(527-528).
IEEE DOI
Dataset, Action Recognition.
Dataset, Event Recognition.
Harvey, A.[Adam],
LaPlace, J.[Jules],
Exposing.ai,
Online2021.
WWW Link.
Dataset, Duke MTMC Dataset. Privacy issues in re-identification research and the use of large
datasets.
MIT Car Database MITC,
Online2000
HTML Version.
Dataset, Vehicles.
PKU-VD Dataset,
2017
HTML Version.
Dataset, Vehicles. VD1: 1,097,649 images. 1,232 vehicle models and 11 colors.
VD2: 807,260 images. 1,112 vehicle models and 11 colors.
Reference:
See also
Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles.
PKU VehicleID Dataset,
2016
HTML Version.
Dataset, Vehicles. 10319 vehicles, 90196 images.
Reference:
See also
Deep Relative Distance Learning: Tell the Difference between Similar Vehicles.
Struwe, M.[Marvin],
Hasler, S.[Stephan],
Bauer-Wersing, U.[Ute],
Rendered Benchmark Data Set for Evaluation of Occlusion-Handling
Strategies of a Parts-Based Car Detector,
PSIVT15(99-110).
Springer DOI
Dataset, Vehicle Detection.
Racing Bicycle Detection/Tracking from UAV Footage, UAV Detection,
Motion Datasets
Online2019.
HTML Version.
Dataset, Vehicle Tracking.
Dataset, Drone Detection. Multiple datasets. UAV detection against variety of backgrounds.
See also
MULTIDRONE.
Stanford Cars Dataset,
2019.
A dataset for understanding human actions in still images
WWW Link.
HTML Version.
Dataset, Vehicles. 196 classes of cars, 16,185 images.
See also
Leveraging the Wisdom of the Crowd for Fine-Grained Recognition.
See also
Stanford University, Computer Science Departent.
Behrendt, K.,
Boxy Vehicle Detection in Large Images,
CVRSUAD19(840-846)
IEEE DOI
Dataset, Vehicles.
WWW Link. cameras, image resolution, image segmentation, object detection,
road vehicles, traffic engineering computing, individual teams,
dataset
UA-DETRAC Benchmark Suite,
2016.
WWW Link.
Dataset, Traffic.
See also
UA-DETRAC 2017: Report of AVSS2017 IWT4S Challenge on Advanced Traffic Monitoring.
Neuhold, G.[Gerhard],
Ollmann, T.[Tobias],
Bulò, S.R.[Samuel Rota],
Kontschieder, P.[Peter],
The Mapillary Vistas Dataset for Semantic Understanding of Street
Scenes,
ICCV17(5000-5009)
IEEE DOI
Dataset, Traffic. 25,000 images, 66 categories.
computational geometry, data visualisation, image annotation,
image resolution, image segmentation, road traffic,
Visualization
Koschorrek, P.[Philipp],
Piccini, T.[Tommaso],
Oberg, P.[Per],
Felsberg, M.[Michael],
Nielsen, L.[Lars],
Mester, R.[Rudolf],
A Multi-sensor Traffic Scene Dataset with Omnidirectional Video,
GT13(727-734)
IEEE DOI
Dataset, Traffic. automotive
da Cruz, S.D.[Steve Dias],
Wasenmüller, O.[Oliver],
Beise, H.P.[Hans-Peter],
Stifter, T.[Thomas],
Stricker, D.[Didier],
SVIRO: Synthetic Vehicle Interior Rear Seat Occupancy Dataset and
Benchmark,
WACV20(962-971)
IEEE DOI
Dataset, Vehicle Surveilance.
WWW Link. Task analysis, Benchmark testing, Training, Automobiles,
Cameras, Lightning
Massoz, Q.,
Langohr, T.,
Francois, C.,
Verly, J.G.,
The ULg multimodality drowsiness database (called DROZY) and examples
of use,
WACV16(1-7)
IEEE DOI
Dataset, Driver Monitoring. Cameras
Xu, Z.B.[Zhen-Bo],
Yang, W.[Wei],
Meng, A.[Ajin],
Lu, N.X.[Nan-Xue],
Huang, H.[Huan],
Ying, C.C.[Chang-Chun],
Huang, L.S.[Liu-Sheng],
Towards End-to-End License Plate Detection and Recognition:
A Large Dataset and Baseline,
ECCV18(XIII: 261-277).
Springer DOI
Dataset, License Plates.
Eraqi, H.M.[Hesham M.],
Abouelnaga, Y.[Yehya],
Saad, M.H.[Mohamed H.],
Moustafa, M.N.[Mohamed N.],
Distracted Driver Dataset,
WWW Link.
Dataset, Driver Monitoring. Includes Distracted Driver V1 and Distracted Driver V2.
MIT Pedestrian Database MITP,
Online2000
HTML Version.
Dataset, Surveillance.
UCF Action Recogniton Dataset 101,
Online2012
WWW Link.
Earlier:
UCF Action Recogniton Dataset 50,
Online2010
WWW Link.
Dataset, Surveillance.
101 action categories, consisting of realistic videos taken from youtube.
UCF 101 is an extension of UCF 50.
Categories include:
Baseball Pitch, Basketball Shooting, Bench Press, Biking, Biking,
Billiards Shot,Breaststroke, Clean and Jerk, Diving, Drumming,
Fencing, Golf Swing, Playing Guitar, High Jump, Horse Race, Horse
Riding, Hula Hoop, Javelin Throw, Juggling Balls, Jump Rope, Jumping
Jack, Kayaking, Lunges, Military Parade, Mixing Batter, Nun chucks,
Playing Piano, Pizza Tossing, Pole Vault, Pommel Horse, Pull Ups,
Punch, Push Ups, Rock Climbing Indoor, Rope Climbing, Rowing, Salsa
Spins, Skate Boarding, Skiing, Skijet, Soccer, Juggling, Swing,
Playing Tabla, TaiChi, Tennis Swing, Trampoline Jumping, Playing
Violin, Volleyball Spiking, Walking with a dog, and Yo Yo.
The printed reference:
See also
UCF101: A Dataset of 101 Human Action Classes from Videos in The Wild.
UCF-iPhone,
Online2012
WWW Link.
Dataset, Surveillance.
Aerobic actions using the Inertial Measurement Unit (IMU) on an Apple iPhone.
Biking, Climbing Stairs, Descending Stairs, Gym Biking, Jump Roping,
Running, Standing, Treadmill Walking and Walking.
See also
Macro-Class Selection for Hierarchical K-NN Classification of Inertial Sensor Data. for the paper.
Hollywood2 Human Actions and Scenes Dataset,
Online2016
WWW Link.
Dataset, Surveillance.
Part originally from:
See also
Actions in context.
HMDB: a large human motion database,
Online2016
WWW Link.
Dataset, Surveillance.
Award, ICCV, Helmholtz.
51 actions.
See also
HMDB: A large video database for human motion recognition.
TRECVID Workshop DAta,
Online2017
HTML Version.
Dataset, Surveillance.
Surveillance datasets from 2001 to 2017.
Privacy-Preserving Visual Recognition PA-HMDB51,
Online2019.
WWW Link.
Dataset, Actions.
Dataset, Privacy.
The dataset contains 592 videos selected from the HMDB51 dataset
(
See also
HMDB: A large video database for human motion recognition. ).
For each video, we provide with frame-level annotation of five
privacy attributes: skin color, gender, face, nudity, and
relationship.
See also
Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot Study.
HVU Dataset,
Online2021
WWW Link.
Dataset, Action. For Holistic Video Understanding workshop
EPIC-KITCHENS,
Online2018
WWW Link.
Dataset, Action.
Dataset, Daily Activities.
First-person (egocentric) vision; multi-faceted non-scripted
recordings in native environments - i.e. the wearers' homes, capturing
all daily activities in the kitchen over multiple days.
See also
EPIC-KITCHENS Dataset: Collection, Challenges and Baselines, The.
Egocentric Live 4D Perception (Ego4D) Dataset:
A large-scale first-person video dataset, supporting research
in multi-modal machine perception for daily life activity,
Online2021
WWW Link.
Dataset, Action.
Dataset, Egocentric. The Ego4D Consortium.
A large-scale first-person video dataset, supporting research in
multi-modal machine perception for daily life activity.
Kay, W.[Will],
Carreira, J.[Joao],
Simonyan, K.[Karen],
Zhang, B.[Brian],
Hillier, C.[Chloe],
Vijayanarasimhan, S.[Sudheendra],
Viola, F.[Fabio],
Green, T.[Tim],
Back, T.[Trevor],
Natsev, P.[Paul],
Suleyman, M.[Mustafa],
Zisserman, A.[Andrew],
The Kinetics Human Action Video Dataset,
Online2019.
WWW Link.
WWW Link.
Dataset, Actions.
Dataset, Human Action.
Tenorth, M.[Moritz],
Bandouch, J.[Jan],
Beetz, M.[Michael],
The TUM Kitchen Data Set of everyday manipulation activities for motion
tracking and action recognition,
THEMIS09(1089-1096).
IEEE DOI
Dataset, Activity Recognition.
Guerra-Filho, G.[Gutemberg],
Biswas, A.[Arnab],
The human motion database:
A cognitive and parametric sampling of human motion,
IVC(30), No. 3, March 2012, pp. 251-261.
Elsevier DOI
Earlier:
FG11(103-110).
IEEE DOI
Dataset, Activity Recognition. Human motion database; Quantitative evaluation; Parametric and
cognitive sampling; Motion synthesis and analysis
Chaquet, J.M.[Jose M.],
Carmona, E.J.[Enrique J.],
Fernandez-Caballero, A.[Antonio],
A survey of video datasets for human action and activity recognition,
CVIU(117), No. 6, June 2013, pp. 633-659.
Elsevier DOI
Survey, Activity Recognition.
Dataset, Activity Recognition. Human action recognition; Human activity recognition; Database;
Dataset; Review; Survey
Chavarriaga, R.[Ricardo],
Sagha, H.[Hesam],
Calatroni, A.[Alberto],
Digumarti, S.T.[Sundara Tejaswi],
Tröster, G.[Gerhard],
del R. Millán, J.[José],
Roggen, D.[Daniel],
The Opportunity challenge: A benchmark database for on-body
sensor-based activity recognition,
PRL(34), No. 15, 2013, pp. 2033-2042.
Elsevier DOI
Dataset, Activity Recognition. Activity recognition
Barrett, D.P.[Daniel Paul],
Xu, R.[Ran],
Yu, H.N.[Hao-Nan],
Siskind, J.M.[Jeffrey Mark],
Collecting and annotating the large continuous action dataset,
MVA(27), No. 7, October 2016, pp. 983-995.
Springer DOI
Dataset, Actions. LCA Dataset.
Hadfield, S.[Simon],
Lebeda, K.[Karel],
Bowden, R.[Richard],
Hollywood 3D: What are the Best 3D Features for Action Recognition?,
IJCV(121), No. 1, January 2017, pp. 95-110.
Springer DOI
Earlier: A1, A3, Only:
Hollywood 3D: Recognizing Actions in 3D Natural Scenes,
CVPR13(3398-3405)
IEEE DOI
Dataset, Attion Recognition. Hollywood3D dataset.
3.5d
Monfort, M.[Mathew],
Andonian, A.[Alex],
Zhou, B.L.[Bo-Lei],
Ramakrishnan, K.[Kandan],
Bargal, S.A.[Sarah Adel],
Yan, T.[Tom],
Brown, L.[Lisa],
Fan, Q.F.[Quan-Fu],
Gutfreund, D.[Dan],
Vondrick, C.[Carl],
Oliva, A.[Aude],
Moments in Time Dataset: One Million Videos for Event Understanding,
PAMI(42), No. 2, February 2020, pp. 502-508.
IEEE DOI
WWW Link.
Dataset, Action. Videos, Visualization, Feature extraction, Vocabulary, Animals,
Convolution, Video dataset, event recognition
Patino, L.[Luis],
Ferryman, J.M.[James M.],
PETS 2014: Dataset and challenge,
AVSS14(355-360)
IEEE DOI
Dataset, Surveillance. Cameras
Liu, C.[Ce],
Freeman, W.T.[William T.],
Adelson, E.H.[Edward H.],
Weiss, Y.[Yair],
Human-assisted motion annotation,
CVPR08(1-8).
IEEE DOI
Dataset, Motion.
WWW Link. Motion annotation then applied to datasets to provide ground truth.
Shi, Y.F.[Yi-Fan],
Huang, Y.[Yan],
Minnen, D.,
Bobick, A.F.,
Essa, I.A.,
Propagation networks for recognition of partially ordered sequential
action,
CVPR04(II: 862-869).
IEEE DOI HTML Version.
Dataset, Actions.
See also
Georgia Tech.
Crowd Detection/Recognition/Segmentation from UAV/Drone-Captured Images/Videos,
2022.
WWW Link.
Dataset, Crowd Detection. Under the auspices of the European Union's "Horizon 2020" research
framework programme. It is a collection of datasets suitable for
research on autonomous UAV/drone vision.
See also
Aristotle University of Thessaloniki.
VIPeR: Viewpoint Invariant Pedestrian Recognition,
Pedestrian dataset. 2007.
WWW Link.
Dataset, Pedestrians.
Akshatha, K.R.,
Karunakar, A.K.,
Shenoy, B.S.[B. Satish],
Pavan, K.P.[K. Phani],
Dhareshwar, C.V.[Chinmay V.],
Johnson, D.G.[Dennis George],
Manipal-UAV person detection dataset: A step towards benchmarking
dataset and algorithms for small object detection,
PandRS(195), 2023, pp. 77-89.
Elsevier DOI
Dataset, UAV Human Detection. Small object detection, Unmanned aerial vehicles,
Convolutional neural networks, Deep learning, Computer vision
Wang, D.[Dan],
Zhang, C.Y.[Chong-Yang],
Cheng, H.[Hao],
Shang, Y.F.[Yan-Feng],
Mei, L.[Lin],
SPID: Surveillance Pedestrian Image Dataset and Performance Evaluation
for Pedestrian Detection,
BEST16(III: 463-477).
Springer DOI
Dataset, Pedestrians.
Stanford 40 Actions,
A dataset for understanding human actions in still images
HTML Version.
Dataset, Action Recognition.
People Playing Musical Instrument (PPMI),
A dataset of human and object interaction activities
HTML Version.
Dataset, Action Recognition.
Kliper-Gross, O.[Orit],
Hassner, T.[Tal],
Wolf, L.B.[Lior B.],
The Action Similarity Labeling Challenge,
PAMI(34), No. 3, March 2012, pp. 615-621.
IEEE DOI
Dataset, Action Recognition. Labeled dataset. Same/not-same rather than recognition.
Distante, C.[Cosimo],
Diraco, G.[Giovanni],
Leone, A.[Alessandro],
Active Range Imaging Dataset for Indoor Surveillance,
BMVA(2010), No. 3, 2010, pp. 1-14.
PDF File.
Dataset, Action Recognition.
Blunsden, S.[Scott],
Fisher, R.B.[Robert B.],
The BEHAVE video dataset:
Ground truthed video for multi-person behavior classification,
BMVA(2010), No. 4, 2010, pp. 1-12.
PDF File.
Dataset, Action Recognition.
Hwang, S.[Soonmin],
Park, J.[Jaesik],
Kim, N.[Namil],
Choi, Y.[Yukyung],
Kweon, I.S.[In So],
Multispectral pedestrian detection: Benchmark dataset and baseline,
CVPR15(1037-1045)
IEEE DOI
Dataset, Pedestrian Detection.
Wallraven, C.[Christian],
Schultze, M.[Michael],
Mohler, B.[Betty],
Vatakis, A.[Argiro],
Pastra, K.[Katerina],
The POETICON enacted scenario corpus: A tool for human and
computational experiments on action understanding,
FG11(484-491).
IEEE DOI
Dataset, Actions.
Munder, S.,
Gavrila, D.M.[Dariu M.],
An Experimental Study on Pedestrian Classification,
PAMI(28), No. 11, November 2006, pp. 1863-1868.
IEEE DOI PDF File.
Dataset available:
HTML Version.
Dataset, Pedestrians. DaimlerChrysler Res.
Investigate global versus local and adaptive versus nonadaptive features.
PCA coefficients, Haar wavelets, and local receptive fields (LRFs).
SVM, Neural Nets, K-NN classifiers.
Combination of SVMs with LRF features performs best.
And boosted cascade of Haar wavelets is close.
Daimler Pedestrian Detection Benchmark,
2009.
HTML Version.
Dataset, Pedestrian Detection.
Dataset, Surveillance.
See also
Daimler. Training set: 15,560 pedestrian and non-pedestrian samples.
6744 additional images.
Test set: a sequence with more than 21,790 images with 56,492
pedestrian labels. From a vehicle in 27 minutes of urban driving.
VGA resolution.
Dataset used in:
See also
Monocular Pedestrian Detection: Survey and Experiments.
Edinburgh Informatics Forum Pedestrian Database,
2010.
WWW Link.
Dataset, Human Tracking.
Dataset, Surveillance. Overhead views, of a building atrium.
Several months of observations, with trajectories (computed).
Dalal, N.[Navneet],
INRIA Person Dataset,
Online2005
WWW Link.
Dataset, Human Motion. The collected dataset for the above paper, from various sources.
Wu, Y.[Yang],
Liu, Y.L.[Yuan-Liu],
Yuan, Z.J.[Ze-Jian],
Zheng, N.N.[Nan-Ning],
IAIR-CarPed: A psychophysically annotated dataset with fine-grained and
layered semantic labels for object recognition,
PRL(33), No. 2, 15 January 2012, pp. 218-226.
Elsevier DOI
Dataset, Pedestrian Detection. Object recognition; Image database; Object detection; Pedestrian
detection; Psychophysical experiments
García-Martín, Á.[Álvaro],
Martínez, J.M.[José M.],
Bescós, J.[Jesús],
A corpus for benchmarking of people detection algorithms,
PRL(33), No. 2, 15 January 2012, pp. 152-156.
Elsevier DOI
Dataset, Person Detection. People detection; Ground-truth; Corpus; Dataset; Surveillance video
Wang, Q.[Qi],
Gao, J.Y.[Jun-Yu],
Lin, W.[Wei],
Li, X.L.[Xue-Long],
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and
Localization,
PAMI(43), No. 6, June 2021, pp. 2141-2149.
IEEE DOI WWW Link.
WWW Link.
Dataset, Crowd Counting. Benchmark testing, Task analysis, Head, Surveillance, Cameras,
Magnetic heads, Internet, Crowd counting, crowd localization,
benchmark website
Sindagi, V.,
Yasarla, R.,
Patel, V.,
Pushing the Frontiers of Unconstrained Crowd Counting:
New Dataset and Benchmark Method,
ICCV19(1221-1231)
IEEE DOI
Dataset, Crowd Counting. feature extraction, image classification,
learning (artificial intelligence), object detection, Error analysis
Rasouli, A.,
Kotseruba, I.,
Kunic, T.,
Tsotsos, J.K.[John K.],
PIE: A Large-Scale Dataset and Models for Pedestrian Intention
Estimation and Trajectory Prediction,
ICCV19(6261-6270)
IEEE DOI
Dataset, Pedestrians.
WWW Link. intelligent transportation systems, pedestrians,
large-scale dataset, pedestrian intention estimation,
Vehicle dynamics
Zheng, L.[Liang],
Bie, Z.[Zhi],
Sun, Y.F.[Yi-Fan],
Wang, J.D.[Jing-Dong],
Su, C.[Chi],
Wang, S.J.[Sheng-Jin],
Tian, Q.[Qi],
MARS: A Video Benchmark for Large-Scale Person Re-Identification,
ECCV16(VI: 868-884).
Springer DOI
Dataset, Re-Identification.
Yan, C.[Cheng],
Pang, G.S.[Guan-Song],
Wang, L.[Lei],
Jiao, J.[Jile],
Feng, X.T.[Xue-Tao],
Shen, C.H.[Chun-Hua],
Li, J.J.[Jing-Jing],
BV-Person: A Large-scale Dataset for Bird-view Person
Re-identification,
ICCV21(10923-10932)
IEEE DOI
Dataset, Re-Identification. Computational modeling, Benchmark testing, Cameras,
Video surveillance, Search problems, Birds,
Image and video retrieval
Figueira, D.[Dario],
Taiana, M.[Matteo],
Nambiar, A.[Athira],
Nascimento, J.C.[Jacinto C.],
Bernardino, A.[Alexandre],
The HDA+ Data Set for Research on Fully Automated Re-identification
Systems,
Re-Id14(241-255).
Springer DOI
Dataset, Re-Identification.
Ragheb, H.[Hossein],
Velastin, S.A.[Sergio A.],
Remagnino, P.[Paolo],
Ellis, T.[Tim],
Human action recognition using robust power spectrum features,
ICIP08(753-756).
IEEE DOI
And:
ViHASi: Virtual human action silhouette data for the performance
evaluation of silhouette-based action recognition methods,
ICDSC08(1-10).
IEEE DOI
And:
VNBA08(77-84).
DOI Link
And:
A Novel Approach for Fast Action Recognition using Simple Features,
VS08(xx-yy).
Dataset, Action Recognition. Silhouette based action recognition.
Chakraborty, A.[Anirban],
Das, A.[Abir],
Roy-Chowdhury, A.K.[Amit K.],
Network Consistent Data Association,
PAMI(38), No. 9, September 2016, pp. 1859-1871.
IEEE DOI
Earlier: A2, A1, A3:
Consistent Re-identification in a Camera Network,
ECCV14(II: 330-345).
Springer DOI
Dataset, Re-Identification.
WWW Link.
Bialkowski, A.,
Denman, S.,
Sridharan, S.,
Fookes, C.,
Lucey, P.,
A Database for Person Re-Identification in Multi-Camera Surveillance
Networks,
DICTA12(1-8).
IEEE DOI
Dataset, Re-Identification.
Gou, M.,
Karanam, S.,
Liu, W.,
Camps, O.,
Radke, R.J.,
DukeMTMC4ReID:
A Large-Scale Multi-camera Person Re-identification Dataset,
Re-Id17(1425-1434)
IEEE DOI
Dataset, Re-Identification. Airports, Benchmark testing, Cameras, Detectors, Feature extraction,
Measurement, Surveillance
MoCA: Moving Camouflaged Animals dataset,
Online2020.
WWW Link.
Dataset, Animals.
See also
Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation.
Indoor pig behavior RGBD video dataset,
Online2021.
WWW Link.
Dataset, Animals. There are approximately 3.5 million data frames in 1905 clips, each 5
minutes long, for a total of about 160 hours of video.
See also
Extracting Accurate Long-Term Behavior Changes from a Large Pig Dataset.
Truong, C.[Charles],
Barrois-Müller, R.[Rémi],
Moreau, T.[Thomas],
Provost, C.[Clément],
Vienne-Jumeau, A.[Aliénor],
Moreau, A.[Albane],
Vidal, P.P.[Pierre-Paul],
Vayatis, N.[Nicolas],
Buffat, S.[Stéphane],
Yelnik, A.[Alain],
Ricard, D.[Damien],
Oudre, L.[Laurent],
A Data Set for the Study of Human Locomotion with Inertial
Measurements Units,
IPOL(9), 2019, pp. 381-390.
DOI Link
Dataset, Gait. Data set of 1020 multivariate gait signals collected with two inertial
measurement units, from 230 subjects undergoing a fixed protocol:
standing still, walking 10 m, turning around, walking back and
stopping. In total, 8.5~h of gait time series are distributed.
Song, C.F.[Chun-Feng],
Huang, Y.Z.[Yong-Zhen],
Wang, W.N.[Wei-Ning],
Wang, L.[Liang],
CASIA-E: A Large Comprehensive Dataset for Gait Recognition,
PAMI(45), No. 3, March 2023, pp. 2801-2815.
IEEE DOI
Dataset, Gait Recognition. Videos, Gait recognition, Legged locomotion, Face recognition,
Training, Lighting, Benchmark testing, Deep learning, gait dataset,
soft biometrics
Zhu, Z.[Zheng],
Guo, X.D.[Xian-Da],
Yang, T.[Tian],
Huang, J.J.[Jun-Jie],
Deng, J.K.[Jian-Kang],
Huang, G.[Guan],
Du, D.L.[Da-Long],
Lu, J.W.[Ji-Wen],
Zhou, J.[Jie],
Gait Recognition in the Wild: A Benchmark,
ICCV21(14769-14779)
IEEE DOI
Dataset, Gait Recognition.
WWW Link. Biometrics, Datasets and evaluation, Emergency Reviewer
Baseline Algorithm and Performance for Gait Based Human ID
Challenge Problem,
2004, USF.
WWW Link.
Dataset, Gait.
Code, Gait.
Seely, R.D.[Richard D.],
Samangooei, S.[Sina],
Middleton, L.[Lee],
Carter, J.N.[John N.],
Nixon, M.S.[Mark S.],
The University of Southampton Multi-Biometric Tunnel and introducing a
novel 3D gait dataset,
BTAS08(1-6).
IEEE DOI
Dataset, Gait Recognition.
CMU Graphics Lab Motion Capture Database,
2004.
WWW Link.
Dataset, Motion Capture.
Code, Motion Capture. 2000+ examples of motion capture data. Includes some software.
Human3.6M,
Online2014.
WWW Link. Or the original:
WWW Link.
Dataset, Motion Capture.
Dataset, Human Actions.
3.6 Million human poses, various people, various actions.
For the description:
See also
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.
Voisard, C.[Cyril],
de l'Escalopier, N.[Nicolas],
Moreau, A.[Albane],
Vienne-Jumeau, A.[Alienor],
Ricard, D.[Damien],
Oudre, L.[Laurent],
A Reference Data Set for the Study of Healthy Subject Gait with
Inertial Measurements Units,
IPOL(13), 2023, pp. 314-320.
DOI Link
Dataset, Gait.
Hofmann, M.[Martin],
Geiger, J.[Jürgen],
Bachmann, S.[Sebastian],
Schuller, B.[Björn],
Rigoll, G.[Gerhard],
The TUM Gait from Audio, Image and Depth (GAID) database:
Multimodal recognition of subjects and traits,
JVCIR(25), No. 1, 2014, pp. 195-206.
Elsevier DOI
Dataset, Gait. Gait recognition
Kuehne, H.,
Jhuang, H.,
Garrote, E.,
Poggio, T.,
Serre, T.,
HMDB: A large video database for human motion recognition,
ICCV11(2556-2563).
IEEE DOI
Dataset, Action Recognition. The internet has billions of videos, most recognition datasets have
a dozen.
The dataset itself:
See also
HMDB: a large human motion database.
Edinburgh Ceilidh Overhead Video Data,
Dataset, Dance.
WWW Link.
16 ground-truthed dances viewed from overhead, where the 10 dancers
follow a structured dance pattern (2 different dances).
The dances are in the Scottish Ceilidh style (somewhat similar to
American Square Dancing).
Gorelick, L.[Lena],
Blank, M.[Moshe],
Shechtman, E.[Eli],
Irani, M.[Michal],
Basri, R.[Ronen],
Actions as Space-Time Shapes,
PAMI(29), No. 12, December 2007, pp. 2247-2253.
IEEE DOI
Dataset, Actions.
HTML Version.
Earlier: A2, A1, A3, A4, A5:
ICCV05(II: 1395-1402).
IEEE DOI
Award, Helmholtz Prize.
Human action as 3-D shapes induced by silhouettes in the spacetime volume.
Li, R.H.[Rong-Hui],
Zhao, J.[Junfan],
Zhang, Y.[Yachao],
Su, M.Y.[Ming-Yang],
Ren, Z.[Zeping],
Zhang, H.[Han],
Tang, Y.S.[Yan-Song],
Li, X.[Xiu],
FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance
Generation,
ICCV23(10200-10209)
IEEE DOI
Dataset, Dance.
de la Torre-Frade, F.[Fernando],
Hodgins, J.K.[Jessica K.],
Bargteil, A.W.[Adam W.],
Artal, X.M.[Xavier Martin],
Macey, J.C.[Justin C.],
Collado I Castells, A.[Alexandre], and
Beltran, J.[Josep],
Guide to the Carnegie Mellon University Multimodal Activity
(CMU-MMAC) Database,
CMU-RI-TR-08-22, April, 2008.
WWW Link.
Dataset, Activity Recognition.
Liu, J.[Jun],
Shahroudy, A.[Amir],
Perez, M.[Mauricio],
Wang, G.[Gang],
Duan, L.Y.[Ling-Yu],
Kot, A.C.[Alex C.],
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity
Understanding,
PAMI(42), No. 10, October 2020, pp. 2684-2701.
IEEE DOI WWW Link. Or:
WWW Link.
Dataset, Human Activity. Benchmark testing, Cameras,
Deep learning, Semantics, Lighting, Skeleton, Activity understanding,
large-scale benchmark
Shahroudy, A.[Amir],
Liu, J.,
Ng, T.T.[Tian-Tsong],
Wang, G.,
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis,
CVPR16(1010-1019)
IEEE DOI
Dataset, Human Activity.
FCVID: Fudan-Columbia Video Dataset,
WWW Link.
Dataset, Activity Recognition. 90,000+ videos, manually annotated for 239 categories.
Human activities.
Ben-Shabat, Y.Z.[Yi-Zhak],
Yu, X.[Xin],
Saleh, F.[Fatemeh],
Campbell, D.[Dylan],
Rodriguez-Opazo, C.[Cristian],
Li, H.D.[Hong-Dong],
Gould, S.[Stephen],
The IKEA ASM Dataset: Understanding People Assembling Furniture
through Actions, Objects and Pose,
WACV21(846-858)
IEEE DOI WWW Link.
Dataset, Activity Recognition. Deep learning, Annotations,
Pose estimation, Object segmentation, Benchmark testing
Corona, K.[Kellie],
Osterdahl, K.[Katie],
Collins, R.[Roderic],
Hoogs, A.J.[Anthony J.],
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity
Detection,
WACV21(1059-1067)
IEEE DOI
Dataset, Activity Detection. Solid modeling, Visualization,
Annotations, NIST, Cameras
See also
Multiview Extended Video with Activities.
Damen, D.[Dima],
Doughty, H.[Hazel],
Farinella, G.M.[Giovanni Maria],
Furnari, A.[Antonino],
Kazakos, E.[Evangelos],
Ma, J.[Jian],
Moltisanti, D.[Davide],
Munro, J.[Jonathan],
Perrett, T.[Toby],
Price, W.[Will],
Wray, M.[Michael],
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for
EPIC-KITCHENS-100,
IJCV(130), No. 1, January 2022, pp. 33-55.
Springer DOI
Dataset, Egocentric Actions.
Damen, D.[Dima],
Doughty, H.[Hazel],
Farinella, G.M.[Giovanni Maria],
Fidler, S.[Sanja],
Furnari, A.[Antonino],
Kazakos, E.[Evangelos],
Moltisanti, D.[Davide],
Munro, J.[Jonathan],
Perrett, T.[Toby],
Price, W.[Will],
Wray, M.[Michael],
Scaling Egocentric Vision: The Epic Kitchens Dataset,
ECCV18(II: 753-771).
Springer DOI
Dataset, Egocentric Actions.
Delgado, K.[Kevin],
Origgi, J.M.[Juan Manuel],
Hasanpoor, T.[Tania],
Yu, H.[Hao],
Allessio, D.[Danielle],
Arroyo, I.[Ivon],
Lee, W.[William],
Betke, M.[Margrit],
Woolf, B.[Beverly],
Bargal, S.A.[Sarah Adel],
Student Engagement Dataset,
ABAW21(3621-3629)
IEEE DOI
Dataset, Classrooms. Training, Deep learning, Visualization, Head,
Distance learning, Time series analysis
Edinburgh office monitoring video dataset,
2021.
WWW Link.
Dataset, Office Monitor.
This dataset consists of video, image frames, and ground truth for 20
days of monitoring people in 4 different offices. The data is
acquired using a fixed camera as a set of 1280*720 pixel color images
captured at an average of about 1 FPS. This dataset is interesting
because there are about 450K labeled frames of people doing standard
office activities. The ground truth is the position of each person in
each image with a bounding box, plus their behavior. Four behaviors
are annotated (standing/walking, sitting, two or three people are
talking, or the person in room has fallen).
Paper to appear CVPR21.
Zhang, J.[Jing],
Li, W.Q.[Wan-Qing],
Ogunbona, P.O.[Philip O.],
Wang, P.[Pichao],
Tang, C.[Chang],
RGB-D-based action recognition datasets: A survey,
PR(60), No. 1, 2016, pp. 86-105.
Elsevier DOI
Dataset, Action Recognition. Action recognition
Laptev, I.[Ivan],
Caputo, B.[Barbara],
Schuldt, C.[Christian],
Lindeberg, T.[Tony],
Local velocity-adapted motion events for spatio-temporal recognition,
CVIU(108), No. 3, December 2007, pp. 207-229.
Elsevier DOI
Earlier: A3, A1, A2, Only:
Recognizing human actions: a local SVM approach,
ICPR04(III: 32-36).
IEEE DOI
Dataset, Actions.
WWW Link. Motion; Local features; Motion descriptors; Matching; Velocity adaptation;
Action recognition; Learning; SVM
Penn Action Dataset,
2013.
WWW Link.
Dataset, Facial Landmarks.
See also
From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding.
Li, W.H.[Wen-Hui],
Wong, Y.K.[Yong-Kang],
Liu, A.A.[An-An],
Li, Y.[Yang],
Su, Y.T.[Yu-Ting],
Kankanhalli, M.[Mohan],
Multi-Camera Action Dataset for Cross-Camera Action Recognition
Benchmarking,
WACV17(187-196)
IEEE DOI
Dataset, Action Recognition.
HTML Version. Multi-Camera Action Dataset (MCAD).
Benchmark testing, Cameras, Heuristic algorithms,
Internet, Robustness, Surveillance
Barekatain, M.,
Martí, M.,
Shih, H.F.,
Murray, S.,
Nakayama, K.,
Matsuo, Y.,
Prendinger, H.,
Okutama-Action:
An Aerial View Video Dataset for Concurrent Human Action Detection,
PETS17(2153-2160)
IEEE DOI
Dataset, Okutama-Action. Cameras, Data collection, Mobile communication,
Surveillance, Training, Video, sequences
Zhao, H.,
Torralba, A.,
Torresani, L.,
Yan, Z.,
HACS: Human Action Clips and Segments Dataset for Recognition and
Temporal Localization,
ICCV19(8667-8677)
IEEE DOI
WWW Link.
Dataset, Human Actions. image classification, image motion analysis, image segmentation,
learning (artificial intelligence), video signal processing,
YouTube
Kong, Q.,
Wu, Z.,
Deng, Z.,
Klinkigt, M.,
Tong, B.,
Murakami, T.,
MMAct: A Large-Scale Dataset for Cross Modal Human Action
Understanding,
ICCV19(8657-8666)
IEEE DOI
Dataset, Human Actions. image colour analysis, image motion analysis,
image recognition, video signal processing, RGB videos,
Task analysis
Ofli, F.,
Chaudhry, R.,
Kurillo, G.,
Vidal, R.,
Bajcsy, R.,
Berkeley MHAD: A comprehensive Multimodal Human Action Database,
WACV13(53-60).
IEEE DOI
Dataset, Human Actions.
Ji, Y.L.[Yan-Li],
Yang, Y.,
Shen, F.,
Shen, H.T.,
Zheng, W.S.,
Arbitrary-View Human Action Recognition:
A Varying-View RGB-D Action Dataset,
CirSysVideo(31), No. 1, January 2021, pp. 289-300.
IEEE DOI
Dataset, Action Recognition. Skeleton, Sensors, Videos, Dictionaries, Robots, HRI
Vaquette, G.,
Orcesi, A.,
Lucat, L.,
Achard, C.,
The DAily Home LIfe Activity Dataset:
A High Semantic Activity Dataset for Online Recognition,
FG17(497-504)
IEEE DOI
Dataset, Smart Home. Cameras, Databases, Protocols, Semantics, Sensors, Skeleton, Videos
Ragusa, F.[Francesco],
Furnari, A.[Antonino],
Livatino, S.[Salvatore],
Farinella, G.M.[Giovanni Maria],
The MECCANO Dataset: Understanding Human-Object Interactions from
Egocentric Videos in an Industrial-like Domain,
WACV21(1568-1577)
IEEE DOI WWW Link.
Dataset, Interactions. Taxonomy, Motorcycles,
Object detection, Tools, Object recognition
UDIVA Dataset,
2021
WWW Link.
Dataset, Social Interaction. Non-acted datasetof face-to-face dyadic interactions.
WACV Paper.
See also
Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset.
Gong, W.J.[Wen-Juan],
Gonzàlez, J.[Jordi],
Tavares, J.M.R.S.[João Manuel R.S.],
Xavier Roca, F.,
A New Image Dataset on Human Interactions,
AMDO12(204-209).
Springer DOI
Dataset, Action Recognition.
Burger, S.[Susanne],
The CHIL RT07 Evaluation Data,
MTPH07(xx-yy).
Springer DOI
Dataset, Activity Recogniton.
Truong, C.[Charles],
Atiq, M.[Mounir],
Minvielle, L.[Ludovic],
Serra, R.[Renan],
Mougeot, M.[Mathilde],
Vayatis, N.[Nicolas],
A Data Set for Fall Detection with Smart Floor Sensors,
IPOL(13), 2023, pp. 183-197.
DOI Link
Dataset, Fall Detection.
Fouhey, D.F.,
Kuo, W.,
Efros, A.A.,
Malik, J.,
From Lifestyle VLOGs to Everyday Interactions,
CVPR18(4991-5000)
IEEE DOI
Dataset, Action.
HTML Version. Videos, YouTube, Task analysis, Cameras, Internet, Benchmark testing
CVBASE Annotated Video Data,
2006.
HTML Version.
Dataset, Video.
Olympic Sports Dataset,
2010
WWW Link.
Dataset, Sports. The Olympic Sports Dataset contains videos of athletes practicing
different sports. We have obtained all video sequences from YouTube
and annotated their class label with the help of Amazon Mechanical
Turk.
Refer to:
See also
Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification.
UCF Sports Action Dataset,
2008.
WWW Link. Details:
WWW Link. A large set of sports actions.
Dataset, Sports. Note that many of the other non UCF links to data on that page are out of
date.
LHI Sports Activity Dataset,
Subset of larger dataset.
Online2008
HTML Version.
Dataset, Sports.
See also
Lotus Hill Institute.
MEXaction2 action detection and localization dataset,
2015.
WWW Link.
Dataset, Actions. The aim of the MEXaction2 dataset is to support the development and
evaluation of methods for spotting instances of short actions in a
relatively large video database.
Actions: BullChargeCape (1324) and HorseRiding (651).
Zhang, W.C.[Wei-Chen],
Liu, Z.G.[Zhi-Guang],
Zhou, L.Y.[Liu-Yang],
Leung, H.[Howard],
Chan, A.B.[Antoni B.],
Martial Arts, Dancing and Sports dataset: A challenging stereo and
multi-view dataset for 3D human pose estimation,
IVC(61), No. 1, 2017, pp. 22-39.
Elsevier DOI
Dataset, Human Activities. Human pose estimation
Zalluhoglu, C.[Cemil],
Ikizler-Cinbis, N.[Nazli],
Collective Sports: A multi-task dataset for collective activity
recognition,
IVC(94), 2020, pp. 103870.
Elsevier DOI
Dataset, Sports. Collective activity recognition, Action recognition,
Convolutional neural networks, Multi-task learning, LSTM
Li, Y.X.[Yi-Xuan],
Chen, L.[Lei],
He, R.[Runyu],
Wang, Z.Z.[Zhen-Zhi],
Wu, G.S.[Gang-Shan],
Wang, L.M.[Li-Min],
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally
Localized Sports Actions,
ICCV21(13516-13525)
IEEE DOI
Dataset, Sports. Location awareness, Annotations, Error analysis, Benchmark testing,
Complexity theory, Standards, Action and behavior recognition,
Datasets and evaluation
Zhang, C.L.[Chuan-Lei],
Liu, L.X.[Li-Xin],
Yao, M.[Minda],
Chen, W.[Wei],
Chen, D.F.[Du-Feng],
Wu, Y.L.[Yu-Liang],
HSiPu2: A New Human Physical Fitness Action Dataset for Recognition
and 3D Reconstruction Evaluation,
VOCVALC21(3166-3175)
IEEE DOI
Dataset, Physical Fitness. Support vector machines, Solid modeling,
Setti, F.[Francesco],
Conigliaro, D.[Davide],
Rota, P.[Paolo],
Bassetti, C.[Chiara],
Conci, N.[Nicola],
Sebe, N.[Nicu],
Cristani, M.[Marco],
The S-Hock dataset: A new benchmark for spectator crowd analysis,
CVIU(159), No. 1, 2017, pp. 47-58.
Elsevier DOI
Dataset, Crowd Analysis.
Earlier: A2, A3, A1, A4, A5, A6, A7:
The S-HOCK dataset: Analyzing crowds at the stadium,
CVPR15(2039-2047)
IEEE DOI
Spectator, monitoring
Ali, S.[Saad],
Shah, M.[Mubarak],
Floor Fields for Tracking in High Density Crowd Scenes,
ECCV08(II: 1-14).
Springer DOI PDF File.
Dataset, Tracking.
WWW Link.
Ali, S.[Saad],
Shah, M.[Mubarak],
A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and
Stability Analysis,
CVPR07(1-6).
IEEE DOI PDF File.
Dataset, Surveillance. The dataset for this paper is available:
WWW Link. UCF Lists:
WWW Link. But no link to data.
Ali, S.[Saad],
Crowd Flow Segmentation and Stability Analysis,
Online2007
HTML Version. The more general discussion of the issues of the other papers.
Includes a more complete dataset and pointers to other useful code.
Dataset, Surveillance.
WWW Link.
Multimodal Meme Classification Identifying Offensive Content in Image and Text,
2019.
WWW Link.
Dataset, Offensive Images.
MultOFF Dataset.
Cheng, M.[Ming],
Cai, K.J.[Kun-Jing],
Li, M.[Ming],
RWF-2000: An Open Large Scale Video Database for Violence Detection,
ICPR21(4183-4190)
IEEE DOI
Dataset, Violence. Image motion analysis, Databases, Surveillance,
Logic gates, Cameras
Ntalampiras, S.[Stavros],
Arsic, D.[Dejan],
Hofmann, M.[Martin],
Andersson, M.[Maria],
Ganchev, T.[Todor],
PROMETHEUS: heterogeneous sensor database in support of research on
human behavioral patterns in unrestricted environments,
SIViP(8), No. 7, October 2014, pp. 1211-1231.
Springer DOI
Dataset, Human Activity.
IAUFD: A 100k images dataset for automatic football image/video analysis,
2022.
WWW Link.
Dataset, Event Detection.
Dataset, Sports.
Penate-Sanchez, A.[Adrian],
Freire-Obregón, D.[David],
Lorenzo-Melián, A.[Adrián],
Lorenzo-Navarro, J.[Javier],
Castrillón-Santana, M.[Modesto],
TGC20ReId: A dataset for sport event re-identification in the wild,
PRL(138), 2020, pp. 355-361.
Elsevier DOI
Dataset, Sports. Sport, Re-identification, Dataset
Abrams, A.[Austin],
Tucek, J.[Jim],
Little, J.[Joshua],
Jacobs, N.[Nathan],
Pless, R.[Robert],
LOST: Longterm Observation of Scenes (with Tracks),
WACV12(297-304).
IEEE DOI
Using the data, same half hour every day.
Dataset, Surveillance.
Rebecq, H.[Henri],
Ranftl, R.[René],
Koltun, V.[Vladlen],
Scaramuzza, D.[Davide],
High Speed and High Dynamic Range Video with an Event Camera,
PAMI(43), No. 6, June 2021, pp. 1964-1980.
IEEE DOI
Earlier:
Events-To-Video: Bringing Modern Computer Vision to Event Cameras,
CVPR19(3852-3861).
IEEE DOI
Code, HDR.
Dataset, HDR.
Dataset, E2VID.
HTML Version. Image reconstruction, Cameras, Streaming media, Dynamic range,
Brightness, Heuristic algorithms,
high dynamic range
DAVIS: Densely Annotated VIdeo Segmentation,
WWW Link.
2017.
Dataset, Video Segmentation. For the competition at CVPR 2017.
Video Instance Segmentation - YouTube-VOS,
WWW Link.
Dataset, Video Segmentation. Dataset for video instance segmentation.
Video Instance Segmentation - YouTube-VOS,
WWW Link.
Dataset, Video Segmentation. Dataset for video instance segmentation.
And related to Youtube-VIS.
Qi, J.Y.[Ji-Yang],
Gao, Y.[Yan],
Hu, Y.[Yao],
Wang, X.G.[Xing-Gang],
Liu, X.Y.[Xiao-Yu],
Bai, X.[Xiang],
Belongie, S.[Serge],
Yuille, A.L.[Alan L.],
Torr, P.H.S.[Philip H.S.],
Bai, S.[Song],
OVIS: Occluded Video Instance Segmentation,
Online2021.
WWW Link.
Dataset, Video Segmentation. Designed with the philosophy of perceiving object occlusions in
videos, which could reveal the complexity and the diversity of
real-world scenes.
Perazzi, F.[Federico],
Pont-Tuset, J.,
McWilliams, B.,
Van Gool, L.J.,
Gross, M.,
Sorkine-Hornung, A.[Alexander],
A Benchmark Dataset and Evaluation Methodology for Video Object
Segmentation,
CVPR16(724-732)
IEEE DOI
Dataset, Video Segmentation.
Change Detection Benchmark Website,
2012
Dataset, Motion Detection.
WWW Link. Dataset for the 2012 Change Detection workshop at CVPR.
Scene Background Initialization (SBI) Dataset,
2016
HTML Version.
Dataset, Background.
14 sequences with ground truth.
See also
Towards Benchmarking Scene Background Initialization.
Mahmood, M.H.[Muhammad Habib],
Díez, Y.[Yago],
Salvi, J.[Joaquim],
Lladó, X.[Xavier],
A collection of challenging motion segmentation benchmark datasets,
PR(61), No. 1, 2017, pp. 1-14.
Elsevier DOI
Dataset, Motion Segmentation. Motion segmentation
Mahmood, M.H.[Muhammad Habib],
Zappella, L.[Luca],
Díez, Y.[Yago],
Salvi, J.[Joaquim],
Lladó, X.[Xavier],
A New Trajectory Based Motion Segmentation Benchmark Dataset (UdG-MS15),
IbPRIA15(463-470).
Springer DOI
Dataset, Motion Segmentation.
Cuevas, C.[Carlos],
Yáñez, E.M.[Eva María],
García, N.[Narciso],
Labeled dataset for integral evaluation of moving object detection
algorithms: LASIESTA,
CVIU(152), No. 1, 2016, pp. 103-117.
Elsevier DOI
Dataset, Foreground Detection. Database
Vacavant, A.[Antoine],
Chateau, T.[Thierry],
Wilhelm, A.[Alexis],
Lequièvre, L.[Laurent],
A Benchmark Dataset for Outdoor Foreground/Background Extraction,
BMC12(I:291-300).
Springer DOI
Dataset, Foreground Extraction. Surveillance applications.
Image Stitching Database,
2010
HTML Version.
Dataset, Image Stitching.
Richter, S.R.[Stephan R.],
Hayder, Z.[Zeeshan],
Koltun, V.[Vladlen],
Playing for Benchmarks,
ICCV17(2232-2241)
IEEE DOI
Dataset, Video. image annotation, image resolution, image segmentation,
image sequences, object detection, object tracking,
Stottinger, J.[Julian],
Zambanini, S.[Sebastian],
Khan, R.[Rehanullah],
Hanbury, A.[Allan],
FeEval A Dataset for Evaluation of Spatio-temporal Local Features,
ICPR10(499-502).
IEEE DOI
Dataset, Motion.
Avola, D.,
Cinque, L.,
Foresti, G.L.,
Martinel, N.,
Pannone, D.,
Piciarelli, C.,
A UAV Video Dataset for Mosaicking and Change Detection From
Low-Altitude Flights,
SMCS(50), No. 6, June 2020, pp. 2139-2149.
IEEE DOI
Dataset, Change Detection. Video sequences, Change detection algorithms, Cameras,
Detection algorithms, Task analysis, Telemetry,
unmanned aerial vehicle (UAV)
SuperTex136,
2016
WWW Link.
Dataset, Superresolution. Refer to:
See also
Jointly Optimized Regressors for Image Super-resolution.
Set5, Set14, Urban 100, BSD 100, Sun-Hays 80 Datasets,
Dataset, Super Resolution. Linkd from:
WWW Link.
Wang, Y.Q.[Ying-Qian],
Wang, L.G.[Long-Guang],
Yang, J.G.[Jun-Gang],
An, W.[Wei],
Guo, Y.L.[Yu-Lan],
Flickr1024: A Large-Scale Dataset for Stereo Image Super-Resolution,
CLI19(3852-3857)
IEEE DOI
Dataset, Flickr.
Dataset, Super Resolution.
WWW Link. cameras, data acquisition, image resolution,
stereo image processing, large-scale stereo dataset, super resolution
Tulyakov, S.[Stepan],
Gehrig, D.[Daniel],
Georgoulis, S.[Stamatios],
Erbach, J.[Julius],
Gehrig, M.[Mathias],
Li, Y.[Yuanyou],
Scaramuzza, D.[Davide],
Time Lens: Event-based Video Frame Interpolation,
CVPR21(16150-16159)
IEEE DOI HTML Version.
Code, Frame Interpolation.
Dataset, Frame Interpolation. Interpolation, Visualization, Image color analysis,
Benchmark testing, Cameras, Sensors
Xiao, J.X.[Jian-Xiong],
Owens, A.[Andrew],
Torralba, A.B.[Antonio B.],
SUN3D:
A Database of Big Spaces Reconstructed Using SfM and Object Labels,
ICCV13(1625-1632)
IEEE DOI
Dataset, Scene Understanding.
WWW Link. RGB-D Video dataset. Camera pose and object labels.
Interactive reconstruction process.
Shugrina, M.[Maria],
Liang, Z.H.[Zi-Heng],
Kar, A.[Amlan],
Li, J.[Jiaman],
Singh, A.[Angad],
Singh, K.[Karan],
Fidler, S.[Sanja],
Creative Flow+ Dataset,
CVPR19(5379-5388).
IEEE DOI
Dataset, Optical Flow.
WWW Link. Video dataset richly labeled with per-pixel optical flow, occlusions,
correspondences, segmentation labels, normals, and depth.
Mayer, N.[Nikolaus],
Ilg, E.[Eddy],
Häusser, P.[Philip],
Fischer, P.[Philipp],
Cremers, D.[Daniel],
Dosovitskiy, A.[Alexey],
Brox, T.[Thomas],
A Large Dataset to Train Convolutional Networks for Disparity,
Optical Flow, and Scene Flow Estimation,
CVPR16(4040-4048)
IEEE DOI
Dataset, Optical Flow.
Baker, S.[Simon],
Scharstein, D.[Daniel],
Lewis, J.P.,
Roth, S.[Stefan],
Black, M.J.[Michael J.],
Szeliski, R.S.[Richard S.],
A Database and Evaluation Methodology for Optical Flow,
IJCV(92), No. 1, March 2011, pp. 1-31.
Springer DOI
Earlier: A1, A4, A2, A5, A3, A6:
ICCV07(1-8).
IEEE DOI
Dataset, Optical Flow.
WWW Link.
Song, H.O.,
Xiang, Y.,
Jegelka, S.[Stefanie],
Savarese, S.[Silvio],
Deep Metric Learning via Lifted Structured Feature Embedding,
CVPR16(4004-4012)
IEEE DOI
Stanford Online Products.
Dataset, Products.
Nascimento, S.M.C.,
Ferreira, F., and
Foster, D.H.,
Statistics of spatial cone-excitation ratios in natural scenes,
JOSA-A(19), No. 8, August 2002, pp. 1484-1490.
PDF File.
Dataset, Hyperspectral.
HTML Version.
Foster, D.H.,
Nascimento, S.M.C.,
Amano, K.,
Information limits on neural identification of coloured surfaces
in natural scenes,
Visual Neuroscience(21), 2004, pp. 331-336.
PDF File.
Dataset, Hyperspectral.
HTML Version.
Cerra, D.[Daniele],
Pato, M.[Miguel],
Alonso, K.[Kevin],
Köhler, C.[Claas],
Schneider, M.[Mathias],
de los Reyes, R.[Raquel],
Carmona, E.[Emiliano],
Richter, R.[Rudolf],
Kurz, F.[Franz],
Reinartz, P.[Peter],
Müller, R.[Rupert],
DLR HySU: A Benchmark Dataset for Spectral Unmixing,
RS(13), No. 13, 2021, pp. xx-yy.
DOI Link
Dataset, Unmixing.
Bossard, L.[Lukas],
Guillaumin, M.[Matthieu],
Van Gool, L.J.[Luc J.],
Food-101: Mining Discriminative Components with Random Forests,
ECCV14(VI: 446-461).
Springer DOI
Dataset, Food. 101 food categories, with 101’000 images
recognizing pictured dishes.
Wang, X.H.[Xiao-Han],
Eliott, F.M.[Fernanda M.],
Ainooson, J.[James],
Palmer, J.H.[Joshua H.],
Kunda, M.[Maithilee],
An Object is Worth Six Thousand Pictures:
The Egocentric, Manual, Multi-image (EMMI) Dataset,
Egocentric17(2364-2372)
IEEE DOI WWW Link.
Dataset, Learning. Egocentric, Manual, Multi-Image (EMMI) Dataset.
Automobiles, Cameras, Manuals, Object recognition,
Toy manufacturing industry, Training, Visualization
Agarwal, S.[Shivani],
Awan, A.[Aatif], and
Roth, D.[Dan],
Learning to Detect Objects in Images via a Sparse, Part-Based
Representation,
PAMI(26), No. 11, November 2004, pp. 1475-1490.
IEEE Abstract. Or:
PDF File.
WWW Link.
Dataset, Vehicles. Detecting specific object classes (e.g. cars).
Messikommer, N.[Nico],
Gehrig, D.[Daniel],
Loquercio, A.[Antonio],
Scaramuzza, D.[Davide],
Event-based Asynchronous Sparse Convolutional Networks,
ECCV20(VIII:415-431).
Springer DOI
WWW Link.
Code, Semantic Segmentation.
WWW Link.
Dataset, Semantic Segmentation.
Borji, A.[Ali],
Izadi, S.[Saeed],
Itti, L.[Laurent],
iLab-20M:
A Large-Scale Controlled Object Dataset to Investigate Deep Learning,
CVPR16(2221-2230)
IEEE DOI
Dataset, Learning.
300 Videos in the Wild,
2015
Dataset, Faces.
WWW Link. Used for the ICCV 2015 workshop challenge.
WIDER Attribute dataset,
2016.
WWW Link.
Dataset, Faces.
See also
Human Attribute Recognition by Deep Hierarchical Contexts.
Description of the Collection of Facial Images,
2007
Dataset, Faces.
HTML Version. Essex collection of faces. 395 people, 20 images each.
Annotated Facial Dataset,
2007
Dataset, Faces.
WWW Link.
The CMU Multi-PIE Face Database,
2010
Dataset, Faces.
WWW Link.
It contains 337 subjects, captured under 15 view points and 19
illumination conditions in four recording sessions for a total of more
than 750,000 images.
FaceScrub Annotated Face Dataset,
2014
Dataset, Faces.
HTML Version.
100,000 images of 530 people. Acquired from internet search with rejection
of pictures that do not match.
See also
data-driven approach to cleaning large face datasets, A.
GVVPerfcapEva Repository of Evaluation Data Sets,
2015
Dataset, Faces.
Dataset, Human Motion.
Dataset, Hand Tracking.
WWW Link. A set of dataset including:
GVVPerfCapEva: IDT - Full body skeletal motion capture results from from
body-worn inertial sensor data and depth camera recordings
GVVPerfCapEva: Dexter 1: Evaluation data set for 3D hand tracking with
depth and multi-view video data
GVVPerfCapEva: PDT 2013: Body shape estimation and real-time motion
capture with a depth camera
GVVPerfcapEva: BinoCap - Dense 3D full-body performance capture with
handheld stereo cameras (single + multiple person(s))
GVVPerfcapEva: MonFacecCap - Monocular dense face performance capture
GVVPerfCapEva: MVIC - markerless multi-view performance capture of
multiple interacting characters
GVVPerfCapEva: HKIC: Performance capture of interacting characters with
handheld Kinects
MPII Human Shape,
2015
Dataset, Human Pose.
WWW Link. Expressive 3D human body shape models and tools for human shape space building.
UB KinFace Database,
2011
Dataset, Faces.
HTML Version.
Yale Face Database,
Online2006.
First is 165 images.
HTML Version. And
5760 single light source images of 10 subjects each seen under
576 viewing conditions
HTML Version.
Dataset, Faces.
The University of Oulu Physics-Based Face Database,
2000.
125 different faces each in 16 different camera
calibration and illumination conditions.
WWW Link.
Dataset, Faces.
The University of Oulu Face Video Database,
2002.
WWW Link.
Dataset, Faces.
The CAS-PEAL Large-Scale Chinese Face Database
and Baseline Evaluations,
2004.
9,594 images of 1040 individuals (595 males and 445 females)
with varying Pose, Expression, Accessory, and Lighting
HTML Version.
Dataset, Faces.
MIT Face Recognition Database,
Online2000
Fi
Dataset, Faces.
HTML Version.
HTML Version. First one is small (19X19) images.
Second one has training and test data.
The UMIST Face Database,
1998.
Face Recognition.
HTML Version.
Dataset, Faces.
NIST Mugshot Identification Database,
2002.
HTML Version.
Dataset, Faces.
IARPA Janus Benchmark A (IJB-A) dataset,
2017.
WWW Link.
Dataset, Faces.
The ORL Database of Faces,
1992-1994.
More recently called the AT&T database.
HTML Version.
Dataset, Faces.
PubFig: Public Figures Face Database,
2015
Dataset, Faces.
WWW Link.
58,797 images of 200 people collected from the internet.
Refer to:
See also
Attribute and simile classifiers for face verification.
Peer, P.[Peter],
CVL Face Database,
Online1999.
Dataset, Faces.
HTML Version. 114 people, 7 images each.
POSTECH Face Database,
2001
Dataset, Faces.
Dataset, Expressions.
Dataset, Gesture.
HTML Version. A variety of datasets for face recognition, expression recognition,
gesture recognition, and video surveillance.
See also
POSTECH face database (PF07) and performance evaluation, The.
Face Recognition Vendor Test 2006,
Online2006.
WWW Link.
Dataset, Faces.
WWW Link.
Results in February 2007.
FacePix Database,
Online2009.
WWW Link.
Dataset, Faces. 181 poses 1 degree apart plus lighting (direction) changes.
See also
Arizona State University.
YouTube Faces DB,
2015
Dataset, Faces.
WWW Link. A database of face videos designed for studying the problem of
unconstrained face recognition in videos. The data set contains 3,425
videos of 1,595 different people.
Oxford Town Center,
2009
Dataset, Human Tracking.
WWW Link.
Pedestrian detection and tracking.
CHUK Datasets,
2009
Dataset, Pedestrian Tracking.
Dataset, Crowd Analysis.
Dataset, Pedestrian Detection.
Dataset, Re-Identification.
HTML Version.
Person search, re-identification
A View From Somewhere (AVFS),
2023
Dataset, Face Similarity.
WWW Link. A dataset of 638,180 human judgments of face similarity.
Jain, V.[Vidit],
Learned-Miller, E.G.[Erick G.],
FDDB: Face Detection Data Set and Benchmark,
UMass2010, Technical Report 2010-009.
WWW Link.
Dataset, Faces. annotations for 5171 faces in a set of 2845 images.
Subset of
See also
Labeled faces in the wild: A database for studying face recognition in unconstrained environments.
Huang, G.B.,
Ramesh, M.,
Berg, T.L.,
Learned-Miller, E.G.,
Labeled faces in the wild:
A database for studying face recognition in unconstrained environments,
UMass2007, Technical Report 07-49.
annotated faces captured from news articles on the web.
Dataset, Faces.
WWW Link. Detected using:
See also
Robust Real-Time Face Detection.
Phillips, P.J.,
Moon, H.J.,
Rizvi, S.A.,
Rauss, P.J.,
The FERET Evaluation Methodology for Face-Recognition Algorithms,
PAMI(22), No. 10, October 2000, pp. 1090-1104.
IEEE DOI
Evaluation, Faces.
Dataset, Faces.
Earlier: A1, A2, A4, A3:
CVPR97(137-143).
IEEE DOI PDF File.
Evaluation; data.
Phillips, P.J.[P. Jonathon],
Wechsler, H.[Harry],
Huang, J.[Jeffery],
Rauss, P.J.[Patrick J.],
The FERET Database and Evaluation Procedure for
Face-Recognition Algorithms,
IVC(16), No. 5, April 27 1998, pp. 295-306.
Elsevier DOI
Evaluation, Faces.
Dataset, Faces.
The FERET Database,
NIST1993.
WWW Link.
Dataset, Faces. Old version. For Color --
See also
Color FERET Database, The.
See also
National Institute of Standards and Technology (NIST) Intelligent Systems Division.
The Color FERET Database,
NISTJanuary 2008.
WWW Link.
Dataset, Faces.
Wong, Y.W.[Yee Wan],
Ch'ng, S.I.[Sue Inn],
Seng, K.P.[Kah Phooi],
Ang, L.M.[Li-Minn],
Chin, S.W.[Siew Wen],
Chew, W.J.[Wei Jen],
Lim, K.H.[King Hann],
A new multi-purpose audio-visual UNMC-VIER database with multiple
variabilities,
PRL(32), No. 13, 1 October 2011, pp. 1503-1510.
Elsevier DOI
Dataset, Faces. Audio-visual database; Face recognition; Speech recognition; Visual variation
Mavadati, S.M.[S. Mohammad],
Mahoor, M.H.[Mohammad H.],
Bartlett, K.[Kevin],
Trinh, P.[Philip],
Cohn, J.F.[Jeffrey F.],
DISFA: A Spontaneous Facial Action Intensity Database,
AffCom(4), No. 2, 2013, pp. 151-160.
IEEE DOI
Dataset, Facial Action. Databases
Zhang, X.[Xing],
Yin, L.J.[Li-Jun],
Cohn, J.F.[Jeffrey F.],
Canavan, S.[Shaun],
Reale, M.[Michael],
Horowitz, A.[Andy],
Liu, P.[Peng],
Girard, J.M.[Jeffrey M.],
BP4D-Spontaneous: a high-resolution spontaneous 3D dynamic facial
expression database,
IVC(32), No. 10, 2014, pp. 692-706.
Elsevier DOI
Earlier: A1, A2, A3, A4, A5, A6, A7, Only:
A high-resolution spontaneous 3D dynamic facial expression database,
FG13(1-6)
IEEE DOI
Dataset, Facial Expressions. emotion recognition
3D facial expression
Yin, L.J.[Li-Jun],
Chen, X.C.[Xiao-Chen],
Sun, Y.[Yi],
Worm, T.[Tony],
Reale, M.[Michael],
A high-resolution 3D dynamic facial expression database,
FG08(1-6).
IEEE DOI
Dataset, Facial Expressions.
Cheema, U.[Usman],
Moon, S.[Seungbin],
Sejong face database: A multi-modal disguise face database,
CVIU(208-209), 2021, pp. 103218.
Elsevier DOI
Dataset, Face Recognition. Biometrics, Disguise recognition, Face database,
Face recognition, Multi-modal
Poster, D.[Domenick],
Thielke, M.[Matthew],
Nguyen, R.[Robert],
Rajaraman, S.[Srinivasan],
Di, X.[Xing],
Fondje, C.N.[Cedric Nimpa],
Patel, V.M.[Vishal M.],
Short, N.J.[Nathaniel J.],
Riggan, B.S.[Benjamin S.],
Nasrabadi, N.M.[Nasser M.],
Hu, S.[Shuowen],
A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset,
WACV21(1558-1567)
IEEE DOI PDF File.
Dataset, Face Recognition. Heating systems, Protocols, Thermal lensing, Photothermal effects,
Cameras, Thermal analysis, Task analysis
Cao, J.,
Li, Y.,
Zhang, Z.,
Celeb-500K: A Large Training Dataset for Face Recognition,
ICIP18(2406-2410)
IEEE DOI
Dataset, Face Recognition. Training, Face, Face recognition, Measurement, Learning systems,
Performance gain, Face detection, face recognition, face dataset,
convolutional neural networks
Whitelam, C.,
Taborsky, E.,
Blanton, A.,
Maze, B.,
Adams, J.,
Miller, T.,
Kalka, N.,
Jain, A.K.,
Duncan, J.A.,
Allen, K.,
Cheney, J.,
Grother, P.,
IARPA Janus Benchmark-B Face Dataset,
Biometrics17(592-600)
IEEE DOI
Dataset, Faces. Benchmark testing, Face, Face detection, Face recognition, Media,
Protocols, Videos
See also
IARPA Janus Benchmark A (IJB-A) dataset.
Kemelmacher-Shlizerman, I.,
Seitz, S.M.,
Miller, D.,
Brossard, E.,
The MegaFace Benchmark: 1 Million Faces for Recognition at Scale,
CVPR16(4873-4882)
IEEE DOI
Dataset, Face Recognition.
Guo, Y.D.[Yan-Dong],
Zhang, L.[Lei],
Hu, Y.X.[Yu-Xiao],
He, X.D.[Xiao-Dong],
Gao, J.F.[Jian-Feng],
MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition,
ECCV16(III: 87-102).
Springer DOI
Dataset, Face Recognition.
WWW Link.
Klare, B.F.[Brendan F.],
Klein, B.[Ben],
Taborsky, E.[Emma],
Blanton, A.[Austin],
Cheney, J.[Jordan],
Allen, K.[Kristen],
Grother, P.[Patrick],
Mah, A.[Alan],
Burge, M.[Mark],
Jain, A.K.[Anil K.],
Pushing the frontiers of unconstrained face detection and
recognition: IARPA Janus Benchmark A,
CVPR15(1931-1939)
IEEE DOI
Dataset, Face Recognition.
McDuff, D.J.[Daniel J.],
el Kaliouby, R.[Rana],
Senechal, T.[Thibaud],
Amr, M.[May],
Cohn, J.F.[Jeffrey F.],
Picard, R.W.[Rosalind W.],
Affectiva-MIT Facial Expression Dataset (AM-FED):
Naturalistic and Spontaneous Facial Expressions Collected 'In-the-Wild',
AMFG13(881-888)
IEEE DOI
Dataset, Facial Expressions. Facial expressions;dataset
Toderici, G.[George],
Evangelopoulos, G.[Georgios],
Fang, T.H.[Tian-Hong],
Theoharis, T.[Theoharis],
Kakadiaris, I.A.[Ioannis A.],
UHDB11 Database for 3D-2D Face Recognition,
PSIVT13(73-86).
Springer DOI
Dataset, Faces.
Colombo, A.[Alessandro],
Cusano, C.[Claudio],
Schettini, R.[Raimondo],
UMB-DB: A database of partially occluded 3D faces,
BenchFace11(2113-2119).
IEEE DOI
Dataset, Faces.
Somanath, G.[Gowri],
Rohith, M.V.,
Kambhamettu, C.[Chandra],
VADANA: A dense dataset for facial image analysis,
BenchFace11(2175-2182).
IEEE DOI
Dataset, Faces.
Özcan, M.[Mert],
Jie, L.[Luo],
Ferrari, V.[Vittorio],
Caputo, B.[Barbara],
A Large-Scale Database of Images and Captions for Automatic Face Naming,
BMVC11(xx-yy).
HTML Version.
Dataset, Faces.
Gupta, S.[Shalini],
Castleman, K.R.[Kenneth R.],
Markey, M.K.[Mia K.],
Bovik, A.C.[Alan C.],
Texas 3D Face Recognition Database,
Southwest10(97-100).
IEEE DOI
Dataset, Faces.
Bastanfard, A.[Azam],
Nik, M.A.[Melika Abbasian],
Dehshibi, M.M.[Mohammad Mahdi],
Iranian Face Database with age, pose and expression,
ICMV07(50-55).
IEEE DOI
Dataset, Faces.
Denes, L.J.,
Metes, P.,
Liu, Y.,
Hyperspectral Face Database,
CMU-RI-TR-02-25, October, 2002.
WWW Link.
Dataset, Faces.
Kärkkäinen, K.[Kimmo],
Joo, J.[Jungseock],
FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age
for Bias Measurement and Mitigation,
WACV21(1547-1557)
IEEE DOI
Dataset, Face Recognition.
WWW Link. Training, Social networking (online),
Computational modeling, Multimedia Web sites, Decision making, Media
Face Recogniton Home Page,
Online2006.
WWW Link.
Code, Face Recognition.
Dataset, Faces. Listing of research groups, databases, and vendors.
Face Detection Home Page,
Online2007.
WWW Link.
Code, Face Detection.
Dataset, Faces. Listing of research groups, databases, and vendors.
BioID Face Database,
2006.
Dataset, Faces.
WWW Link.
See also
HumanScan, BioID.
Mian, A.S.[Ajmal S.],
Bennamoun, M.[Mohammed],
Owens, R.A.[Robyn A.],
Three-Dimensional Model-Based Object Recognition and Segmentation in
Cluttered Scenes,
PAMI(28), No. 10, October 2006, pp. 1584-1601.
IEEE DOI
Dataset, 3-D Data.
HTML Version. And
HTML Version.
Earlier:
3D Recognition and Segmentation of Objects in Cluttered Scenes,
WACV05(I: 8-13).
IEEE DOI
And:
Region-based Matching for Robust 3D Face Recognition,
BMVC05(xx-yy).
HTML Version.
And:
Matching Tensors for Pose Invariant Automatic 3D Face Recognition,
SafeSecur05(III: 120-120).
IEEE DOI
Earlier:
Performance analysis of an improved tensor based correspondence
algorithm for automatic 3d modeling,
ICIP04(III: 1951-1954).
IEEE DOI
And:
Matching Tensors for Automatic Correspondence and Registration,
ECCV04(Vol II: 495-505).
Springer DOI
Model range data with tensors. Match stored tensor representations.
Min, R.[Rui],
Kose, N.,
Dugelay, J.L.,
KinectFaceDB: A Kinect Database for Face Recognition,
SMCS(44), No. 11, November 2014, pp. 1534-1548.
IEEE DOI
Dataset, Faces, 3-D. face recognition
Equinox: Human Identification at a Distance,
HID. 2006.
IR images available.
Face Recognition.
HTML Version.
Dataset, Faces.
See also
Equinox Corporation.
Moschoglou, S.,
Papaioannou, A.,
Sagonas, C.,
Deng, J.K.[Jian-Kang],
Kotsia, I.,
Zafeiriou, S.P.[Stefanos P.],
AgeDB: The First Manually Collected, In-the-Wild Age Database,
FaceWild17(1997-2005)
IEEE DOI
Dataset, Face Age. Databases, Estimation, Face, Face recognition,
Machine learning, Protocols
Yu, J.H.[Jian-Hui],
Zhu, H.[Hao],
Jiang, L.M.[Li-Ming],
Loy, C.C.[Chen Change],
Cai, W.D.[Wei-Dong],
Wu, W.[Wayne],
CelebV-Text: A Large-Scale Facial Text-Video Dataset,
CVPR23(14805-14814)
IEEE DOI
Dataset, Facial Features.
Zhu, H.[Hao],
Wu, W.[Wayne],
Zhu, W.T.[Wen-Tao],
Jiang, L.M.[Li-Ming],
Tang, S.W.[Si-Wei],
Zhang, L.[Li],
Liu, Z.W.[Zi-Wei],
Loy, C.C.[Chen Change],
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset,
ECCV22(VII:650-667).
Springer DOI
Dataset, Facial Features.
Jalal, A.[Ahsan],
Tariq, U.[Usman],
The LFW-Gender Dataset,
CV4AC16(III: 531-540).
Springer DOI
Dataset, Gender.
Dago-Casas, P.[Pablo],
Gonzalez-Jimenez, D.[Daniel],
Yu, L.L.[Long Long],
Alba-Castro, J.L.[Jose Luis],
Single- and cross- database benchmarks for gender classification under
unconstrained settings,
BenchFace11(2152-2159).
IEEE DOI
Dataset, Faces.
MAFL: Multi-Attribute Facial Landmark,
2014.
HTML Version.
Dataset, Facial Landmarks.
See also
Learning Deep Representation for Face Alignment with Auxiliary Attributes.
Yang, S.[Shuo],
Luo, P.[Ping],
Loy, C.C.[Chen Change],
Tang, X.[Xiaoou],
Faceness-Net: Face Detection through Deep Facial Part Responses,
PAMI(40), No. 8, August 2018, pp. 1845-1859.
IEEE DOI
Detectors, Face, Face detection, Mouth, Neural networks, Proposals,
Training, Face detection, convolutional neural network, deep learning
Earlier:
WIDER FACE: A Face Detection Benchmark,
CVPR16(5525-5533)
IEEE DOI
Dataset, Face Detection.
Earlier:
From Facial Parts Responses to Face Detection:
A Deep Learning Approach,
ICCV15(3676-3684)
IEEE DOI
Detectors; Face; Face detection; Hair; Mouth; Nose; Proposals
Kostinger, M.[Martin],
Wohlhart, P.[Paul],
Roth, P.M.[Peter M.],
Bischof, H.[Horst],
Annotated Facial Landmarks in the Wild:
A large-scale, real-world database for facial landmark localization,
BenchFace11(2144-2151).
IEEE DOI
Dataset, Faces, Features.
Schneiderman, H.[Henry],
Kanade, T.[Takeo],
A Statistical Method for 3D Object Detection Applied to Faces and Cars,
CVPR00(I: 746-751).
IEEE DOI
And:
A Histogram-based Method for Detection of Faces and Cars,
ICIP00(Vol III: 504-507).
IEEE DOI
And:
Frontal Face Images,
WWW Link.
Dataset, Faces. Combined CMU MIT face dataset.
CMU Profile Face Images,
2000.
HTML Version.
Dataset, Faces.
Frejlichowski, D.[Dariusz],
Tyszkiewicz, N.[Natalia],
The West Pomeranian University of Technology Ear Database:
A Tool for Testing Biometric Algorithms,
ICIAR10(II: 227-234).
Springer DOI
Dataset, Biometrics.
O'Toole, A.J.[Alice J.],
Harms, J.[Joshua],
Snow, S.L.[Sarah L.],
Hurst, D.R.[Dawn R.],
Pappas, M.R.[Matthew R.],
Ayyad, J.H.[Janet H.],
Abdi, H.[Herve],
A Video Database of Moving Faces and People,
PAMI(27), No. 5, May 2005, pp. 812-816.
IEEE Abstract.
Dataset, Faces. Face database.
Pandey, P.[Prashant],
Tyagi, A.K.[Aayush Kumar],
Ambekar, S.[Sameer],
Prathosh, A.P.,
Unsupervised Domain Adaptation for Semantic Segmentation of NIR Images
Through Generative Latent Search,
ECCV20(VI:413-429).
Springer DOI
Dataset, Segmentation.
WWW Link.
Gu, Q.,
Wang, G.,
Chiu, M.T.,
Tai, Y.,
Tang, C.,
LADN: Local Adversarial Disentangling Network for Facial Makeup and
De-Makeup,
ICCV19(10480-10489)
IEEE DOI
Dataset, Faces.
WWW Link. face recognition, feature extraction, LADN,
local adversarial disentangling network, facial makeup, Mouth
Li, S.Z.[Stan Z.],
Yi, D.[Dong],
Lei, Z.[Zhen],
Liao, S.C.[Sheng-Cai],
The CASIA NIR-VIS 2.0 Face Database,
PBVS13(348-353)
IEEE DOI
Dataset, Face Recognition. IR dataset.
Xu, T.[Tao],
Wu, B.[Bo],
Bai, Y.Q.[Yu-Qiong],
Zhou, Y.[Yun],
RavenGaze: A Dataset for Gaze Estimation Leveraging Psychological
Experiment Through Eye Tracker,
FG23(1-6)
IEEE DOI
Dataset, Gaze Tracking. Visualization, Target tracking,
Estimation, Psychology, Gesture recognition, Visual databases
Dynamic 2D/3D Speaking Face Dataset with Synchronized Audio,
2019.
HTML Version.
Dataset, Lip Reading. Refer to:
See also
3D Visual passcode: Speech-driven 3D facial dynamics for behaviometrics.
Language Independent Lip Reading,
2007.
HTML Version.
Dataset, Lip Reading.
OuluVS database,
2009.
WWW Link.
Dataset, Lip Reading.
OnMapGaze: A new gaze dataset for map perception modeling,
2024.
WWW Link.
WWW Link.
Dataset, Gaze.
Code, Gaze. Gaze data collected during the observation of different cartographic
backgrounds used in five online map services,
EyeMouseMap,
2024.
DOI Link
Dataset, Gaze.
Dataset, Mouse Tracking. Cartographic maps.
Reference:
See also
Quantifying map user response differences between gaze and cursor activity during searching cartographic point symbols.
See also
Examining the preattentive effect on cartographic backgrounds tilizing remote mouse tracking.
He, Q.H.[Qiu-Hai],
Hong, X.P.[Xiao-Peng],
Chai, X.J.[Xiu-Juan],
Holappa, J.[Jukka],
Zhao, G.Y.[Guo-Ying],
Chen, X.L.[Xi-Lin],
Pietikäinen, M.[Matti],
OMEG: Oulu Multi-Pose Eye Gaze Dataset,
SCIA15(418-427).
Springer DOI
Dataset, Gaze.
Hadizadeh, H.,
Enriquez, M.J.,
Bajic, I.V.,
Eye-Tracking Database for a Set of Standard Video Sequences,
IP(21), No. 2, February 2012, pp. 898-903.
IEEE DOI
Dataset, Eye Tracking.
Fox, N.A.[Niall A.],
O'Mullane, B.A.[Brian A.],
Reilly, R.B.[Richard B.],
VALID:
A New Practical Audio-Visual Database, and Comparative Results,
AVBPA05(777).
Springer DOI WWW Link.
Dataset, Faces.
Sharma, P.[Prag],
Reilly, R.B.[Richard B.],
The UCD Colour Face Image Database for Face Detection,
Online1998.
WWW Link.
Dataset, Faces.
Mollahosseini, A.,
Hasani, B.,
Mahoor, M.H.,
AffectNet: A Database for Facial Expression, Valence, and Arousal
Computing in the Wild,
AffCom(10), No. 1, January 2019, pp. 18-31.
IEEE DOI
Dataset, Facial Expressions. Databases, Computational modeling, Face, Face recognition,
Affective computing, Magnetic heads,
arousal
Papaioannou, A.[Athanasios],
Gecer, B.[Baris],
Cheng, S.[Shiyang],
Chrysos, G.[Grigorios],
Deng, J.K.[Jian-Kang],
Fotiadou, E.[Eftychia],
Kampouris, C.[Christos],
Kollias, D.[Dimitrios],
Moschoglou, S.[Stylianos],
Songsri-In, K.[Kritaphat],
Ploumpis, S.[Stylianos],
Trigeorgis, G.[George],
Tzirakis, P.[Panagiotis],
Ververas, E.[Evangelos],
Zhou, Y.X.[Yu-Xiang],
Ponniah, A.[Allan],
Roussos, A.[Anastasios],
Zafeiriou, S.P.[Stefanos P.],
MimicME: A Large Scale Diverse 4D Database for Facial Expression
Analysis,
ECCV22(VIII:467-484).
Springer DOI
Dataset, Facial Expressions.
CMU Facial Expression Database,
1999
Dataset, Faces.
Dataset, Facial Expression.
HTML Version. Includes annotation.
Matuszewski, B.J.[Bogdan J.],
Quan, W.[Wei],
Shark, L.K.[Lik-Kwan],
High-resolution comprehensive 3-D dynamic database for facial
articulation analysis,
BenchFace11(2128-2135).
IEEE DOI
Dataset, Facial Expressions.
Lucey, P.[Patrick],
Cohn, J.F.[Jeffrey F.],
Prkachin, K.M.[Kenneth M.],
Solomon, P.E.[Patricia E.],
Matthews, I.[Iain],
Painful data:
The UNBC-McMaster shoulder pain expression archive database,
FG11(57-64).
IEEE DOI
Dataset, Facial Expression.
McDuff, D.J.[Daniel J.],
Amr, M.,
el Kaliouby, R.[Rana],
AM-FED+: An Extended Dataset of Naturalistic Facial Expressions
Collected in Everyday Settings,
AffCom(10), No. 1, January 2019, pp. 7-17.
IEEE DOI
Dataset, Facial Expressions. Videos, Encoding, Face recognition, Training, Lighting, Task analysis,
Databases, Facial expressions, facial action coding system,
corpora
Lyons, M.J.,
Akamatsu, S.,
Kamachi, M.,
Gyoba, J.,
Coding Facial Expressions with Gabor Wavelets,
AFGR98(200-205).
IEEE DOI
Dataset, Facial Expressions.
HTML Version. 213 images of 7 facial expressions, 10 Japanese female subjects.
Children Spontaneous Facial Expression Video Database (LIRIS-CSE),
2019.
Dataset, Facial Expressions.
WWW Link.
spontaneous / natural facial expressions of 12 children in diverse
settings with variable recording scenarios showing six universal or
prototypic emotional expressions (happiness, sadness, anger, surprise,
disgust and fear).
See also
novel database of children's spontaneous facial expressions (LIRIS-CSE), A.
Yan, W.J.[Wen-Jing],
Wu, Q.[Qi],
Liu, Y.J.[Yong-Jin],
Wang, S.J.[Su-Jing],
Fu, X.L.[Xiao-Lan],
CASME database: A dataset of spontaneous micro-expressions collected
from neutralized faces,
FG13(1-7)
IEEE DOI
Dataset, Facial Expressions. computer vision
Oulu-CASIA NIR&VIS facial expression database,
2008.
WWW Link.
Dataset, Facial Expressions. 6 typical expressions from 80 subjects.
BU-3DFE (Binghamton University 3D Facial Expression) Database,
Dataset, Facial Expressions.
HTML Version.
The AR Face Database,
1998.
HTML Version. Or:
HTML Version.
Dataset, Faces.
Sim, T.[Terence],
Baker, S.,
Bsat, M.,
The CMU Pose, Illumination, and Expression Database,
PAMI(25), No. 12, December 2003, pp. 1615-1618.
IEEE Abstract.
Dataset, Faces.
Earlier:
The CMU Pose, Illumination, and Expression (PIE) Database of Human
Faces,
AFGR02(46-51).
IEEE DOI HTML Version.
And:
CMU-RI-TR-01-02, January, 2001.
HTML Version.
PDF File.
PS File.
HTML Version.
Gross, R.[Ralph],
Matthews, I.[Iain],
Cohn, J.F.[Jeffrey F.],
Kanade, T.[Takeo],
Baker, S.[Simon],
Multi-PIE,
IVC(28), No. 5, May 2010, pp. 807-813.
Elsevier DOI
Dataset, Faces.
Earlier:
FG08(1-8).
IEEE DOI
Face database; Face recognition across pose; Face recognition across
illumination; Face recognition across expression
See also
CMU Pose, Illumination, and Expression Database, The.
Kanade, T.[Takeo],
Cohn, J.F.[Jeffrey F.],
Tian, Y.L.[Ying-Li],
Comprehensive Database for Facial Expression Analysis,
AFGR00(46-53).
IEEE DOI
Dataset, Faces.
Dataset, Expressions.
Wang, S.,
Liu, Z.,
Lv, S.,
Lv, Y.,
Wu, G.,
Peng, P.,
Chen, F.,
Wang, X.,
A Natural Visible and Infrared Facial Expression Database for
Expression Recognition and Emotion Inference,
MultMed(12), No. 7, 2010, pp. 682-691.
IEEE DOI
Dataset, Facial Expressions.
Matuszewski, B.J.[Bogdan J.],
Quan, W.[Wei],
Shark, L.K.[Lik-Kwan],
McLoughlin, A.S.[Alison S.],
Lightbody, C.E.[Catherine E.],
Emsley, H.C.A.[Hedley C.A.],
Watkins, C.L.[Caroline L.],
Hi4D-ADSIP 3-D dynamic facial articulation database,
IVC(30), No. 10, October 2012, pp. 713-727.
Elsevier DOI
Dataset, Facial Expressions. Facial articulation database; Expression recognition; Facial;
Dysfunctions; Facial expression validation
Wang, S.F.[Shang-Fei],
Liu, Z.L.[Zhi-Lei],
Wang, Z.Y.[Zhao-Yu],
Wu, G.B.[Guo-Bing],
Shen, P.J.[Pei-Jia],
He, S.[Shan],
Wang, X.[Xufa],
Analyses of a Multimodal Spontaneous Facial Expression Database,
AffCom(4), No. 1, January 2013, pp. 34-46.
IEEE DOI
Dataset, Expression Recognition.
Baveye, Y.,
Dellandrea, E.,
Chamaret, C.,
Chen, L.M.[Li-Ming],
LIRIS-ACCEDE: A Video Database for Affective Content Analysis,
AffCom(6), No. 1, January 2015, pp. 43-55.
IEEE DOI
Dataset, Affective. copyright
Kossaifi, J.[Jean],
Walecki, R.[Robert],
Panagakis, Y.[Yannis],
Shen, J.[Jie],
Schmitt, M.[Maximilian],
Ringeval, F.[Fabien],
Han, J.[Jing],
Pandit, V.[Vedhas],
Toisoul, A.[Antoine],
Schuller, B.[Björn],
Star, K.[Kam],
Hajiyev, E.[Elnar],
Pantic, M.[Maja],
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment
Research in the Wild,
PAMI(43), No. 3, March 2021, pp. 1022-1040.
IEEE DOI
Dataset, Emotion. Databases, Tools, Computational modeling,
Biological system modeling, Sensors, Affective computing,
facial action units
Zhang, Z.,
Girard, J.M.,
Wu, Y.,
Zhang, X.,
Liu, P.,
Ciftci, U.,
Canavan, S.,
Reale, M.,
Horowitz, A.,
Yang, H.,
Cohn, J.F.,
Ji, Q.,
Yin, L.,
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis,
CVPR16(3438-3446)
IEEE DOI
Dataset, Emotion.
Aubrey, A.J.[Andrew J.],
Marshall, D.[David],
Rosin, P.L.[Paul L.],
Vendeventer, J.[Jason],
Cunningham, D.W.[Douglas W.],
Wallraven, C.[Christian],
Cardiff Conversation Database (CCDb):
A Database of Natural Dyadic Conversations,
LV13(277-282)
IEEE DOI
Dataset, Facial Expressions. Conversations; Database; Facial Expressions
Lucey, P.[Patrick],
Cohn, J.F.[Jeffrey F.],
Kanade, T.[Takeo],
Saragih, J.M.[Jason M.],
Ambadar, Z.[Zara],
Matthews, I.[Iain],
The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action
unit and emotion-specified expression,
CVPR4HB10(94-101).
IEEE DOI
Dataset, Facial Expressions.
Sapinski, T.[Tomasz],
Kaminska, D.[Dorota],
Pelikant, A.[Adam],
Ozcinar, C.[Cagri],
Avots, E.[Egils],
Anbarjafari, G.[Gholamreza],
Multimodal Database of Emotional Speech, Video and Gestures,
MIPPSNA18(153-163).
Springer DOI
Dataset, Emotions.
Sneddon, I.,
McRorie, M.,
McKeown, G.,
Hanratty, J.,
The Belfast Induced Natural Emotion Database,
AffCom(3), No. 1, 2012, pp. 32-41.
IEEE DOI
Dataset, Emotions.
OMG-Emotion (One-Minute Gradual-Emotional Behavior),
2018
WWW Link.
Dataset, Emotion Recognition.
Developed for a challenge. 500+ 1 minute emotion videos.
Petridis, S.[Stavros],
Martinez, B.[Brais],
Pantic, M.[Maja],
The MAHNOB Laughter database,
IVC(31), No. 2, February 2013, pp. 186-202.
Elsevier DOI
Dataset, Laughter. Laughter; Audiovisual; Thermal; Database; Audiovisual automatic
laughter-speech discrimination
Abadi, M.K.,
Subramanian, R.,
Kia, S.M.,
Avesani, P.,
Patras, I.,
Sebe, N.,
DECAF: MEG-Based Multimodal Database for Decoding Affective
Physiological Responses,
AffCom(6), No. 3, July 2015, pp. 209-222.
IEEE DOI
Dataset, Affective Responses. Databases
Provost, E.M.[Emily Mower],
Yuan, S.G.[Shang-Guan],
Busso, C.[Carlos],
UMEME: University of Michigan Emotional McGurk Effect Data Set,
AffCom(6), No. 4, October 2015, pp. 395-409.
IEEE DOI
Dataset, Emotion Recognition. Emotion recognition
Yan, J.J.[Jing-Jie],
Wang, B.[Bei],
Liang, R.Y.[Rui-Yu],
A Novel Bimodal Emotion Database from Physiological Signals and Facial
Expression,
IEICE(E101-D), No. 7, July 2018, pp. 1976-1979.
WWW Link.
Dataset, Emotions.
Lee, J.Y.[Ji-Young],
Kim, S.R.[Seung-Ryong],
Kim, S.[Sunok],
Park, J.[Jungin],
Sohn, K.H.[Kwang-Hoon],
Context-Aware Emotion Recognition Networks,
ICCV19(10142-10151)
IEEE DOI
Dataset, Emotion Recognition.
WWW Link. emotion recognition, face recognition, feature extraction,
image fusion, neural nets, visual scene, boosting manner,
Adaptive systems
Ong, D.C.[Desmond C.],
Wu, Z.X.[Zheng-Xuan],
Tan, Z.X.[Zhi-Xuan],
Reddan, M.[Marianne],
Kahhale, I.[Isabella],
Mattek, A.[Alison],
Zaki, J.[Jamil],
Modeling Emotion in Complex Stories:
The Stanford Emotional Narratives Dataset,
AffCom(12), No. 3, July 2021, pp. 579-594.
IEEE DOI
Dataset, Emotion. Computational modeling, Hidden Markov models,
Affective computing, Biological system modeling, Videos,
emotional corpora
Vicol, P.[Paul],
Tapaswi, M.[Makarand],
Castrejón, L.[Lluís],
Fidler, S.[Sanja],
MovieGraphs:
Towards Understanding Human-Centric Situations from Videos,
CVPR18(8581-8590)
IEEE DOI
WWW Link.
Dataset, Gestures. Videos of social situations to teach robots to understand people.
Videos, Motion pictures, Semantics, Natural languages, Face,
Automobiles, Legged locomotion
Nguyen, H.[Hung],
Kotani, K.[Kazunori],
Chen, F.[Fan],
Le, B.[Bac],
A Thermal Facial Emotion Database and Its Analysis,
PSIVT13(397-408).
Springer DOI
Dataset, Facial Expression.
Liu, H.Y.[Hai-Yang],
Zhu, Z.H.[Zi-Hao],
Iwamoto, N.[Naoya],
Peng, Y.C.[Yi-Chen],
Li, Z.Q.[Zheng-Qing],
Zhou, Y.[You],
Bozkurt, E.[Elif],
Zheng, B.[Bo],
BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for
Conversational Gestures Synthesis,
ECCV22(VII:612-630).
Springer DOI
Dataset, Emotions.
Wei, H.L.[Hao-Lin],
Monaghan, D.S.[David S.],
O'Connor, N.E.[Noel E.],
Scanlon, P.[Patricia],
A New Multi-modal Dataset for Human Affect Analysis,
HBU14(42-51).
Springer DOI
Dataset, Human Affect.
Miranda-Correa, J.A.[Juan Abdon],
Abadi, M.K.[Mojtaba Khomami],
Sebe, N.[Nicu],
Patras, I.[Ioannis],
AMIGOS: A Dataset for Affect, Personality and Mood Research on
Individuals and Groups,
AffCom(12), No. 2, April 2021, pp. 479-493.
IEEE DOI
Dataset, Emotion. Videos, Databases, Mood, Physiology, Electroencephalography,
Brain modeling, Electrocardiography, Emotion classification, EEG,
affective computing
VGG Pose Datasets,
2013
Dataset, Human Pose.
HTML Version. A collection of several human pose datasets, BBC Pose, YouTube Pose,
ChaLearn Pose.
Extended BBC Pose Dataset,
2013
Dataset, Human Pose.
WWW Link. Original BBC Pose plus more. 92 Videos.
FLIC: Frames Labelled in Cinema,
2013
Dataset, Human Pose.
HTML Version.
See also
MODEC: Multimodal Decomposable Models for Human Pose Estimation.
Verma, M.,
Kumawat, S.,
Nakashima, Y.,
Raman, S.,
Yoga-82: A New Dataset for Fine-grained Classification of Human Poses,
VUHCS20(4472-4479)
IEEE DOI
Dataset, Homan Pose. Legged locomotion, Wheels, Pose estimation,
Visualization, Skeleton, Image resolution
Bourdev, L.[Lubomir], and
Malik, J.[Jitendra],
H3D Dataset,
2009.
Dataset, Humans.
WWW Link. Annotated human images.
3DHumans: Dataset for Human Body Models,
2023
Dataset, Human Shapes.
WWW Link.
he 3DHumans dataset provides around 180 meshes of people in diverse
body shapes in various garments styles and sizes.
See also
Indian Institute of Technology, Hyderabad.
Nibali, A.[Aiden],
Millward, J.[Joshua],
He, Z.[Zhen],
Morgan, S.[Stuart],
ASPset: An outdoor sports pose video dataset with 3D keypoint
annotations,
IVC(111), 2021, pp. 104196.
Elsevier DOI
Dataset, Human Pose. Markerless motion capture, Human pose estimation,
Triangulation, Camera calibration
van der Aa, N.P.,
Luo, X.,
Giezeman, G.J.,
Tan, R.T.,
Veltkamp, R.C.,
UMPM benchmark: A multi-person dataset with synchronized video and
motion capture data for evaluation of articulated human motion and
interaction,
HICV11(1264-1269).
IEEE DOI
Dataset, Human Pose.
Combettes, S.W.[Sylvain W.],
Boniol, P.[Paul],
Mazarguil, A.[Antoine],
Wang, D.P.[Dan-Ping],
Vaquero-Ramos, D.[Diego],
Chauveau, M.[Marion],
Oudre, L.[Laurent],
Vayatis, N.[Nicolas],
Vidal, P.P.[Pierre-Paul],
Roren, A.[Alexandra],
Lefèvre-Colau, M.M.[Marie-Martine],
Arm-CODA: A Data Set of Upper-limb Human Movement During Routine
Examination,
IPOL(14), 2024, pp. 1-13.
DOI Link
Dataset, Upper Body Motion.
HandNet Hand Images,
2015
Dataset, Gestures.
WWW Link.
More than 214971 images of 10 different particpants' hands captured by
a RealSense RGBD sensor performing random articulations.
Annotations include: per pixel classes, 6D fingertip pose,
heatmap. Recorded at GIP Lab, Technion.
Aristotle University of Thessaloniki UAV Gesture Dataset,
2022
WWW Link.
Dataset, Gestures. Public video dataset for gesture recognition in human-UAV/drone interaction.
AUTH UAV Gesture Dataset consists of 4930 videos (resolution: 1920 x
1080), distributed along 6 classes, at 30 frames per second. Both
indoors and outdoors settings are included, while 58 different human
subjects have been employed for filming the sequences.
Fanelli, G.,
Gall, J.,
Romsdorfer, H.,
Weise, T.,
Van Gool, L.J.,
A 3-D Audio-Visual Corpus of Affective Communication,
MultMed(12), No. 6, 2010, pp. 591-598.
IEEE DOI
Dataset, Gestures.
Molina, J.[Javier],
Pajuelo, J.A.[José A.],
Escudero-Viñolo, M.[Marcos],
Bescós, J.[Jesús],
Martínez, J.M.[José M.],
A natural and synthetic corpus for benchmarking of hand gesture
recognition systems,
MVA(25), No. 4, May 2014, pp. 943-954.
Springer DOI
Dataset, Hand Gestures.
Guyon, I.[Isabelle],
Athitsos, V.[Vassilis],
Jangyodsuk, P.[Pat],
Escalante, H.J.[Hugo Jair],
The ChaLearn gesture dataset (CGD 2011),
MVA(25), No. 8, November 2014, pp. 1929-1951.
Springer DOI
Dataset, Gesture.
Materzynska, J.,
Berger, G.,
Bax, I.,
Memisevic, R.,
The Jester Dataset: A Large-Scale Video Dataset of Human Gestures,
Hands19(2874-2882)
IEEE DOI
Dataset, Gestures. convolutional neural nets, gesture recognition,
human computer interaction, video signal processing,
deep learning
Myanganbayar, B.[Battushig],
Mata, C.[Cristina],
Dekel, G.[Gil],
Katz, B.[Boris],
Ben-Yosef, G.[Guy],
Barbu, A.[Andrei],
Partially Occluded Hands:
A Challenging New Dataset for Single-Image Hand Pose Estimation,
ACCV18(V:85-98).
Springer DOI
Dataset, Hand Pose.
WWW Link.
Bloom, V.[Victoria],
Argyriou, V.[Vasileios],
Makris, D.[Dimitrios],
Linear latent low dimensional space for online early action
recognition and prediction,
PR(72), No. 1, 2017, pp. 532-547.
Elsevier DOI
Earlier: A1, A3, A2:
G3D: A gaming action dataset and real time action recognition
evaluation framework,
CVCG12(7-12).
IEEE DOI
Dataset, Gesture Recognition. Action, recognition
Moon, G.[Gyeongsik],
Yu, S.I.[Shoou-I],
Wen, H.[He],
Shiratori, T.[Takaaki],
Lee, K.M.[Kyoung Mu],
Interhand2.6m: A Dataset and Baseline for 3d Interacting Hand Pose
Estimation from a Single RGB Image,
ECCV20(XX:548-564).
Springer DOI
Dataset, Hand Pose.
Buehler, P.[Patrick],
Everingham, M.R.[Mark R.],
Huttenlocher, D.P.[Daniel P.],
Zisserman, A.[Andrew],
Upper Body Detection and Tracking in Extended Signing Sequences,
IJCV(95), No. 2, November 2011, pp. 180-197.
WWW Link.
Earlier:
Long Term Arm and Hand Tracking for Continuous Sign Language TV
Broadcasts,
BMVC08(xx-yy).
PDF File.
PDF File. Data available.
Dataset, Sign Language.
HTML Version.
The BANCA Database,
2007.
WWW Link.
Dataset, Biometrics.
Soft-Biometric in Surveillance (SoBiS) Dataset,
2017
WWW Link.
Dataset, Biometrics. Recorded at Fraunhofer IOSB.
Ortega-Garcia, J.,
Fierrez-Aguilar, J.,
Simon, D.,
Gonzalez, J.,
Faundez-Zanuy, M.,
Espinosa, V.,
Satue, A.,
Hernaez, I.,
Igarza, J.J.,
Vivaracho, C.,
Escudero, D.,
Moro, Q.I.,
MCYT baseline corpus: a bimodal biometric database,
VISP(150), No. 6, December 2003, pp. 395-401.
IEEE Abstract.
Dataset, Biometrics.
Fierrez-Aguilar, J.[Julian],
Ortega-Garcia, J.[Javier],
Toledano, D.T.[Doroteo Torre],
Gonzalez-Rodriguez, J.[Joaquin],
Biosec baseline corpus: A multimodal biometric database,
PR(40), No. 4, April 2007, pp. 1389-1392.
Elsevier DOI
Dataset, Biometrics. Multimodal; Biometrics; Authentication; Verification; Database;
Performance; Fingerprint; Iris; Face; Voice
Ortega-Garcia, J.[Javier],
Fierrez, J.[Julian],
Alonso-Fernandez, F.[Fernando],
Galbally, J.[Javier],
Freire, M.R.[Manuel R.],
Gonzalez-Rodriguez, J.[Joaquin],
Garcia-Mateo, C.[Carmen],
Alba-Castro, J.L.[Jose-Luis],
Gonzalez-Agulla, E.[Elisardo],
Otero-Muras, E.[Enrique],
Garcia-Salicetti, S.[Sonia],
Allano, L.[Lorene],
Ly-Van, B.[Bao],
Dorizzi, B.[Bernadette],
Kittler, J.V.[Josef V.],
Bourlai, T.[Thirimachos],
Poh, N.[Norman],
Deravi, F.[Farzin],
Ng, M.N.R.[Ming N. R.],
Fairhurst, M.C.[Michael C.],
Hennebert, J.[Jean],
Humm, A.[Andreas],
Tistarelli, M.[Massimo],
Brodo, L.[Linda],
Richiardi, J.[Jonas],
Drygajlo, A.[Andrezj],
Ganster, H.[Harald],
Sukno, F.M.[Federico M.],
Pavani, S.K.[Sri-Kaushik],
Frangi, A.[Alejandro],
Akarun, L.[Lale],
Savran, A.[Arman],
The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB),
PAMI(32), No. 6, June 2010, pp. 1097-1111.
IEEE DOI
Dataset, Biometrics. Withing the Europens BioSecure framework.
600 individuals. Acquired over internet, in office, and indoor/outdoor
with portable hardware. Audio/Video data, signature, fingerprint.
Santos, G.,
Fiadeiro, P.T.,
Proença, H.,
BioHDD: a dataset for studying biometric identification on heavily
degraded data,
IET-Bio(4), No. 1, 2015, pp. 1-9.
DOI Link
Dataset, Biometrics. biometrics (access control)
Zhang, Y.H.[Yuan-Han],
Yin, Z.F.[Zhen-Fei],
Li, Y.D.[Yi-Dong],
Yin, G.J.[Guo-Jun],
Yan, J.J.[Jun-Jie],
Shao, J.[Jing],
Liu, Z.W.[Zi-Wei],
Celeba-Spoof: Large-scale Face Anti-spoofing Dataset with Rich
Annotations,
ECCV20(XII: 70-85).
Springer DOI
Dataset, Face Anti-Spoofing.
Oliveira, H.P.[Hélder P.],
Magalhães, F.[Filipe],
Two Unconstrained Biometric Databases,
ICIAR12(II: 11-19).
Springer DOI
Dataset, Biometrics.
Zafeiriou, S.P.[Stefanos P.],
Hansen, M.[Mark],
Atkinson, G.A.[Gary A.],
Argyriou, V.[Vasileios],
Petrou, M.[Maria],
Smith, M.L.[Melvyn L.],
Smith, L.N.[Lyndon N.],
The Photoface database,
Biometrics11(132-139).
IEEE DOI
Dataset, Faces.
Nizami, H.,
Adkins-Hill, J.P.,
Zhang, Y.[Yong],
Sullins, J.R.,
McCullough, C.,
Canavan, S.,
Yin, L.J.[Li-Jun],
A biometric database with rotating head videos and hand-drawn face
sketches,
BTAS09(1-6).
IEEE DOI
Dataset, Biometrics.
Li, S.Z.[Stan Z.],
Lei, Z.[Zhen],
Ao, M.[Meng],
The HFB Face Database for Heterogeneous Face Biometrics research,
OTCBVS09(1-8).
IEEE DOI
Dataset, Faces.
Martinho-Corbishley, D.,
Nixon, M.S.[Mark S.],
Carter, J.N.[John N.],
Soft Biometric Retrieval to Describe and Identify Surveillance Images,
ISBA16(xx-xx).
IEEE DOI
Dataset, Soft Biometrics. SoBiR Dataset
WWW Link.
Messer, K.,
Matas, J.G.,
Kittler, J.V.,
Luettin, J.,
Maitre, G.,
XM2VTSDB: The Extended M2VTS Database,
AVBPA99(xx-yy).
WWW Link.
Dataset, Biometrics. 4 versions of 295 subjects.
Donida Labati, R.,
Genovese, A.[Angelo],
Piuri, V.[Vincenzo],
Scotti, F.[Fabio],
Vishwakarma, S.[Sarvesh],
I-SOCIAL-DB: A labeled database of images collected from websites and
social media for Iris recognition,
IVC(105), 2021, pp. 104058.
Elsevier DOI
Dataset, Iris. Biometrics, Iris, Web images
UBIRIS database,
2007, Department of Computer Science, University of Beira Interior, Portugal.
WWW Link.
Dataset, Iris Images. The enhanced version is available only for the Iris Segmentation Contest.
241 subjects, 1877 images.
CASIA Iris Image Database,
2007, Chinese Academy of Sciences.
HTML Version.
Dataset, Iris Images. Various versions. Version 3.
60 subjects, 2400 images.
NIST ICE Iris Image Database,
2007, NIST.
WWW Link.
Dataset, Iris Images. 132 subjects, 2953 images.
For most recent info:
See also
NIST IREX, Iris Exchange Datasets. and also
See also
Iris Recognition Database.
Iris Recognition Database,
2007
HTML Version.
Dataset, Iris Images. Derived from University of Bath
See also
University of of Bath. in association with
Smart Sensors Ltd.
See also
Smart Sensors Limited. High resolution images, 20 each eye for 800 people.
Iris Recognition Database,
2009
HTML Version.
Dataset, Iris Images. ND-IRIS-0405. A superset of
ICE2005 and ICE2006 datasets.
(
See also
NIST ICE Iris Image Database. )
64,980 iris images from 712 irises of 356 human subjects.
From the Notre Dame group.
See also
University of Notre Dame. For more updates:
See also
NIST IREX, Iris Exchange Datasets.
UTIRIS: University of Tehran IRIS Image Repository,
Online2014
WWW Link.
Dataset, Iris Images.
Visible and Infrared.
NIST IREX, Iris Exchange Datasets,
2020
WWW Link.
Dataset, Iris.
See also
Iris Recognition Database.
Dobeš, M.[Michal], and
Machala, L.[Libor],
Iris Database,
Online2006
WWW Link.
Dataset, Iris Images.
The database used for:
See also
Human eye localization using the modified Hough transform.
See also
Human Eye Iris Recognition Using the Mutual Information.
Proenca, H.[Hugo],
Filipe, S.[Silvio],
Santos, R.[Ricardo],
Oliveira, J.[Joao],
Alexandre, L.A.[Luis A.],
The UBIRIS.v2: A Database of Visible Wavelength Iris Images Captured
On-the-Move and At-a-Distance,
PAMI(32), No. 8, August 2010, pp. 1529-1535.
IEEE DOI
Dataset, Iris Recognition.
WWW Link. Visible wavelength, 4-8 meters distance, people moving.
Omelina, L.[Lubos],
Goga, J.[Jozef],
Pavlovicova, J.[Jarmila],
Oravec, M.[Milos],
Jansen, B.[Bart],
A survey of iris datasets,
IVC(108), 2021, pp. 104109.
Elsevier DOI
Survey, Iris Reognition.
Dataset, Iris Recognition. Biometrics, Iris recognition, Iris datasets, Human iris
Petrovska-Delacretaz, D.,
Lelandais, S.,
Colineau, J.,
Chen, L.M.,
Dorizzi, B.,
Ardabilian, M.,
Krichen, E.,
Mellakh, M.A.,
Chaari, A.,
Guerfi, S.,
d'Hose, J.,
Ben Amor, B.[Boulbaba],
The IV2 Multimodal Biometric Database (Including Iris, 2D, 3D,
Stereoscopic, and Talking Face Data), and the IV2-2007 Evaluation
Campaign,
BTAS08(1-7).
IEEE DOI
Dataset, Iris Recognition.
Maltoni, D.[Davide],
Maio, D.[Dario],
Jain, A.K.[Anil K.],
Prabhakar, S.[Salil],
Handbook of Fingerprint Recognition,
Springer2009.
ISBN: 978-1-84882-253-5
Second Edition.
WWW Link.
Earlier:
Springer-VerlagNew York, 2003
WWW Link.
Survey, Fingerprints.
Dataset, Fingerprints. The new edition is greatly expanded. Algorithms, evaluations, sensors,
standards, security.
Buy this book: Handbook of Fingerprint Recognition
Maio, D.,
Maltoni, D.[Davide],
Cappelli, R.[Raffaele],
Wayman, J.L.,
Jain, A.K.,
FVC2000: Fingerprint Verification Competition,
PAMI(24), No. 3, March 2002, pp. 402-412.
IEEE DOI
Dataset, Fingerprints.
Earlier:
Invited Paper: FVC2000: Fingerprint Verification Competition,
ICPR00(Vol IV: No paper).
Wilson, C.L.,
Watson, C.I.,
NIST Special Database 4, Fingerprint Database,
NISTIRMarch 1992.
WWW Link.
Dataset, Fingerprints.
Wang, Q.,
Li, S.Y.,
Database of human segmented images and its application in boundary
detection,
IET-IPR(6), No. 3, 2012, pp. 222-229.
DOI Link
Dataset, Segmentation.
ADE20K Dataset,
2017.
Dataset, Segmentation.
WWW Link. Annotated data,
LHI Segmentation Dataset,
Subset of larger dataset.
Online2008
HTML Version.
Dataset, Segmentation.
See also
Lotus Hill Institute.
The PASCAL Visual Object Classes Challenge 2012,
Online2012
Dataset, Segmentation.
WWW Link.
Various PASCAL datasets for different years
See also
Pascal: Pattern Analysis, Statistical Modelling and Computational Learning.
COCO: Common Objects in Context,
Online
Dataset, Segmentation.
WWW Link.
Large-scale object detection, segmentation, and captioning dataset.
Used for ECCV 2018 challange:
HTML Version.
DIS5K,
2022
Dataset, Segmentation.
WWW Link.
5,470 high-resolution (e.g., 2K, 4K or larger) images covering
camouflaged, salient, or meticulous objects in various backgrounds.
See also
Highly Accurate Dichotomous Image Segmentation.
Barnard, K.[Kobus],
Fan, Q.F.[Quan-Fu],
Swaminathan, R.[Ranjini],
Hoogs, A.[Anthony],
Collins, R.[Roderic],
Rondot, P.[Pascale],
Kaufhold, J.[John],
Evaluation of Localized Semantics: Data, Methodology, and Experiments,
IJCV(77), No. 1-3, May 2008, pp. 199-217.
Springer DOI
Dataset, Segmentation. Dataset with hand segmentations.
WWW Link.
Qi, L.[Lu],
Kuen, J.[Jason],
Shen, T.C.[Tian-Cheng],
Gu, J.X.[Jiu-Xiang],
Li, W.B.[Wen-Bo],
Guo, W.D.[Wei-Dong],
Jia, J.Y.[Jia-Ya],
Lin, Z.[Zhe],
Yang, M.H.[Ming-Hsuan],
High Quality Entity Segmentation,
ICCV23(4024-4033)
IEEE DOI Code:
WWW Link.
Dataset, Segmentation.
Upchurch, P.[Paul],
Niu, R.[Ransen],
A Dense Material Segmentation Dataset for Indoor and Outdoor Scene
Parsing,
ECCV22(VIII:450-466).
Springer DOI
Dataset, Segmentation.
Follmann, P.[Patrick],
Böttger, T.[Tobias],
Härtinger, P.[Philipp],
König, R.[Rebecca],
Ulrich, M.[Markus],
MVTec D2S: Densely Segmented Supermarket Dataset,
ECCV18(X: 581-597).
Springer DOI
Dataset, Segmentation.
Kampel, M.[Martin],
Hanbury, A.[Allan],
Blauensteiner, P.[Philipp],
Wildenauer, H.[Horst],
Improved motion segmentation based on shadow detection,
ELCVIA(6), No. 3, December 2007, pp. 1-12.
DOI Link
Includes Test Data:
Dataset, Shadow Detection.
Kirillov, A.[Alexander],
Mintun, E.[Eric],
Ravi, N.[Nikhila],
Mao, H.Z.[Han-Zi],
Rolland, C.[Chloe],
Gustafson, L.[Laura],
Xiao, T.[Tete],
Whitehead, S.[Spencer],
Berg, A.C.[Alexander C.],
Lo, W.Y.[Wan-Yen],
Dollár, P.[Piotr],
Girshick, R.[Ross],
Segment Anything,
ICCV23(3992-4003)
IEEE DOI WWW Link.
Dataset, Segmentation.
Semantic Boundaries Dataset and Benchmark,
Online2011.
Dataset, Segmentation.
HTML Version. or:
HTML Version.
See also
Semantic contours from inverse detectors. Related to:
See also
Berkeley Segmentation Dataset and Benchmark, The.
See also
PASCAL Visual Object Classes Challenge 2012, The.
Arbelaez, P.[Pablo],
Fowlkes, C.C.[Charless C.], and
Martin, D.R.[David R.],
The Berkeley Segmentation Dataset and Benchmark,
Online2007.
Dataset, Segmentation.
Dataset, BSDS.
Code, Segmentation.
WWW Link.
The updated code and data for the earlier paper.
See also
Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics, A.
Martin, D.R.[David R.],
Fowlkes, C.C.[Charless C.],
Tal, D.[Doron],
Malik, J.[Jitendra],
A Database of Human Segmented Natural Images and its Application to
Evaluating Segmentation Algorithms and Measuring Ecological Statistics,
ICCV01(II: 416-423).
IEEE DOI
Award, Helmholtz Prize.
And:
A Database of Human Segmented Natural Images and its Application to
Evaluating Segmentation Algorithms,
PercOrg01(xx-yy).
Dataset, Human Segmentation. BSDS300 DAtaset
Multiple human segmentations and a segmentation consistency measure.
Human-human are consistent with the measure, different images are
not consistent.
Promised online availability.
1000 images with hand segmentations. Multiple hand segmentations.
Anke, B.[Bellmann],
Olaf, H.[Hellwich],
Volker, R.[Rodehorst],
Ulas, Y.[Yilmaz],
A Benchmark Dataset for Performance Evaluation of Shape-from-X
Algorithms,
ISPRS08(B3b: 67 ff).
PDF File.
Dataset, Shape from X.
Aksoy, Y.[Yagiz],
Kim, C.[Changil],
Kellnhofer, P.[Petr],
Paris, S.[Sylvain],
Elgharib, M.[Mohamed],
Pollefeys, M.[Marc],
Matusik, W.[Wojciech],
A Dataset of Flash and Ambient Illumination Pairs from the Crowd,
ECCV18(IX: 644-660).
Springer DOI
Dataset, Illumination.
Narasimhan, S.G.[Srinivasa G.],
Wang, C.[Chi],
Nayar, S.K.[Shree K.],
All the Images of an Outdoor Scene,
ECCV02(III: 148 ff.).
Springer DOI PDF File.
Dataset, Outdoor Scene. A database of the same location every hour for 5 months. Registered and
calibrated.
WWW Link. for the database.
Shi, B.X.[Bo-Xin],
Mo, Z.P.[Zhi-Peng],
Wu, Z.[Zhe],
Duan, D.L.[Ding-Long],
Yeung, S.K.[Sai-Kit],
Tan, P.[Ping],
A Benchmark Dataset and Evaluation for Non-Lambertian and
Uncalibrated Photometric Stereo,
PAMI(41), No. 2, February 2019, pp. 271-284.
IEEE DOI
Earlier: A1, A3, A2, A4, A5, A6:
CVPR16(3707-3716)
IEEE DOI
Dataset, Photometric Stereo. Lighting, Taxonomy, Benchmark testing, Shape, Brain modeling, Cameras,
Heuristic algorithms, Photometric stereo, benchmark, dataset,
uncalibrated
Recurrent Asynchronous Multimodal Networks + Events, Frames, Semantic labels, and Depth maps recorded in CARLA simulator,
2021
HTML Version.
Code, Recurrent Networks.
Code, Monocular Depth.
Dataset, Monocular Depth.
Grosse, R.[Roger],
Johnson, M.K.[Micah K.],
Adelson, E.H.[Edward H.],
Freeman, W.T.[William T.],
Ground truth dataset and baseline evaluations for intrinsic image
algorithms,
ICCV09(2335-2342).
IEEE DOI
Dataset, Shading. For shading and reflectance computations.
Scharstein, D.[Daniel],
Szeliski, R.S.[Richard S.],
A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence
Algorithms,
IJCV(47), No. 1-3, April-June 2002, pp. 7-42.
DOI Link
Code, Stereo.
Dataset, Stereo. The data sets and code are also available:
WWW Link.
Award, Everingham. for 2015
di Rita, M.,
Nascetti, A.,
Crespi, M.,
FOSS4G Date Assessment On the Isprs Optical Stereo Satellite Data: A
Benchmark for DSM Generation,
Hannover17(635-638).
DOI Link
Dataset, Stereo. benchmark dataset with several stereo data sets from space borne stereo sensors
Scharstein, D.[Daniel],
Hirschmüller, H.[Heiko],
Kitajima, Y.[York],
Krathwohl, G.[Greg],
Nešic, N.[Nera],
Wang, X.[Xi],
Westling, P.[Porter],
High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth,
GCPR14(31-42).
Springer DOI
Dataset, Stereo.
Award, GCPR.
Haeusler, R.[Ralf],
Kondermann, D.[Daniel],
Synthesizing Real World Stereo Challenges,
GCPR13(164-173).
Springer DOI
Dataset, Stereo.
Janoch, A.[Allison],
Karayev, S.[Sergey],
Jia, Y.Q.[Yang-Qing],
Barron, J.T.[Jonathan T.],
Fritz, M.[Mario],
Saenko, K.[Kate],
Darrell, T.J.[Trevor J.],
A category-level 3-D object dataset: Putting the Kinect to work,
ConDepth11(1168-1174).
IEEE DOI
Dataset, Stereo. Color and depth pairs.
Browatzki, B.[Bjorn],
Fischer, J.[Jan],
Graf, B.[Birgit],
Bulthoff, H.H.[Heinrich H.],
Wallraven, C.[Christian],
Going into depth: Evaluating 2D and 3D cues for object classification
on a new, large-scale object dataset,
ConDepth11(1189-1195).
IEEE DOI
Dataset, Stereo.
Haeusler, R.[Ralf],
Klette, R.[Reinhard],
Analysis of KITTI Data for Stereo Analysis with Stereo Confidence
Measures,
UnOptFlow12(II: 158-167).
Springer DOI
And:
Disparity Confidence Measures on Engineered and Outdoor Data,
CIARP12(624-631).
Springer DOI
Earlier:
Benchmarking Stereo Data (Not the Matching Algorithms),
DAGM10(383-392).
Springer DOI
Dataset, Stereo.
Janowski, A.,
Sawicki, P.,
Szulwic, J.,
Internet database for photogrammetric close range applications,
IEVM06(xx-yy).
PDF File.
Dataset, Photogrammetry.
CVLab dense multi-view stereo image database,
2010
HTML Version.
Dataset, Stereo.
Multiple views, ground level, of buildings
IS-3D: Data,
2008.
HTML Version.
Dataset, Stereo. Multiple views of various structures.
Shao, S.[Shuai],
Li, Z.M.[Ze-Ming],
Zhang, T.Y.[Tian-Yuan],
Peng, C.[Chao],
Yu, G.[Gang],
Zhang, X.Y.[Xiang-Yu],
Li, J.[Jing],
Sun, J.[Jian],
Objects365: A Large-Scale, High-Quality Dataset for Object Detection,
ICCV19(8429-8438)
IEEE DOI
Dataset, Object Detection. feature extraction, image annotation, image classification,
image segmentation, learning (artificial intelligence), Clocks
Yogamani, S.,
Hughes, C.,
Horgan, J.,
Sistu, G.,
Chennupati, S.,
Uricar, M.,
Milz, S.,
Simon, M.,
Amende, K.,
Witt, C.,
Rashed, H.,
Nayak, S.,
Mansoor, S.,
Varley, P.,
Perrotton, X.,
Odea, D.,
Pérez, P.,
WoodScape: A Multi-Task, Multi-Camera Fisheye Dataset for Autonomous
Driving,
ICCV19(9307-9317)
IEEE DOI WWW Link.
Dataset, Autonomous Driving. automotive electronics, cameras,
driver information systems, image annotation, Nonlinear distortion
DOTA: A Large-Scale Benchmark and Challenges for Object Detection in Aerial Images,
Online2021
WWW Link.
Dataset, Aerial Objects.
2806 aerial images obtained from different sensors and platforms,
including 15 classification categories
(vehicle, track, storange tanks, sports fields, etc.)
TGRS-HRRSD-Dataset: High Resolution Remote Sensing Detection (HRRSD),
Online2017
WWW Link.
Dataset, Aerial Objects.
21,761 images. in 13 categories.
See also
Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection.
Wang, Q.[Qi],
Zhu, G.K.[Guo-Kang],
Yuan, Y.[Yuan],
Multi-spectral dataset and its application in saliency detection,
CVIU(117), No. 12, 2013, pp. 1748-1754.
Elsevier DOI
Dataset, Infrared. RGB+near infrared.
Multi-spectral
Gauglitz, S.[Steffen],
Höllerer, T.[Tobias],
Turk, M.A.[Matthew A.],
Evaluation of Interest Point Detectors and Feature Descriptors for
Visual Tracking,
IJCV(94), No. 3, September 2011, pp. 335-360.
WWW Link.
WWW Link.
Dataset, Tracking. Present a dataset with ground truth for evaluation. And evaluation
of camera tracking.
Balntas, V.[Vassileios],
Lenc, K.[Karel],
Vedaldi, A.[Andrea],
Tuytelaars, T.[Tinne],
Matas, J.G.[Jiri G.],
Mikolajczyk, K.[Krystian],
H-Patches: A Benchmark and Evaluation of Handcrafted and Learned
Local Descriptors,
PAMI(42), No. 11, November 2020, pp. 2825-2841.
IEEE DOI
Earlier: A1, A2, A3, A6, Only:
CVPR17(3852-3861)
IEEE DOI
Dataset, Local Descriptors. HPatches dataset.
Benchmark testing, Detectors, Protocols, Task analysis,
Feature extraction, Training, Image matching, Local features,
patch classification.
Feature extraction, Protocols, Size, measurement
CUReT: Columbia-Utrecht Reflectance and Texture Database,
2006.
Dataset, Texture.
WWW Link.
MIT Texture Data,
1995.
Dataset, Texture.
HTML Version.
Texture Data,
2006.
Dataset, Texture.
WWW Link.
Outex: New framework for empirical evaluation of
texture analysis algorithms,
2006.
Dataset, Texture.
WWW Link.
Texure Image Data,
2006.
Dataset, Texture.
WWW Link. A variety of texture datasets. Includes Brodatz.
The KTH-TIPS and KTH-TIPS2 image databases,
2006.
Dataset, Texture.
WWW Link. Textures under varying illumination, pose and scale.
Extension of:
See also
CUReT: Columbia-Utrecht Reflectance and Texture Database.
TILDA: Textile Texture Database,
1996.
Dataset, Texture.
WWW Link.
Describable Textures Dataset (DTD),
2014
Dataset, Texture.
WWW Link.
See also
Describing Textures in the Wild.
Hossain, S.[Shahera],
Serikawa, S.[Seiichi],
Texture databases: A comprehensive survey,
PRL(34), No. 15, 2013, pp. 2007-2022.
Elsevier DOI
Dataset, Texture.
Survey, Texture Datasets. Texture.
Xue, J.[Jia],
Wadekar, P.[Paras],
Zhang, H.[Hang],
Teran, L.[Leizer],
Dana, K.[Kristin],
Nishino, K.[Ko],
Ground Terrain Database, GTOS,
Online2017.
HTML Version.
Dataset, Texture.
Lee, S.K.[Seung-Kyu],
Liu, Y.X.[Yan-Xi],
PSU Near-Regular Texture Database,
OnlinePSU, 2005.
WWW Link.
Dataset, Texture.
Total found: 702