Databases or Datasets for Computer Vision Applications and Testing

Generally, to avoid confusion, in this bibliography, the word database is used for database systems or research and would apply to image database query techniques rather than a database containing images for use in specific applications. I have chosen to use dataset to describe collections of images used by researchers in some domain. In the past test data was difficult, but the advent of modern digital cameras has simplified acquiring data. But in order to test and especially compare algorithms, a common dataset is essential.

Test data is available in bits and pieces and in several larger repositories, These listed datasets are selected from the references in the Computer Vision Bibliography. There are other datasets and often older ones get removed from web sites. The links on the Author and Journal references in the list point to entries in that database. Current research and applications are highlighted in various Computer Vision and Image Processing Conferences. Some of these have evaluation sessions with related datasets.

Computer Vision resources include:

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis


Detailed Entries for Dataset

Khosla, A.[Aditya], Raju, A.S.[Akhil S.], Torralba, A.B.[Antonio B.], Oliva, A.[Aude],
Understanding and Predicting Image Memorability at a Large Scale,
ICCV15(2390-2398)
IEEE DOI
Dataset, Memorability.
WWW Link. Benchmark testing


Berga, D., Vidal, X.R.F., Otazu, X., Pardo, X.M.,
SID4VAM: A Benchmark Dataset With Synthetic Images for Visual Attention Modeling,
ICCV19(8788-8797)
IEEE DOI
Dataset, Gaze Tracking. gaze tracking, learning (artificial intelligence), neural nets, SID4VAM, visual attention modeling, saliency metrics, Benchmark testing


Barnard, K.[Kobus], and Funt, B.V.[Brian V.],
Camera characterization for color research,
ColorRes(27), No. 3, 2002, pp. 153-164.
PDF File. Dataset, Color Calibration.
WWW Link.


Huang, X.Y.[Xin-Yu], Wang, P.[Peng], Cheng, X.J.[Xin-Jing], Zhou, D.F.[Ding-Fu], Geng, Q.C.[Qi-Chuan], Yang, R.G.[Rui-Gang],
The ApolloScape Open Dataset for Autonomous Driving and Its Application,
PAMI(42), No. 10, October 2020, pp. 2702-2719.
IEEE DOI
Dataset, Autonomous Driving. Semantics, Task analysis, Videos, Labeling, Image segmentation, 3D understanding


Yu, F.[Fisher], Chen, H.F.[Hao-Feng], Wang, X.[Xin], Xian, W.Q.[Wen-Qi], Chen, Y.Y.[Ying-Ying], Liu, F.C.[Fang-Chen], Madhavan, V.[Vashisht], Darrell, T.J.[Trevor J.],
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning,
CVPR20(2633-2642)
IEEE DOI
WWW Link. Dataset, Road Scenes. Task analysis, Visualization, Roads, Image segmentation, Meteorology, Training, Benchmark testing


DDD17: End-To-End DAVIS Driving Dataset,
2017
WWW Link. Dataset, Road Scenes. Over 12 h of a 346x260 pixel DAVIS sensor recording highway and city driving in daytime, evening, night, dry and wet weather.


Waymo Open Dataset,
2020
WWW Link. Dataset, Road Scenes. high-resolution sensor data collected by autonomous vehicles operated by the Waymo Driver in a wide variety of situations.


UZH FPV Drone Racing Dataset 2.0,
2024 WWW Link.
Dataset, Visual Odometry. Dataset, SLAM. The dataset comprises dozens of real-world sequences where a quadrotor controlled in first-person view (FPV) by a professional pilot has been flown both indoors and outdoors. Each sequence contains images, IMU, and events (from an event-based camera) recorded on-board, as well as ground truth from a robotic total station or motion capture system.


The ROad event Awareness Dataset for Autonomous Driving (ROAD),
2021
WWW Link. Dataset, Autonomous Driving. It contains 22 long-duration videos (ca 8 minutes each), ideal for continual learning research, annotated in terms of road events, defined as triplets E = (Agent, Action, Location) and represented as tubes, i.e., a series of frame-wise bounding box detections. ROAD is a large, high-quality multi-label benchmark, with 122K labelled video frames comprising 560K detection bounding boxes associated with 1.7M unique individual labels (560K agent labels, 640K action labels and 499K location labels).


DSEC: A Stereo Event Camera Dataset for Driving Scenarios,
2021.
HTML Version. CVPR 2021 competition dataset. Dataset, Stereo. Dataset, Driving.
Stereo Event Camera large-scale dataset for challenging driving scenarios! DSEC features over 400GB of data including stereo VGA Prophesee event cameras, stereo RGB cameras, Velodyne lidar, and RTK-GPS, recorded in challenging high-dynamic-range, day and night, sunrise and sunset, urban and Swiss-mountain driving scenarios.


Singh, G.[Gurkirt], Akrigg, S.[Stephen], di Maio, M.[Manuele], Fontana, V.[Valentina], Alitappeh, R.J.[Reza Javanmard], Khan, S.[Salman], Saha, S.[Suman], Jeddisaravi, K.[Kossar], Yousefi, F.[Farzad], Culley, J.[Jacob], Nicholson, T.[Tom], Omokeowa, J.[Jordan], Grazioso, S.[Stanislao], Bradley, A.[Andrew], di Gironimo, G.[Giuseppe], Cuzzolin, F.[Fabio],
ROAD: The Road Event Awareness Dataset for Autonomous Driving,
PAMI(45), No. 1, January 2023, pp. 1036-1054.
IEEE DOI
Dataset, Autonomous Driving. Roads, Autonomous vehicles, Task analysis, Videos, Benchmark testing, Decision making, Vehicle dynamics, Autonomous driving, decision making


Li, L.[Li], Ismail, K.N.[Khalid N.], Shum, H.P.H.[Hubert P. H.], Breckon, T.P.[Toby P.],
DurLAR: A High-Fidelity 128-Channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-Modal Autonomous Driving Applications,
3DV21(1227-1237)
IEEE DOI
Dataset, Autonomous Driving. Reflectivity, Laser radar, Image resolution, Supervised learning, Estimation, Benchmark testing, autonomous driving, dataset, three dimensional


Yogamani, S., Hughes, C., Horgan, J., Sistu, G., Chennupati, S., Uricar, M., Milz, S., Simon, M., Amende, K., Witt, C., Rashed, H., Nayak, S., Mansoor, S., Varley, P., Perrotton, X., Odea, D., Pérez, P.,
WoodScape: A Multi-Task, Multi-Camera Fisheye Dataset for Autonomous Driving,
ICCV19(9307-9317)
IEEE DOI
WWW Link.
Dataset, Autonomous Driving. automotive electronics, cameras, driver information systems, image annotation, Nonlinear distortion


Zendel, O.[Oliver], Honauer, K.[Katrin], Murschitz, M.[Markus], Steininger, D.[Daniel], Domínguez, G.F.[Gustavo Fernández],
WildDash: Creating Hazard-Aware Benchmarks,
ECCV18(VI: 407-421).
Springer DOI
Dataset, Highway Hazards. Driving hazards.


Sakaridis, C.[Christos], Dai, D.X.[Deng-Xin], Van Gool, L.J.[Luc J.],
Semantic Foggy Scene Understanding with Synthetic Data,
IJCV(126), No. 9, September 2018, pp. 973-992.
Springer DOI

And:
ACDC: The Adverse Conditions Dataset with Correspondences for Semantic Driving Scene Understanding,
ICCV21(10745-10755)
IEEE DOI
Dataset, Haze. Not just dehazing, actually understand the scene. Training, Image segmentation, Visualization, Rain, Snow, Semantics, Datasets and evaluation, Scene analysis and understanding, Vision for robotics and autonomous vehicles


Dev Roy, S., Kanti Bhowmik, M., Oakley, J.,
A Ground Truth Annotated Video Dataset for Moving Object Detection in Degraded Atmospheric Outdoor Scenes,
ICIP18(1318-1322)
IEEE DOI
Dataset, Object Detection. Object detection, Lighting, Meteorology, Cameras, Image restoration, Streaming media, Atmospheric measurements, Image Enhancement


Zhang, Y.J.[Yu-Jun], Zhu, L.[Lei], Feng, W.[Wei], Fu, H.Z.[Hua-Zhu], Wang, M.Q.[Ming-Qian], Li, Q.X.[Qing-Xia], Li, C.[Cheng], Wang, S.[Song],
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection,
ICCV21(15661-15670)
IEEE DOI
Dataset, Lane Detection. Performance evaluation, Codes, Lane detection, Annotations, Object segmentation, Streaming media, grouping and shape


Codevilla, F., Santana, E., Lopez, A., Gaidon, A.,
Exploring the Limitations of Behavior Cloning for Autonomous Driving,
ICCV19(9328-9337)
IEEE DOI
Dataset, Driver Behavior.
WWW Link. behavioural sciences computing, learning (artificial intelligence), neural nets, Vehicle dynamics


Lee, G.H.[Gim Hee], Achtelik, M., Fraundorfer, F., Pollefeys, M., Siegwart, R.,
A benchmarking tool for MAV visual pose estimation,
ICARCV10(1541-1546).
IEEE DOI
Dataset, SLAM. Large scale SLAM dataset with more sensors. For UAV algorithm evaluations.


Ji, R.R.[Rong-Rong], Duan, L.Y.[Ling-Yu], Chen, J.[Jie], Yang, S.[Shuang], Huang, T.J.[Tie-Jun], Yao, H.X.[Hong-Xun], Gao, W.[Wen],
PKUBench: A context rich mobile visual search benchmark,
ICIP11(2545-2548).
IEEE DOI
Dataset, Landmarks. Landmark search aided by GPS.


Li, N.[Ning], Zhao, Y.Q.[Yong-Qiang], Pan, Q.[Quan], Kong, S.G.[Seong G.], Chan, J.C.W.[Jonathan Cheung-Wai],
Full-time Monocular Road Detection Using Zero-distribution Prior of Angle of Polarization,
ECCV20(XXV:457-473).
Springer DOI
Dataset, Road Detection.
WWW Link.


Winkens, C., Sattler, F., Adams, V., Paulus, D.,
HyKo: A Spectral Dataset for Scene Understanding,
CVRoads17(254-261)
IEEE DOI
Dataset, Roads. Autonomous vehicles, Cameras, Hypercubes, Hyperspectral imaging, Image color analysis, Sensors


Schmidt, A.[Adam], Fularz, M.[Michal], Kraft, M.[Marek], Kasinski, A.[Andrzej], Nowicki, M.[Michal],
An Indoor RGB-D Dataset for the Evaluation of Robot Navigation Algorithms,
ACIVS13(321-329).
Springer DOI
Dataset, Navigation.


Swedish Trafic Signs,
Online2010
WWW Link. Dataset, Traffic Signs.


Challenging Unreal and Real Environments for Traffic Sign Detection and Recognition,
Online2017 CURE-TSD and CURE-TSR
WWW Link.
WWW Link. Dataset, Traffic Signs. Dataset, CURE-TSR. Dataset, CURE-TSD. Real-world and synthesized video sequences with challenging conditions. In total, there are 5,733 video sequences and around 1.72 million frames.


CMU VASC Image Database,
Online1997
WWW Link. Dataset, Motion. CMU has a collection of image datasets available. These include a number of motion sequences, stereo (with and without ground truth), faces and expressions, and cars.


PEIPA Computer Vision Software,
Online2004.
HTML Version. Code, Computer Vision. Dataset. Pilot European Image Processing Archive. This lists a number of sources for various alogrithms. They also include pointers to the usual set of image databases.


BBC Motion Gallery,
Video data. Online2004
WWW Link. Video clips, including rights managed and production ready royalty-free footage. Available to preview, purchase and download. Dataset, Retrieval. Dataset, Video.


Large Scale Dataset for Cross-Model Multimedia Analysis,
2013.
HTML Version. Dataset, Image Retrieval. Dataset, Text Retrieval.
See also Large Scale Video Database.


Shirahatti, N.V.[Nikhil V.], Barnard, K.[Kobus],
Evaluating Image Retrieval,
CVPR05(I: 955-961).
IEEE DOI
HTML Version.
Code, Image Retrieval. Dataset, Image Retrieval.


Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.A.,
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary,
ECCV02(IV: 97 ff.). Award, ECCV, Cognitive Vision.
Springer DOI
HTML Version.
Dataset, Object Recognition.


Murray, N.[Naila], Marchesotti, L.[Luca], Perronnin, F.[Florent],
Learning to rank images using semantic and aesthetic labels,
BMVC12(110).
DOI Link

Earlier:
AVA: A large-scale database for aesthetic visual analysis,
CVPR12(2408-2415).
IEEE DOI
Dataset, Aesthetic Analysis.


Johnson, J.[Justin], Hariharan, B.[Bharath], van der Maaten, L.[Laurens], Hoffman, J., Fei-Fei, L.[Li], Zitnick, C.L.[C. Lawrence], Girshick, R.[Ross],
Inferring and Executing Programs for Visual Reasoning,
ICCV17(3008-3017)
IEEE DOI

Earlier: A1, A2, A3, A5, A6, A7, Only:
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning,
CVPR17(1988-1997)
IEEE DOI
Dataset, Visual Reasoning.
WWW Link. backpropagation, image matching, learning (artificial intelligence), neural nets, Visualization. Cognition, Image color analysis, Metals, Semantics, Shape.


Visual7W visual question answering,
Large-scale visual question answering (QA) dataset, with object-level groundings and multimodal answers. WWW Link.
Dataset, Visual Question Answering.


Visual Genome,
Visual Genome is a dataset, a knowledge base, an ongoing effort to connect structured image concepts to language. WWW Link.

WWW Link. Dataset, Visual Question Answering.


Mathew, M.[Minesh], Karatzas, D.[Dimosthenis], Jawahar, C.V.,
DocVQA: A Dataset for VQA on Document Images,
WACV21(2199-2208)
IEEE DOI
WWW Link.
Dataset, Visual Q-A. Visualization, Text analysis, Image recognition, Image analysis, Layout


UT Zappos50K,
Dataset, Shoes.
WWW Link.
University of Texas shoe dataset. 50,025 images.


Xiao, J.X.[Jian-Xiong], Hays, J.[James], Ehinger, K.A.[Krista A.], Oliva, A.[Aude], Torralba, A.B.[Antonio B.],
SUN database: Large-scale scene recognition from abbey to zoo,
JEP:HPP(36), No. 6, 2010, pp. 1430-1442.
And: CVPR10(3485-3492).
IEEE DOI
Dataset, Recognition.
WWW Link. 131067 images, 908 categories, objects and object categories.


Xiao, J.X.[Jian-Xiong], Ehinger, K.A.[Krista A.], Hays, J.[James], Torralba, A.B.[Antonio B.], Oliva, A.[Aude],
SUN Database: Exploring a Large Collection of Scene Categories,
IJCV(119), No. 1, August 2016, pp. 3-22.
Springer DOI
Dataset, Object Recognition.
WWW Link.


Le Cun, Y.L.[Yann L.], Huang, F.J.[Fu Jie], Bottou, L.[Leon],
Learning methods for generic object recognition with invariance to pose and lighting,
CVPR04(II: 97-104).
IEEE DOI And:
PDF File.
WWW Link.
Dataset, Objects. Real time implementation. Find generic objects.


Blandfort, P.[Philipp], Karayil, T.[Tushar], Hees, J.[Jörn], Dengel, A.[Andreas],
The Focus-Aspect-Value model for predicting subjective visual attributes,
MultInfoRetr(9), No. 1, March 2020, pp. 47-60.
Springer DOI
Dataset, Retrieval.
WWW Link.


Philbin, J.[James], Chum, O.[Ondrej], Isard, M.[Michael], Sivic, J.[Josef], Zisserman, A.[Andrew],
Lost in quantization: Improving particular object retrieval in large scale image databases,
CVPR08(1-8).
IEEE DOI
HTML Version.
Dataset, Objects.


Chum, O.[Ondrej], Philbin, J.[James], Sivic, J.[Josef], Isard, M.[Michael], Zisserman, A.[Andrew],
Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval,
ICCV07(1-8).
IEEE DOI

And: A2, A1, A4, A3, A5:
Object retrieval with large vocabularies and fast spatial matching,
CVPR07(1-8).
IEEE DOI
HTML Version.
Dataset, Buildings. Award, Longuet-Higgins. (after 10 years).


Bell, S.[Sean], Upchurch, P.[Paul], Snavely, N.[Noah], Bala, K.[Kavita],
Material recognition in the wild with the Materials in Context Database,
CVPR15(3479-3487)
IEEE DOI

And:
MINC Dataset,
WWW Link. Dataset, Materials.


Large Scale Video Database,
2012.
WWW Link. Dataset, Video Database.
This database consists of 156,823 videos sequences (2,907,447 keyframes), which were crawled from YouTube during the period of July 2010 to September 2010. We provide the features as well as the ground truth.
See also Multiple feature hashing for real-time large scale near-duplicate video retrieval.
See also Large Scale Dataset for Cross-Model Multimedia Analysis.


MA14KD: Movie Attraction 14K Dataset,

WWW Link. Dataset, Visual Attractiveness. MA14KD provides a set of "Attractiveness" features extracted from 14000 movie and TV series trailers. The movie IDs are in agreement with the movie IDs provided by a rating dataset, that contains millions of ratings and thousands of tags.


Xu, J.[Jun], Mei, T.[Tao], Yao, T.[Ting], Rui, Y.[Yong],
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language,
CVPR16(5288-5296)
IEEE DOI
Dataset, Video Analysis.
See also MSR VTT Dataset.


Li, Y.C.[Yun-Cheng], Song, Y.[Yale], Cao, L.L.[Liang-Liang], Tetreault, J.[Joel], Goldberg, L.[Larry], Jaimes, A.[Alejandro], Luo, J.B.[Jie-Bo],
TGIF: A New Dataset and Benchmark on Animated GIF Description,
CVPR16(4641-4650)
IEEE DOI
WWW Link. Dataset, Animations.


Liu, J.Z.[Jing-Zhou], Chen, W.[Wenhu], Cheng, Y.[Yu], Gan, Z.[Zhe], Yu, L.C.[Li-Cheng], Yang, Y.M.[Yi-Ming], Liu, J.J.[Jing-Jing],
Violin: A Large-Scale Dataset for Video-and-Language Inference,
CVPR20(10897-10907)
IEEE DOI
Dataset, Video. Task analysis, Visualization, Cognition, Natural languages, TV, Motion pictures, Benchmark testing


Huang, Q.Q.[Qing-Qiu], Xiong, Y.[Yu], Rao, A.[Anyi], Wang, J.Z.[Jia-Ze], Lin, D.H.[Da-Hua],
Movienet: A Holistic Dataset for Movie Understanding,
ECCV20(IV:709-727).
Springer DOI
Dataset, Movie Understanding.
WWW Link.


Deep Video Understanding Dataset,
2020, used for workshops, and challenges. WWW Link.
Dataset, Video Understanding.


Bakker, E.M.[Erwin M.],
Open and free datasets for multimedia retrieval,
MultInfoRetr(5), No. 3, September 2016, pp. 135-136.
WWW Link.
Dataset, Multimedia Retrieval.


Khan, M., Chakareski, J.,
NJIT 6DOF VR Navigation Dataset,
2020.
WWW Link. Dataset, Virtual Reality. 6DOF (six degrees of freedom) virtual reality (VR) navigation data comprising spatial position (x,y,z) and head orientation (rotation angles yaw, pitch, and roll) of mobile VR users navigating a VR environment in an indoor arena.


Animals with Attributes 2 Dataset,
2017 Dataset, Animals.
WWW Link. Reference:
See also Zero-Shot Learning: The Good, the Bad and the Ugly. Note the earlier AWA dataset has been removed due to copyright issues and replaces with this version.


Cat Dataset,
2013 Dataset, Cats.
WWW Link. 9000 cat images with annotations.


Nilsback, M.E.[Maria-Elena], Zisserman, A.[Andrew],
Automated Flower Classification over a Large Number of Classes,
ICCVGIP08(722-729).
IEEE DOI
HTML Version. Dataset, Flowers.
HTML Version.


Plant Phenotyping Datasets for Computer Vision,
2016
WWW Link. Dataset, Plants. We present a collection of benchmark datasets in the context of plant phenotyping. We provide annotated imaging data and suggest suitable evaluation criteria for plant/leaf segmentation, detection, tracking as well as classification and regression problems. The figure symbolically depicts the data available together with ground truth segmentations and further annotations and metadata. Article in press.
See also Finely-grained annotated datasets for image-based plant phenotyping.


Wood image database,
2000.
WWW Link.
WWW Link. For information also see:
HTML Version. Dataset, Lumber.


Beery, S.[Sara], van Horn, G.[Grant], Perona, P.[Pietro],
Recognition in Terra Incognita,
ECCV18(XVI: 472-489).
Springer DOI
Dataset, Animals.
WWW Link.


Tropical Coral Reef Fish Detection, Tracking And Classification,
Fish4Knowledge project datasets. Online2014
WWW Link. Dataset, Fish.
See also University of Edinburgh.
See also Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data.


Swanson, A.[Alexandra], Kosmala, M.[Margaret], Lintott, C.[Chris], Simpson, R.[Robert], Smith, A.[Arfon], Packer, C.[Craig],
Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna,
ScientificData(2), June 2015, Article 150026.
DOI Link
Dataset, Animals. Covered by many news outlets. Thousands of pictures of animals from motion activated cameras planted in the Serengeti. Includes interface for people to identify, etc. A great resource for automated detection and identification.


Brookes, O.[Otto], Mirmehdi, M.[Majid], Stephens, C.[Colleen], Angedakin, S.[Samuel], Corogenes, K.[Katherine], Dowd, D.[Dervla], Dieguez, P.[Paula], Hicks, T.C.[Thurston C.], Jones, S.[Sorrel], Lee, K.[Kevin], Leinert, V.[Vera], Lapuente, J.[Juan], McCarthy, M.S.[Maureen S.], Meier, A.[Amelia], Murai, M.[Mizuki], Normand, E.[Emmanuelle], Vergnes, V.[Virginie], Wessling, E.G.[Erin G.], Wittig, R.M.[Roman M.], Langergraber, K.[Kevin], Maldonado, N.[Nuria], Yang, X.Y.[Xin-Yu], Zuberbühler, K.[Klaus], Boesch, C.[Christophe], Arandjelovic, M.[Mimi], Kühl, H.[Hjalmar], Burghardt, T.[Tilo],
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition,
IJCV(132), No. 8, August 2024, pp. 3086-3102.
Springer DOI
Dataset, Apes.


Pre-Corrective Optics Space Telescope Axial Replacement Hubble Space Telescope star-cluster dataset,
Astronomy dataset. Dataset, Astronomy.
WWW Link.


Ramanathan, S.[Subramanian], Katti, H.[Harish], Sebe, N.[Nicu], Kankanhalli, M.[Mohan], Chua, T.S.[Tat-Seng],
An Eye Fixation Database for Saliency Detection in Images,
ECCV10(IV: 30-43).
Springer DOI
Dataset, Eye Fixation.


Rivera-Rubio, J.[Jose], Idrees, S.[Saad], Alexiou, I.[Ioannis], Hadjilucas, L.[Lucas], Bharath, A.A.[Anil A.],
A dataset for Hand-Held Object Recognition,
ICIP14(5881-5885)
IEEE DOI
Dataset, Object Recognition.
And:
Small Hand-Held Object Recognition Test (SHORT),
WACV14(524-531)
IEEE DOI

Earlier:
Mobile Visual Assistive Apps: Benchmarks of Vision Algorithm Performance,
ACVR13(30-40).
Springer DOI
Computer vision Cameras


Spacenet,
2020. Research Group, Europe.
WWW Link. Accelerating Geospatial Machine Learning Dataset, Mapping.
WWW Link.


Koch, T.[Tobias], d'Angelo, P.[Pablo], Kurz, F.[Franz], Fraundorfer, F.[Friedrich], Reinartz, P.[Peter], Körner, M.[Marco],
The TUM-DLR Multimodal Earth Observation Evaluation Benchmark,
SatStreet16(698-705)
IEEE DOI
Dataset, Remote Sensing.
WWW Link. Same scene, satellite, air, UAV, smartphone.


ISPRS Benchmarks,
Online2021
WWW Link. Dataset, Urban Data. Dataset, Building Detection. Dataset, Object Detection. Dataset, Point Cloud Segmentation. Multiple datasets. Some with associated benchmarks and challenges. Includes: VAihingen/Enz, Toronto, Potsdam, UAVid, Gaofen, EuroSDR, Urban classification.
See also ISPRS: International Society for Photogrammetry and Remote Sensing.


Hong, D.F.[Dan-Feng], Hu, J.L.[Jing-Liang], Yao, J.[Jing], Chanussot, J.[Jocelyn], Zhu, X.X.[Xiao Xiang],
Multimodal remote sensing benchmark datasets for land cover classification with a shared and specific feature learning model,
PandRS(178), 2021, pp. 68-80.
Elsevier DOI
Dataset, Remote Sensing. Benchmark datasets, Classification, Feature learning, Hyperspectral, Land cover mapping, DSM, Multimodal, Specific features


Boguszewski, A.[Adrian], Batorski, D.[Dominik], Ziemba-Jankowska, N.[Natalia], Dziedzic, T.[Tomasz], Zambrzycka, A.[Anna],
LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands, Water and Roads from Aerial Imagery,
EarthVision21(1102-1110)
IEEE DOI
Dataset, Aerial Mapping. Deep learning, Image segmentation, Image resolution, Satellites, Roads, Buildings


Shermeyer, J., Hogan, D., Brown, J., van Etten, A., Weir, N., Pacifici, F., Hänsch, R., Bastidas, A., Soenen, S., Bacastow, T., Lewis, R.,
SpaceNet 6: Multi-Sensor All Weather Mapping Dataset,
EarthVision20(768-777)
IEEE DOI
Dataset, Mapping. Synthetic aperture radar, Optical sensors, Optical imaging, Adaptive optics, Optical polarization, Buildings


Chen, H.[Hao], Shi, Z.W.[Zhen-Wei],
A Spatial-Temporal Attention-Based Method and a New Dataset for Remote Sensing Image Change Detection,
RS(12), No. 10, 2020, pp. xx-yy.
DOI Link
WWW Link. Dataset, Building Changes. LEVIR-CD Dataset.


Verma, S.[Sagar], Panigrahi, A.[Akash], Gupta, S.[Siddharth],
QFabric: Multi-Task Change Detection Dataset,
EarthVision21(1052-1061)
IEEE DOI
Dataset, Change Detection. Deep learning, Urban areas, Predictive models, Benchmark testing, Metadata, Pattern recognition


Zhou, D.B.[Dong-Bo], Liu, S.J.[Shuang-Jian], Yu, J.[Jie], Li, H.[Hao],
A High-Resolution Spatial and Time-Series Labeled Unmanned Aerial Vehicle Image Dataset for Middle-Season Rice,
IJGI(9), No. 12, 2020, pp. xx-yy.
DOI Link
Dataset, Rice.


AerialWaste: a professionally curated dataset for waste detection in aerial images,
2023.
WWW Link. Dataset, Garbage. AerialWaste is a dataset for landfill detection featuring airborne, WorldView-3, and GoogleEarth images annotated by professional photo interpreters. AerialWaste contains 10,434 images generated from tiles of three different sources: AGEA Orthophotos (20 cm GSD), WorldView-3 (30 cm GSD) and GoogleEarth (50 cm GSD).


Tan, W., Qin, N., Ma, L., Li, Y., Du, J., Cai, G., Yang, K., Li, J.,
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways,
EarthVision20(797-806)
IEEE DOI
Dataset, LiDAR. Semantics, Roads, Laser radar, Sensors, Machine learning, Automobiles


Hudson, W.H., Nadadur, D.C., Thornton, K.B., Liu, X., Haralick, R.M.,
The Radius CDROM Ground Truthed Data Set,
ARPA96(511-519). Dataset. Ground truth buildings for other users.


ISPRS benchmark on urban object detection and 3D building reconstruction,
2013
HTML Version. Dataset, Building Detection. Provide state-of-the-art data sets which can be used by interested researchers in order to test own methods and algorithms on urban object classification and building reconstruction.


Teller, S.[Seth], Antone, M.[Matthew], Bodnar, Z.[Zachary], Bosse, M.[Michael], Coorg, S.[Satyan], Jethwa, M.[Manish], Master, N.[Neel],
Calibrated, Registered Images of an Extended Urban Area,
IJCV(53), No. 1, June 2003, pp. 93-107.
DOI Link
Dataset, Buildings.
Earlier: CVPR01(I:813-820).
IEEE DOI
More the dataset than how to analyze the data.
WWW Link.
See also Spherical Mosaics with Quaternions and Dense Correlation.


Meinel, G.[Gotthard], Burckhardt, M.[Manuel],
The Digital Basic Geodata Sets Hausumringe and Hauskoordinaten: Characterization and Pre-processing for Building Stock Analysis,
PFG(2013), No. 6, 2013, pp. 575-588.
DOI Link
Dataset, Buildings.


Weber, E.[Ethan], Papadopoulos, D.P.[Dim P.], Lapedriza, A.[Agata], Ofli, F.[Ferda], Imran, M.[Muhammad], Torralba, A.[Antonio],
Incidents1M: A Large-Scale Dataset of Images With Natural Disasters, Damage, and Incidents,
PAMI(45), No. 4, April 2023, pp. 4768-4781.
IEEE DOI
Dataset, Disasters. Social networking (online), Task analysis, Satellites, Computational modeling, Data models, Visualization, Training, incident detection


ISPRS Test Project on Urban Classification and 3D Building Reconstruction,
LIDAR data for building descrtiptions.
WWW Link. Dataset, Building Extraction. Used for ISPRS 3D Labeling contest.


Ye, Z.[Zhen], Xu, Y.S.[Yu-Sheng], Huang, R.[Rong], Tong, X.H.[Xiao-Hua], Li, X.[Xin], Liu, X.F.[Xiang-Feng], Luan, K.F.[Kui-Feng], Hoegner, L.[Ludwig], Stilla, U.[Uwe],
LASDU: A Large-Scale Aerial LiDAR Dataset for Semantic Labeling in Dense Urban Areas,
IJGI(9), No. 7, 2020, pp. xx-yy.
DOI Link
Dataset, LiDAR.


Gao, W.X.[Wei-Xiao], Nan, L.L.[Liang-Liang], Boom, B.J.[Bas J.], Ledoux, H.[Hugo],
SUM: A benchmark dataset of Semantic Urban Meshes,
PandRS(179), 2021, pp. 108-120.
Elsevier DOI
Dataset, Urban Data. Texture meshes, Urban scene understanding, Mesh annotation, Semantic segmentation, Over-segmentation, Benchmark dataset Helsinki.


Cruz, S.[Steve], Hutchcroft, W.[Will], Li, Y.G.[Yu-Guang], Khosravan, N.[Naji], Boyadzhiev, I.[Ivaylo], Kang, S.B.[Sing Bing],
Zillow Indoor Dataset: Annotated Floor Plans With 360° Panoramas and 3D Room Layouts,
CVPR21(2133-2143)
IEEE DOI
Dataset, Floor Plans. Annotations, Layout, Urban areas, Semantics, Estimation


NYU Depth Dataset V2,
HTML Version. Dataset, RGBD. Dataset, Indoor Scenes.
See also Indoor Segmentation and Support Inference from RGBD Images.


Hu, Q.Y.[Qing-Yong], Yang, B.[Bo], Khalid, S.[Sheikh], Xiao, W.[Wen], Trigoni, N.[Niki], Markham, A.[Andrew],
SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds,
IJCV(130), No. 2, February 2022, pp. 316-343.
Springer DOI
WWW Link. Dataset, Urban. Urban scale point cloud


Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., Schiele, B.,
The Cityscapes Dataset for Semantic Urban Scene Understanding,
CVPR16(3213-3223)
IEEE DOI
WWW Link.
WWW Link.
WWW Link.
WWW Link.
Dataset, City Models.


Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.,
The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes,
CVPR16(3234-3243)
IEEE DOI
Dataset, City Models.


Ma, Y.C.[Yan-Chun], Liu, Y.J.[Yong-Jian], Xie, Q.[Qing], Xiong, S.W.[Sheng-Wu], Bai, L.H.[Li-Hua], Hu, A.[Anshu],
A Tibetan Thangka data set and relative tasks,
IVC(108), 2021, pp. 104125.
Elsevier DOI
Dataset, Tibetan Culture. Chomo Yarlung Tibet version 1. Image data set, Thangka data set, Tibetan culture, Semantic content analysis, Image processing


CyArk,
3-D, Laser data collection and archiving.
WWW Link. Vendor, Cultural Heritage. Dataset, Cultural Heritage. Digiatal Archive of the world's heritage sites for preservation and education. Not a vendor as such, but an archive and group that will collect the data. Some are small, some huge. Used for sites being destroyed, or for reconstruction. Visualizations, etc.


Weir, N., Lindenbaum, D., Bastidas, A., Etten, A., Kumar, V., Mcpherson, S., Shermeyer, J., Tang, H.,
SpaceNet MVOI: A Multi-View Overhead Imagery Dataset,
ICCV19(992-1001)
IEEE DOI
Dataset, Stereo. feature extraction, image classification, image colour analysis, image resolution, image segmentation


UCF-ARG,
Online2012
WWW Link. Dataset, Surveillance.

Earlier: UCF Aerial Action Dataset,
WWW Link. A:Aerial Camera, R: Roof top camera, G: Ground camera. 3 views of different actions. The aerial subset


Jha, S.S.[Sudhanshu Shekhar], Nidamanuri, R.R.[Rama Rao],
Gudalur Spectral Target Detection (GST-D): A New Benchmark Dataset and Engineered Material Target Detection in Multi-Platform Remote Sensing Data,
RS(12), No. 13, 2020, pp. xx-yy.
DOI Link
Dataset, Targets. Target detection, or sparsely distributed materials.


Xia, G.S.[Gui-Song], Bai, X.[Xiang], Ding, J.[Jian], Zhu, Z.[Zhen], Belongie, S.[Serge], Luo, J.B.[Jie-Bo], Datcu, M.[Mihai], Pelillo, M.[Marcello], Zhang, L.P.[Liang-Pei],
DOTA: A Large-Scale Dataset for Object Detection in Aerial Images,
CVPR18(3974-3983)
IEEE DOI
Dataset, Vehicle Detection.
WWW Link. Object detection, Earth, Sports, Sensors, Marine vehicles, Image sensors


Matzen, K.[Kevin], Snavely, N.[Noah],
NYC3DCars: A Dataset of 3D Vehicles in Geographic Context,
ICCV13(761-768)
IEEE DOI
Dataset, Vehicles. 3D models; geography; object detection; structure from motion


Zhang, T.W.[Tian-Wen], Zhang, X.L.[Xiao-Ling], Li, J.W.[Jian-Wei], Xu, X.W.[Xiao-Wo], Wang, B.Y.[Bao-You], Zhan, X.[Xu], Xu, Y.Q.[Yan-Qin], Ke, X.[Xiao], Zeng, T.J.[Tian-Jiao], Su, H.[Hao], Ahmad, I.[Israr], Pan, D.[Dece], Liu, C.[Chang], Zhou, Y.[Yue], Shi, J.[Jun], Wei, S.[Shunjun],
SAR Ship Detection Dataset (SSDD): Official Release and Comprehensive Data Analysis,
RS(13), No. 18, 2021, pp. xx-yy.
DOI Link
WWW Link. Dataset, Ships.


Lei, S.L.[Song-Lin], Lu, D.D.[Dong-Dong], Qiu, X.L.[Xiao-Lan], Ding, C.[Chibiao],
SRSDD-v1.0: A High-Resolution SAR Rotation Ship Detection Dataset,
RS(13), No. 24, 2021, pp. xx-yy.
DOI Link
Dataset, Ship Detection.


Boat Detection,
Online2019
HTML Version. Dataset, Ships.
WWW Link. Public video dataset for boat detection/tracking from UAV video footage
See also MULTIDRONE.
See also Racing Bicycle Detection/Tracking from UAV Footage, UAV Detection.


Di, Y.H.[Yang-Hua], Jiang, Z.G.[Zhi-Guo], Zhang, H.[Haopeng],
A Public Dataset for Fine-Grained Ship Classification in Optical Remote Sensing Images,
RS(13), No. 4, 2021, pp. xx-yy.
DOI Link
Dataset, Ships.


Liu, Z.Y.[Zhao-Ying], Waqas, M.[Muhammad], Yang, J.[Jia], Rashid, A.[Ahmar], Han, Z.[Zhu],
A Multi-Task CNN for Maritime Target Detection,
SPLetters(28), 2021, pp. 434-438.
IEEE DOI
Dataset, Ship Detection. MaRine ShiP (MRSP-13) Dataset. Marine vehicles, Task analysis, Object detection, Image segmentation, Boats, Feature extraction, Annotations, cross-layer connections


He, B.[Boyong], Li, X.J.[Xian-Jiang], Huang, B.[Bo], Gu, E.[Enhui], Guo, W.J.[Wei-Jie], Wu, L.[Liaoni],
UnityShip: A Large-Scale Synthetic Dataset for Ship Recognition in Aerial Images,
RS(13), No. 24, 2021, pp. xx-yy.
DOI Link
Dataset, Ship Detection.


Gundogdu, E.[Erhan], Solmaz, B.[Berkan], Yücesoy, V.[Veysel], Koç, A.[Aykut],
MARVEL: A Large-Scale Image Dataset for Maritime Vessels,
ACCV16(V: 165-180).
Springer DOI
Dataset, Ships.


Melzi, P.[Pietro], Rodriguez-Albala, J.M.[Juan Manuel], Morales, A.[Aythami], Tolosana, R.[Ruben], Fierrez, J.[Julian], Vera-Rodriguez, R.[Ruben],
Fishing Gear Classification from Vessel Trajectories and Velocity Profiles: Database and Benchmark,
IbPRIA23(629-638).
Springer DOI
Dataset, Ship Tracking.
WWW Link. To detect illegal fishing.


Mostajabi, M.[Mohammadreza], Wang, C.M.[Ching Ming], Ranjan, D.[Darsh], Hsyu, G.[Gilbert],
High Resolution Radar Dataset for Semi-Supervised Learning of Dynamic Objects,
PBVS20(450-457)
IEEE DOI
Dataset, Radar. Synthetic aperture radar, Radar imaging, Spaceborne radar, Image resolution, Apertures, Azimuth


Japanese Character Image Database,
Cedar (Buffalo) database. Dataset, OCR.
WWW Link. Test data for Japanese OCR.


Wang, D.H.[Da-Han], Liu, C.L.[Cheng-Lin], Yu, J.L.[Jin-Lun], Zhou, X.D.[Xiang-Dong],
CASIA-OLHWDB1: A Database of Online Handwritten Chinese Characters,
ICDAR09(1206-1210).
IEEE DOI
Dataset, OCR.


Zhou, S.[Shusen], Chen, Q.C.[Qing-Cai], Wang, X.L.[Xiao-Long], Guo, X.[Xinyi], Li, H.[Hui],
An Empirical Evaluation on HIT-OR3C Database,
ICDAR11(1150-1154).
IEEE DOI
Dataset, OCR. Handwriting Chinese character database (HIT-OR3C)


Yan, H.Y.[Han-Yu], Jin, L.W.[Lian-Wen], Viard-Gaudin, C.[Christian], Mouchere, H.[Harold],
SCUT-COUCH Textline_NU: An Unconstrained Online Handwritten Chinese Text Lines Dataset,
FHR10(581-586).
IEEE DOI
Dataset, Chinese Characters.


Zhang, H.G.[Hong-Gang], Guo, J.[Jun], Chen, G.[Guang], Li, C.G.[Chun-Guang],
HCL2000: A Large-scale Handwritten Chinese Character Database for Handwritten Character Recognition,
ICDAR09(286-290).
IEEE DOI
Dataset, OCR.


Liu, C.L.[Cheng-Lin], Yin, F.[Fei], Wang, D.H.[Da-Han], Wang, Q.F.[Qiu-Feng],
Online and offline handwritten Chinese character recognition: Benchmarking on new databases,
PR(46), No. 1, January 2013, pp. 155-162.
Elsevier DOI

Earlier:
CASIA Online and Offline Chinese Handwriting Databases,
ICDAR11(37-41).
IEEE DOI
Dataset, OCR. Handwritten Chinese character recognition; Online; Offline; Databases; Benchmarking
See also Touching Character Database from Chinese Handwriting for Assessing Segmentation Algorithms, A.


Hull, J.J.,
A Database for Handwritten Text Recognition Research,
PAMI(16), No. 5, May 1994, pp. 550-554.
IEEE DOI Dataset, Handwriting. Handwriting Database.


Ground Truthed Handwritten Word Images,
Cambridge University dataset. Dataset, Handwriting.
HTML Version.


On-line Handwriting Database,
Tokyo Univ. of Agri. & Tech., Nakagawa Laboratory. Dataset, Handwriting.
WWW Link.


Shivram, A., Ramaiah, C., Setlur, S., Govindaraju, V.,
IBM_UB_1: A Dual Mode Unconstrained English Handwriting Dataset,
ICDAR13(13-17)
IEEE DOI
Dataset, OCR. handwriting recognition


Ben Abdelghani, I.A.[Imen Abroug], Ben Amara, N.E.[Najoua Essoukri],
SID Signature Database: A Tunisian Off-line Handwritten Signature Database,
EAHSP13(131-139).
Springer DOI
Dataset, Signatures.


Kleber, F., Fiel, S., Diem, M., Sablatnig, R.,
CVL-DataBase: An Off-Line Database for Writer Retrieval, Writer Identification and Word Spotting,
ICDAR13(560-564)
IEEE DOI
Dataset, Writer Identification. XML


The Street View House Numbers (SVHN) Dataset ,
2011 WWW Link.
Dataset, House Numbers. 600,000 digit images.


USPS Office of Advanced Technology Database of Handwritten Cities, States, ZIP Codes, Digits, and Alphabetic Characters,
Cedar (Buffalo) database. Dataset, Handwriting.
WWW Link. Database for mail processing.


Dimauro, G., Impedovo, S., Modugno, R., Pirlo, G.,
A new database for research on bank-check processing,
FHR02(524-528).
IEEE Top Reference.
Dataset, Checks.


Ma, L.L., Liu, J.[Ji], Wu, J.,
A new database for online handwritten Mongolian word recognition,
ICPR16(1131-1136)
IEEE DOI
Dataset, Mongolian Characters. Character recognition, Databases, Handwriting recognition, Layout, Sampling methods, Training, Writing, CNN, MRG-OHMW, annotation, evaluation, online, handwritten, Mongolian, word, recognition


Ali, H.[Hazrat],
UHaT: Urdu handwritten text dataset,
2020
WWW Link. Dataset, Urdu. Urdu handwritten characters and digits.


Das, N.[Nibaran], Acharya, K.[Kallol], Sarkar, R.[Ram], Basu, S.[Subhadip], Kundu, M.[Mahantapas], Nasipuri, M.[Mita],
A benchmark image database of isolated Bangla handwritten compound characters,
IJDAR(17), No. 4, December 2014, pp. 413-431.
Springer DOI
WWW Link.
Dataset, Bangla.


Sarkar, R.[Ram], Das, N.[Nibaran], Basu, S.[Subhadip], Kundu, M.[Mahantapas], Nasipuri, M.[Mita], Basu, D.K.[Dipak Kumar],
CMATERdb1: A database of unconstrained handwritten Bangla and Bangla-English mixed script document image,
IJDAR(15), No. 1, March 2012, pp. 71-83.
WWW Link.
Dataset, Bangla.


Nethravathi, B., Archana, C.P., Shashikiran, K., Ramakrishnan, A.G., Kumar, V.[Vijay],
Creation of a Huge Annotated Database for Tamil and Kannada OHR,
FHR10(415-420).
IEEE DOI
Dataset, OCR.


Sagheer, M.W.[Malik Waqas], He, C.L.[Chun Lei], Nobile, N.[Nicola], Suen, C.Y.[Ching Y.],
A New Large Urdu Database for Off-Line Handwriting Recognition,
CIAP09(538-546).
Springer DOI
Dataset, Urdu Handwriting.


ERIM Arabic Document Database,
Machine printed Arabic documents. Dataset, OCR. Dataset, Arabic.
HTML Version.


Mahmoud, S.A.[Sabri A.], Ahmad, I.[Irfan], Al-Khatib, W.G.[Wasfi G.], Alshayeb, M.[Mohammad], Parvez, M.T.[Mohammad Tanvir], Märgner, V.[Volker], Fink, G.A.[Gernot A.],
KHATT: An open Arabic offline handwritten text database,
PR(47), No. 3, 2014, pp. 1096-1112.
Elsevier DOI
Dataset, Arabic Text. Arabic handwritten text database


Mahmoud, S.A.[Sabri A.], Ahmad, I.[Irfan], Alshayeb, M.[Mohammad], Al-Khatib, W.G.[Wasfi G.], Parvez, M.T.[Mohammad Tanvir], Fink, G.A.[Gernot A.], Margner, V.[Volker], El Abed, H.[Haikal],
KHATT: Arabic Offline Handwritten Text Database,
FHR12(449-454).
IEEE DOI
Dataset, Handwritting, Arabic.


Lamghari, N.[Nidal], Raghay, S.[Said],
DBAHCL: database for Arabic handwritten characters and ligatures,
MultInfoRetr(6), No. 3, September 2017, pp. 263-269.
Springer DOI
Dataset, Arabic Characters.


Al Maadeed, S.[Somaya], Ayouby, W.[Wael], Hassaine, A.[Abdelaali], Aljaam, J.M.[Jihad Mohamad],
QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification,
FHR12(746-751).
IEEE DOI
Dataset, Arabic.


Soleimani, A., Fouladi, K., Araabi, B.N.,
UTSig: A Persian offline signature dataset,
IET-Bio(6), No. 1, 2017, pp. 1-8.
DOI Link
Dataset, Persian. handwriting recognition


Ziaratban, M.[Majid], Faez, K.[Karim], Bagheri, F.[Fatemeh],
FHT: An Unconstraint Farsi Handwritten Text Database,
ICDAR09(281-285).
IEEE DOI
Dataset, OCR.


Haghighi, P.J.[Puntis Jifroodian], Nobile, N.[Nicola], He, C.L.[Chun Lei], Suen, C.Y.[Ching Y.],
A New Large-Scale Multi-purpose Handwritten Farsi Database,
ICIAR09(278-286).
Springer DOI
Dataset, Farsi Handwriting.


NIST OCR Databases,
2005.
WWW Link. Dataset, OCR. Dataset, Documents. A series of datasets for OCR and document analysis.


Sauvola, J., Kauniskangas, H.,
Media Team Document Database II,
Online1999.
WWW Link. Dataset, Document Analysis.


Todoran, L.[Leon], Worring, M.[Marcel], Smeulders, A.W.M.[Arnold W. M.],
The UvA color document dataset,
IJDAR(7), No. 4, September 2005, pp. 228-240.
Springer DOI
Dataset, Documents.
Earlier:
Data GroundTruth, Complexity, and Evaluation Measures for Color Document Analysis,
DAS02(519 ff.).
Springer DOI


Bukhari, S.S.[Syed Saqib], Shafait, F.[Faisal], Breuel, T.M.[Thomas M.],
The IUPR Dataset of Camera-Captured Document Images,
CBDAR11(164-171).
Springer DOI
Dataset, Document Images.


Nagy, R.[Robert], Dicker, A.[Anders], Meyer-Wegener, K.[Klaus],
NEOCR: A Configurable Dataset for Natural Image Text Recognition,
CBDAR11(150-163).
Springer DOI
Dataset, Natural Image Text.


Ibrahim, A.[Ahmed], Abbott, A.L.[A. Lynn], Hussein, M.E.[Mohamed E.],
An Image Dataset of Text Patches in Everyday Scenes,
ISVC16(II: 291-300).
Springer DOI
Dataset, Scene Text.


Ikica, A.[Andrej], Peer, P.[Peter],
Computer Vision Lab OCR DataBase: CVL OCR DB,
2011. A public annotated image database of text in natural scenes
WWW Link. Dataset, Text in Images.


Guerin, C., Rigaud, C., Mercier, A., Ammar-Boudjelal, F., Bertet, K., Bouju, A., Burie, J.C., Louis, G., Ogier, J.M., Revel, A.,
eBDtheque: A Representative Database of Comics,
ICDAR13(1145-1149)
IEEE DOI
Dataset, Comics. entertainment


Schölch, L.[Lukas], Steinhäuser, J.[Jonas], Beichter, M.[Maximilian], Seibold, C.[Constantin], Yang, K.L.[Kai-Lun], Knaeble, M.[Merlin], Schwarz, T.[Thorsten], Maedche, A.[Alexander], Stiefelhagen, R.[Rainer],
Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data,
ICPR22(1607-1613)
IEEE DOI
Dataset, Graphics.
WWW Link. Training, Measurement, Visualization, Annotations, Supervised learning, Static VAr compensators, Solids


Quiniou, S.[Solen], Mouchere, H.[Harold], Saldarriaga, S.P.[Sebastián Pen], Viard-Gaudin, C.[Christian], Morin, E.[Emmanuel], Petitrenaud, S.[Simon], Medjkoune, S.[Sofiane],
HAMEX: A Handwritten and Audio Dataset of Mathematical Expressions,
ICDAR11(452-456).
IEEE DOI
Dataset, OCR.


Stria, J.[Jan], Bresler, M.[Martin], Prua, D.[Daniel], Hlavác, V.[Vaclav],
MfrDB: Database of Annotated On-Line Mathematical Formulae,
FHR12(542-547).
IEEE DOI
Dataset, Formula.


FlickrLogos-32,
2013
WWW Link. Dataset, Logos. 32 logo classes, various orientations, surface shapes, etc.


UMD Logo Database,
Univ. Maryland database of 106 corportate logos. Dataset, Logos.
HTML Version.


CrisisMMD Dataset,
2017.
WWW Link. Dataset, Disasters. CrisisMMD Dataset. Multimodal Twitter dataset consists of several thousands of manually annotated tweets and images collected during seven major natural disasters including earthquakes, hurricanes, wildfires, and floods that happened in the year 2017.


Yang, Z.L.[Zhong-Liang], Wang, K.[Ke], Ma, S.[Sai], Huang, Y.F.[Yong-Feng], Kang, X.G.[Xian-Gui], Zhao, X.F.[Xian-Feng],
ISTEGO100K: Large-scale Image Steganalysis Dataset,
IWDW19(352-364).
Springer DOI
Dataset, Setganalysis.


Rocha, A.[Anderson], Goldenstein, S.K.[Siome K.], Scheirer, W.J.[Walter J.], Boult, T.E.[Terrance E.],
The Unseen Challenge data sets,
WVU08(1-8).
IEEE DOI
Dataset, Steganalysis.


Bai, W.M.[Wei-Ming], Zhang, Z.P.[Zhi-Peng], Li, B.[Bing], Wang, P.[Pei], Li, Y.X.[Yang-Xi], Zhang, C.X.[Cong-Xuan], Hu, W.M.[Wei-Ming],
Robust Texture-Aware Computer-Generated Image Forensic: Benchmark and Algorithm,
IP(30), 2021, pp. 8439-8453.
IEEE DOI
Dataset, Image Forensics.
WWW Link. Ddistinguish computer generated from photographic images. Benchmark testing, Feature extraction, Image forensics, Task analysis, computer-generated images forensic


Unipen Project,
Online1994. Dataset, Handwriting.
WWW Link. This is a working group organized through IAPR to maintain and protect (ensure available to researchers) various databases of handwriting data.


Njah, S.[Sourour], Ben Nouma, B.[Badreddine], Bezine, H.[Hala], Alimi, A.M.[Adel M.],
MAYASTROUN: A Multilanguage Handwriting Database,
FHR12(308-312).
IEEE DOI
Dataset, Handwriting.


Pérez, D.[Daniel], Tarazón, L.[Lionel], Serrano, N.[Nicolás], Castro, F.[Francisco], Terrades, O.R.[Oriol Ramos], Juan, A.[Alfons],
The GERMANA Database,
ICDAR09(301-305).
IEEE DOI
Dataset, OCR. Handwritten Spanish manuscript from 1891.


Shi, Z.H.[Zheng-Hao],
Sand Dust Image DAta,
March 26, 2020. Pictures in sandstorms.
DOI Link
WWW Link. Dataset, Sandstorm.


Aytekin, Ç., Nikkanen, J., Gabbouj, M.,
A Data Set for Camera-Independent Color Constancy,
IP(27), No. 2, February 2018, pp. 530-544.
IEEE DOI
Dataset, Color Constancy. Cameras, Image color analysis, Lighting, Reflectivity, Robustness, Sensitivity, Training, Color constancy, color shading, platform independence


Barnard, K.[Kobus], Martin, L.[Lindsay], Funt, B.V.[Brian V.], and Coath, A.[Adam],
A Data Set for Colour Research,
ColorRes(27), No 3, 2002, pp. 147-151.
HTML Version. Dataset, Color Constancy.
HTML Version.


Soundararajan, P.[Padmanabhan], Sarkar, S.[Sudeep],
An in-depth study of graph partitioning measures for perceptual organization,
PAMI(25), No. 6, June 2003, pp. 642-660.
IEEE Abstract.
Evaluation, Segmentation.
WWW Link. Code, Perceptual Grouping. Dataset, Perceptual Grouping.
Earlier:
Empirical evaluation of graph partitioning measures for perceptual organization,
EEMCV01(xx-yy).
Quality of groups generated by minimum (
See also Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation, An. ) or average (
See also Supervised Learning of Large Perceptual Organization: Graph Spectral Partitioning and Learning Automata. ) or normalized (
See also Normalized Cuts and Image Segmentation. ) cuts are equivalent for recognition.


Wang, T.C.[Ting-Chun], Zhu, J.Y.[Jun-Yan], Hiroaki, E.[Ebi], Chandraker, M.[Manmohan], Efros, A.A.[Alexei A.], Ramamoorthi, R.[Ravi],
A 4D Light-Field Dataset and CNN Architectures for Material Recognition,
ECCV16(III: 121-138).
Springer DOI
Dataset, Material Recognition.


Large Geometric Models Archive,
2008
WWW Link. Dataset, 3-D Models. Detailed 3-D models from Georgia Institute of Technology. Especially for graphics.
See also Georgia Tech.


Digne, J.[Julie], Audfray, N.[Nicolas], Lartigue, C.[Claire], Mehdi-Souzani, C.[Charyar], Morel, J.M.[Jean-Michel],
Farman Institute 3D Point Sets: High Precision 3D Data Sets,
IPOL(2011), No. 1, 2011, pp. xx-yy.
DOI Link
Dataset, 3D Data.


ISPRS Terrestrial laser scanning and 3D imaging Datasets,
2008.
HTML Version. Dataset, 3-D Data. 3-D datasets for large scale objects. Sanmarina Byzantine church and Golden Buddha.


NaturePix: Visual Cognitive Modeling Research,
2007.
WWW Link. Dataset, 3-D Data. ASU 3-D datasets. Replaces former ASU dataset?


The Stanford 3D Scanning Repository,
2007.
WWW Link. Dataset, 3-D Data. Stanford graphics databases


The Beazley Archive of Classical Art Pottery Database,
July 2013
WWW Link. Dataset, Pottery.


Oliva, A.[Aude], Torralba, A.B.[Antonio B.],
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope,
IJCV(42), No. 3, May-June 2001, pp. 145-175.
DOI Link
WWW Link.
Dataset, Outdoor Secens.
Earlier:
Scene-Centered Description from Spatial Envelope Properties,
BMCV02(263 ff.).
Springer DOI
Otherwise known as OSR dataset. Spatial envelope: low dimensional representation of the secen. Perceptual dimensions to represent the dominat satial structure.


Memotion Dataset 7k,
2019.
WWW Link. Dataset, Sentinment. Memotion Dataset. Dataset for sentiment classification of memes.


Patro, B.N.[Badri N.], Lunayach, M.[Mayank], Srivastava, D.[Deepankar], Sarvesh, S.[Sarvesh], Singh, H.[Hunar], Namboodiri, V.P.[Vinay P.],
Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms,
WACV21(576-585)
IEEE DOI
WWW Link.
Dataset, Humor. Annotations, Semantics, Bit error rate, Manuals, Task analysis


Uy, M.A., Pham, Q., Hua, B., Nguyen, T., Yeung, S.,
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data,
ICCV19(1588-1597)
IEEE DOI
Dataset, Point Cloud.
WWW Link. CAD, feature extraction, learning (artificial intelligence), neural nets, Market research


Hodan, T.[Tomáš], Haluza, P.[Pavel], Obdržálek, Š.[Štepán], Matas, J.G.[Jirí G.], Lourakis, M.[Manolis], Zabulis, X.[Xenophon],
T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-Less Objects,
WACV17(880-888)
IEEE DOI
Dataset, RBG-D.
WWW Link. (Slow response) Image color analysis, Image sensors, Pose estimation, Sensors, Solid modeling, Training


Bellmann, A.[Anke], Hellwich, O.[Olaf], Rodehorst, V.[Volker], Yilmaz, U.[Ulas],
A Benchmarking Dataset for Performance Evaluation of Automatic Surface Reconstruction Algorithms,
BenCOS07(1-8).
IEEE DOI
Dataset, Surface Reconstruction.


Lee, S.K.[Seung-Kyu], Liu, Y.X.[Yan-Xi],
Curved Glide-Reflection Symmetry Detection,
PAMI(34), No. 2, February 2012, pp. 266-278.
IEEE DOI

Earlier: CVPR09(1046-1053).
IEEE DOI
Generalize Bilateral reflection symmetry to curved glide-reflection. Leaf images. Dataset, Symmetry Images.


Alpha Matting Evaluation Website,
2009.
WWW Link. Dataset, Image Matting.
See also perceptually motivated online benchmark for image matting, A.


Peng, B.[Bo], Zhang, M.L.[Ming-Liang], Lei, J.J.[Jian-Jun], Fu, H.Z.[Hua-Zhu], Shen, H.F.[Hai-Feng], Huang, Q.M.[Qing-Ming],
RGB-D Human Matting: A Real-World Benchmark Dataset and a Baseline Method,
CirSysVideo(33), No. 8, August 2023, pp. 4041-4053.
IEEE DOI
Dataset, Matting. Task analysis, Image color analysis, Semantics, Manuals, Benchmark testing, Feature extraction, Semantic segmentation, baseline


How2 Dataset,
2019
WWW Link. Instructional videos Used in How2 Challenge at ICML 2009 Dataset, Instructional Video.


YouCook2,
2018
WWW Link. Cooking videos Dataset, Instructional Video.


Alayrac, J.B.[Jean-Baptiste], Bojanowski, P.[Piotr], Agrawal, N.[Nishant], Sivic, J.[Josef], Laptev, I.[Ivan], Lacoste-Julien, S.[Simon],
Learning from Narrated Instruction Videos,
PAMI(40), No. 9, September 2018, pp. 2194-2208.
IEEE DOI
Dataset, Instructional Video.
WWW Link.
Earlier:
Unsupervised Learning from Narrated Instruction Videos,
CVPR16(4575-4583)
IEEE DOI
Videos, Automobiles, Visualization, Tires, YouTube, Internet, Pragmatics, Step discovery, narrated instruction videos, unsupervised learning. Text and images from video for learning the steps.


Tang, Y.S.[Yan-Song], Ding, D.J.[Da-Jun], Rao, Y.M.[Yong-Ming], Zheng, Y.[Yu], Zhang, D.Y.[Dan-Yang], Zhao, L.[Lili], Lu, J.W.[Ji-Wen], Zhou, J.[Jie],
COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis,
CVPR19(1207-1216).
IEEE DOI
Dataset, Instructional Video.
WWW Link.


Miech, A., Zhukov, D., Alayrac, J., Tapaswi, M., Laptev, I., Alayrac, J.B.[Jean-Baptiste],
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips,
ICCV19(2630-2640)
IEEE DOI
WWW Link. Dataset, Instructional Video. Internet, learning (artificial intelligence), natural language processing, social networking (online), Computational modeling


Zhukov, D.[Dimitri], Alayrac, J.B.[Jean-Baptiste], Cinbis, R.G.[Ramazan Gokberk], Fouhey, D.[David], Laptev, I.[Ivan], Sivic, J.[Josef],
Cross-Task Weakly Supervised Learning From Instructional Videos,
CVPR19(3532-3540).
IEEE DOI
Dataset, Instructional Video.
WWW Link.


STVD-FC: Large-Scale TV Dataset - Fact Checking',
2023
WWW Link. Dataset, Content Analysis. Public dataset on the political content analysis and fact-checking tasks. It consists of more than 1,200 fact-checked claims that have been scraped from a fact-checking service with associated metadata.


Vidal, R.G.[Rosaura G.], Banerjee, S.[Sreya], Grm, K.[Klemen], Struc, V.[Vitomir], Scheirer, W.J.[Walter J.],
UG^2: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition,

WWW Link.

Earlier: WACV18(1597-1606)
IEEE DOI
Dataset, Image Restoration. Used for restoration challenges at CVPR. image classification, image enhancement, image restoration, learning (artificial intelligence), object detection, Visualization


Liu, X.W.[Xin-Wei], Pedersen, M.[Marius], Hardeberg, J.Y.[Jon Yngve],
CID:IQ: A New Image Quality Database,
ICISP14(193-202).
Springer DOI
Dataset, Image Quality.


Sun, W.[Wen], Zhou, F.[Fei], Liao, Q.M.[Qing-Min],
MDID: A Multiply Distorted Image Database for Image Quality Assessment,
PR(61), No. 1, 2017, pp. 153-168.
Elsevier DOI
Dataset, Image Quality. Image database


Virtanen, T., Nuutinen, M., Vaahteranoksa, M., Oittinen, P., Hakkinen, J.,
CID2013: A Database for Evaluating No-Reference Image Quality Assessment Algorithms,
IP(24), No. 1, January 2015, pp. 390-402.
IEEE DOI
Dataset, Image Quality. cameras


Gao, W.[Wei], Yuan, H.[Hang], Liao, G.[Guibiao], Guo, Z.X.[Zi-Xuan], Chen, J.N.[Jia-Ning],
PP8K: A New Dataset for 8K UHD Video Compression and Processing,
MultMedMag(30), No. 3, July 2023, pp. 100-109.
IEEE DOI
WWW Link. Dataset, Video Compression.


Lin, J.Y.[Joe Yuchieh], Song, R.[Rui], Wu, C.H.[Chi-Hao], Liu, T.J.[Tsung-Jung], Wang, H.[Haiqiang], Kuo, C.C.J.[C.C. Jay],
MCL-V: A streaming video quality assessment database,
JVCIR(30), No. 1, 2015, pp. 1-9.
Elsevier DOI
Dataset, Video Streaming. Video quality


SAVAM, Visual Salience Dataset,
Saliency dataset.
WWW Link. Dataset, Saliency.
41 scenes, eyetracker, high res, left and right stereo views. Paper reference:
See also Semiautomatic visual-attention modeling and its application to video compression.


Anaya, J.[Josue], Barbu, A.[Adrian],
RENOIR-A dataset for real low-light image noise reduction,
JVCIR(51), 2018, pp. 144-154.
Elsevier DOI
Dataset, Noise Reduction. Image denoising, Denoising dataset, Low light noise, Poisson-Gaussian noise model


The Chinese University of Hong Kong,
Computer Vision Laboratory WWW Link.
Research Group, Hong Kong. PETA: Pedestrian Attribute Recognition At Far Distance,
Dataset, Pedestrians. HTML Version.
19,000 images. Large-scale Fashion (DeepFashion) Dataset,
2016. HTML Version.
Dataset, Fashion. 800,000 fashion images. In-Shop Clothes Retrieval Database.


Lotus Hill Institute,
Imageparsing WWW Link.
Research Group, China. Dataset, Segmentation. Code, Viewing. The Imageparsing site is devoted to providing ground truth datasets and Matlab code for annotation and viewing.
See also LHI Object Datasets.
See also LHI Sports Activity Dataset.
See also LHI Segmentation Dataset.
See also LHI Surveillance Dataset.


Oxford,
Robotics. WWW Link.
Visual Geometry Group WWW Link.
Research Group, UK. Active vision, visual geometry, medical imaging, manufacturing systems, sonar, robotics. Oxford Image Examples,
Dataset. HTML Version.

See also Oxford Town Center.


Swiss Federal Institute of Technology in Zurich,
ETHComputer Vision Lab: WWW Link.
Research Group, Switzerland. Interpretation of 2D and 3D image data sets from conventional and non-conventional image sources. Photogrammetry group: WWW Link.
Aerial Image Dataset,
Dataset, Aerial Images. WWW Link.


University Jaume I,
Institute of New Imaging Technologies WWW Link.
Computer Vision Group WWW Link.
Research Group, Spain. Spectral Imaging. Spectral Imaging Data Base,
Dataset, Spectral Imaging. WWW Link.


University of Toronto,
RBCV-TRAnd Toronto WWW Link.
eyeTap Personal Imaging Lab: WWW Link.
Research Group, Canada. Open Vidia code.
See also OpenVidia. Large group. CIFAR-10 and CIFAR-100 Datasets,
Dataset, Tiny Images. HTML Version.
10 classes, 10000 images per class. Or 100 classes t00 images each.


Abel Stock,
Commercial image database WWW Link.
Dataset, Images.


California Institute of Technology,
Computational Vision Group WWW Link.
Research Group, US. Computational foundations of vision. A number of datasets are available online. CalTech 101 Objects Categories,
Dataset, Objects. HTML Version.
CalTech 256 Objects Categories,
Dataset, Objects. WWW Link.
30607 images, 256 categories. CalTech 100 Natural Scenes,
Dataset, Natural Scenes. WWW Link.
CalTech 10000 Web Faces,
Dataset, Faces. WWW Link.
CalTech Turntable Images,
Dataset, 3D Data. WWW Link.
144 calibrated viewpoints, 3 lighting variations. CalTech Archived Images,
Dataset, Images. HTML Version.
CalTech-UCSD Birds 200 2011,
CUB-200-2011 Dataset, Images. HTML Version.
Dataset, Birds. Extension of the CUB-200 dataset.


Massachusetts Institute of Technology, AI Lab,
Computer Science and Artificial Intelligence Lab CSAILAI group memo MIT AI Memoor MIT AIor MIT AIMAI Memos are shorter reports. MIT AI-TRor MIT AI TRAI Tech Reports are longer (often the thesis). Also Project MAC Technical Reports MAC-TRMost are available through: AI TR and Memo series go to 2004, then the CSAIL series. WWW Link.
CS & AI Lab Vision Research: WWW Link.
Activity, learning, medical vision, and vision interfaces. Perceptual Science Group: WWW Link.
Sensing Perception Autonomy and Robot Kinetics WWW Link.
Motion Magnification WWW Link.
Research Group, US. MIT Places Database for Scene Recognition,
Dataset, Recognition.
WWW Link. 205 scene categories, 2.5Million images. SUN 397 Database,


Ohio State University,
Signal Analysis and Machine Perception Laboratory (SAMPL) WWW Link.
Research Group, US. Broad research areas, hyper and multi-spectral, aerial images, medical images, range processing, human motion, inspection. Various datasets for 2-D and 3-D data. OSU Datasets,
Dataset, Images. HTML Version.


Princeton,
PrincetonComputer Science Department. Computer vision group. WWW Link.
Research Group, US. Human action classification. Dataset. WWW Link.
SUNRGBD: A RGB-D Scene Understanding Benchmark Suite,
Dataset, RGBD. WWW Link.
Indoor Scenes.


University of Illinois,
Urbana-Champaign Various Departments, UIUCOr IllinoisVision Lab page: WWW Link.
Quantitative Light Imaging (QLI) Laboratory WWW Link.
Research Group, US. Robotics, Textures, 3-D recognition and representation, cameras, rendering, HCI. University of Illinois Datasets,
Dataset, Texture. 25 textures, 40 samples. Dataset, Natural Scenes. 15 Categories. Dataset, Stereo Data. 9 objects, 80 images Dataset, Multi-View Data. 10 datasets, 24 images of a single object each. Dataset, Visual Hull. Dataset, Object Recognition. Birds, Butterflys, etc. Dataset, Video. WWW Link.


University of Southern California, Signal and Image Processing,
USC_SIPI WWW Link.
Research Group, US. Dataset, Images. Image processing. Some of the old standard image datasets (texture, vehicles, compression).


Marszalek, M.[Marcin], Schmid, C.[Cordelia],
Accurate Object Recognition with Shape Masks,
IJCV(97), No. 2, April 2012, pp. 191-209.
WWW Link.

Earlier:
Accurate Object Localization with Shape Masks,
CVPR07(1-8).
IEEE DOI
Dataset, People.
WWW Link. The dataset includes annotations. Derived from Graz dataset.
WWW Link.


PhotoTourism, Matching Challenge Dataset,
2020. Dataset, Matching.
WWW Link. PhotoTourism dataset. Large baseline matching.


Yang, G.[Gehua], Stewart, C.V.[Charles V.], Sofka, M.[Michal], Tsai, C.L.[Chia-Ling],
Registration of Challenging Image Pairs: Initialization, Estimation, and Decision,
PAMI(29), No. 11, November 2007, pp. 1973-1989.
IEEE DOI
Dataset, Matching.
HTML Version.
Earlier:
Automatic robust image registration system: Initialization, estimation, and decision,
CVS06(23).
IEEE DOI


STVD-PVCD: Large-Scale TV Dataset,
2022.
WWW Link. Dataset, Video Copy Detection. Dataset, Copy Detection. STVD is a public dataset on the Partial Video Copy Detection (PVCD) task. It was constituted with about 83,000 videos of more than 10,000 hours duration and including more than 420,000 video copy pairs. It offers different test sets for fine performance characterization (frame degradation, global transformation, video speeding, etc.) with a frame level annotation for real-time detection and video alignment. Baseline comparisons are reported to show a room for improvement.
See also Large-scale TV Dataset for Partial Video Copy Detection, A.
See also University of Tours.


Zhang, J.C.[Jun-Cheng], Liao, Q.M.[Qing-Min], Liu, S.J.[Shao-Jun], Ma, H.Y.[Hao-Yu], Yang, W.M.[Wen-Ming], Xue, J.H.[Jing-Hao],
Real-MFF: A large realistic multi-focus image dataset with ground truth,
PRL(138), 2020, pp. 370-377.
Elsevier DOI
Dataset, Multi-Focus. Image fusion, Multi-focus images, Multi-focus dataset, Deep learning


Caye-Daudt, R.[Rodrigo], Le Saux, B.[Bertrand], Boulch, A.[Alexandre], Gousseau, Y.[Yann],
Onera Satellite Change Detection (OSCD) Database,
2018 Dataset, Change Detection.
WWW Link.
WWW Link.
See also Fully Convolutional Siamese Networks for Change Detection.


Goyette, N., Jodoin, P.M., Porikli, F.M., Konrad, J., Ishwar, P.,
A Novel Video Dataset for Change Detection Benchmarking,
IP(23), No. 11, November 2014, pp. 4663-4679.
IEEE DOI
Dataset, Change Detection. Adaptive optics


Wang, Y.[Yi], Jodoin, P.M.[Pierre-Marc], Porikli, F.M.[Fatih M.], Konrad, J.[Janusz], Benezeth, Y.[Yannick], Ishwar, P.[Prakash],
CDnet 2014: An Expanded Change Detection Benchmark Dataset,
CDW14(393-400)
IEEE DOI
Dataset, Change Detection.


Goyette, N.[Nil], Jodoin, P.M.[Pierre-Marc], Porikli, F.M.[Fatih M.], Konrad, J.[Janusz], Ishwar, P.[Prakash],
Changedetection.net: A new change detection benchmark dataset,
CDW12(1-8).
IEEE DOI
Dataset, Change Detection.


Walas, K.[Krzysztof], Leonardis, A.[Aleš],
UoB highly occluded object challenge (UoB-HOOC),
2016
WWW Link. Dataset, Object Detection.


Wang, Y.M.[Ya-Ming], Tan, X.[Xiao], Yang, Y.[Yi], Li, Z., Liu, X., Zhou, F., Davis, L.S.,
A Refined 3D Pose Dataset for Fine-Grained Object Categories,
R6D19(2797-2806)
IEEE DOI
Dataset, Object Recognition.
HTML Version. image segmentation, object recognition, pose estimation, statistical analysis, image segmentation networks, IoU, Fine grained objects


YCB-Video,
A large-scale video dataset for 6D object pose estimation. provides accurate 6D poses of 21 objects from the YCB dataset observed in 92 videos with 133,827 frames.
WWW Link. Dataset, Pose Estimation.


Drost, B.[Bertram], Ulrich, M.[Markus], Bergmann, P., Härtinger, P., Steger, C.T.[Carsten T.],
Introducing MVTec ITODD: A Dataset for 3D Object Recognition in Industry,
6DPose17(2200-2208)
IEEE DOI
Dataset, Object Recognition. Cameras, Engines, Gray-scale, Object detection, Sensor phenomena and characterization.


Hodan, T.[Tomáš], Michel, F.[Frank], Brachmann, E.[Eric], Kehl, W.[Wadim], Buch, A.G.[Anders Glent], Kraft, D.[Dirk], Drost, B.[Bertram], Vidal, J.[Joel], Ihrke, S.[Stephan], Zabulis, X.[Xenophon], Sahin, C.[Caner], Manhardt, F.[Fabian], Tombari, F.[Federico], Kim, T.K.[Tae-Kyun], Matas, J.G.[Jirí G.],
BOP: Benchmark for 6D Object Pose Estimation,
ECCV18(X: 19-35).
Springer DOI
Dataset, Object Pose.


Peters, G.[Gabriele], Zitova, B.[Barbara], von der Malsburg, C.[Christoph],
How to measure the pose robustness of object views,
IVC(20), No. 5-6, 15 April 2002, pp. 341-348.
Elsevier DOI
BMVC issue
And: IVC(20), No. 4, April 2002, pp. 249-256.
Elsevier DOI
HTML Version.
Dataset, 3-D Data.


Stegmann, M.B.[Mikkel B.],
Active Appearance Models,
Online2007.
WWW Link. Code, Active Appearance Model. Dataset, Active Appearance Model. AAM code and information.
See also Technical University of Denmark.


Luo, C.[Cai], Yu, L.J.[Lei-Jian], Yang, E.[Erfu], Zhou, H.Y.[Hui-Yu], Ren, P.[Peng],
A benchmark image dataset for industrial tools,
PRL(125), 2019, pp. 341-348.
Elsevier DOI
Dataset, Tools. Benchmark, Industrial tools, Image dataset


MIT 67 Indoor Dataset,
Dataset, Indoor Images.
HTML Version.
See also Recognizing indoor scenes.


Yang, K., Russakovsky, O., Deng, J.,
SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition,
ICCV19(2051-2060)
IEEE DOI
Dataset, Spatial Relations.
WWW Link. crowdsourcing, image capture, image recognition, image sampling, object recognition, SpatialSense benchmark, Genomics


Section, Multiple Entries: 13.4.6 Object Recognition, Retrieval Datasets Chapter Contents (Back)
Evaluation, Recognition. Dataset, Objects. Dataset, Retrieval.
See also Visual Question Answering, Query, VQA.
See also Object Recognition Evaluation.


The PASCAL Object Recognition Database Collection,
2006. Dataset, Objects.
HTML Version. Various datasets for object recognition. Pointers to some of the others.


Video Objects: A Test Database for Video Object Recognition,
2006. Dataset, Objects.
HTML Version. 180 videos of 15 objects.


Animals with Attributes: A dataset for Attribute Based Classification,
2006. Dataset, Objects.
WWW Link. 30,000+ images, 40 animal classes.


Image Net, ImageNet Dataset,
2014.
WWW Link. Dataset, Objects. Large set of images (or sets of datasets) for recognition. Related to ImageNet Challanges for recognition. 14Million+ images. Links to Stanford
See also Stanford University, Computer Science Departent. and Princeton.
See also Princeton.


Washington Ground Truth Image Database,
CBIR dataset. Online2004
WWW Link. Dataset. Dataset, Retrieval.


LHI Object Datasets,
Includes hand segmentations, and annotations. Online2004
HTML Version. Dataset. Dataset, Object Recognition. Transportation images, Animals, Aerial Images, Objects, Dataset also includes other data.
See also Lotus Hill Institute.


NEC Animal Dataset,
Online2009
WWW Link. Dataset. Dataset, Object Recognition. It consists of about 5000 high quality images from 60 toy animals taken at different poses against a plain background.


Xcavator.Net,
Online2007
WWW Link. Dataset, Object Recognition. Photo search for professional use. Searches stock databases, you then purchase the image for use. Part of CogniSign LLC.


The ETH-80 Dataset,
2017 Dataset, Objects.
WWW Link. The ETH-80 dataset contains visual object images from 8 different categories including apples, cars, cows,cups, dogs, horses, pears and tomatoes.
See also Covariance descriptors on a Gaussian manifold and their application to image set classification.
See also Swiss Federal Institute of Technology in Zurich.


15 Scene Dataset,
Dataset, Objects.
HTML Version. The 15 scene categories are office, kitchen, living room, bedroom, store, industrial, tall building, inside cite, street, highway, coast, open country, mountain, forest, and suburb. Images in the dataset are about 250*300 resolution, with 210 to 410 images per class.


Video Dataset Overview,
2021
WWW Link. Dataset, Overview. A good collection of Video datasets for various uses, activity, instruction, sports, etc..


Multi-Weather 4Seasons Dataset,
2021 Dataset, Driving.
WWW Link.


Vasiljevic, I., Kolkin, N., Zhang, S., Luo, R., Wang, H.,
DIODE: A Dense Indoor and Outdoor Depth Dataset,
2019 Dataset, Object Extraction.
WWW Link.


Blanco, J.L.[Jose-Luis], Moreno, F.A.[Francisco-Angel], Gonzalez, J.[Javier],
A collection of outdoor robotic datasets with centimeter-accuracy ground truth,
AutRob(27), No. 4, 2009, pp. 327.
Springer DOI
WWW Link. Dataset, SLAM. Malaga Parking


Geusebroek, J.M.[Jan-Mark], Burghouts, G.J.[Gertjan J.], Smeulders, A.W.M.[Arnold W.M.],
The Amsterdam Library of Object Images,
IJCV(61), No. 1, January 2005, pp. 103-112.
DOI Link

WWW Link. Dataset, Objects. 1000 objects over 100 images per object.


Torralba, A.B.[Antonio B.], Fergus, R.[Rob], Freeman, W.T.[William T.],
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition,
PAMI(30), No. 11, November 2008, pp. 1958-1970.
IEEE DOI
WWW Link.

And: CSAIL-TR-2007-024, 2007. Dataset, Retrieval. Images from the WWW, associated with a noun. Large comprehensive dataset. Dataset with segmentations.


Russell, B.[Bryan], Torralba, A.B.[Antonio B.], Freeman, W.T.[William T.],
LableMe: The Open Annotation Tool,
Online2010.
WWW Link.
Dataset, Retrieval. Code, Annotation. The site for the annotation tool, also the video version.


Zhou, B.[Bolei], Lapedriza, A.[Agata], Khosla, A.[Aditya], Oliva, A.[Aude], Torralba, A.B.[Antonio B.],
Places: A 10 Million Image Database for Scene Recognition,
PAMI(40), No. 6, June 2018, pp. 1452-1464.
IEEE DOI
Dataset, Retrieval. Context, Databases, Image recognition, Semantics, Sun, Training, Visualization, Scene classification, deep feature, deep learning, visual recognition


Escalante, H.J.[Hugo Jair], Hernandez, C.A.[Carlos A.], Gonzalez, J.A.[Jesus A.], Lopez-Lopez, A., Montes-y-Gomez, M.[Manuel], Morales, E.F.[Eduardo F.], Sucar, L.E.[L. Enrique], Villasenor, L.[Luis], Grubinger, M.[Michael],
The segmented and annotated IAPR TC-12 benchmark,
CVIU(114), No. 4, April 2010, pp. 419-428.
Elsevier DOI
Dataset, Retrieval. Data set creation; Ground truth collection; Evaluation metrics; Automatic image annotation; Image retrieval


Russakovsky, O.[Olga], Deng, J.[Jia], Su, H.[Hao], Krause, J.[Jonathan], Satheesh, S.[Sanjeev], Ma, S.[Sean], Huang, Z.H.[Zhi-Heng], Karpathy, A.[Andrej], Khosla, A.[Aditya], Bernstein, M.[Michael], Berg, A.C.[Alexander C.], Fei-Fei, L.[Li],
ImageNet Large Scale Visual Recognition Challenge,
IJCV(115), No. 3, December 2015, pp. 211-252.
Springer DOI
Dataset, Object Category. Object category classification and detection on hundreds of object categories and millions of images.


Loh, Y.P.[Yuen Peng], Chan, C.S.[Chee Seng],
Getting to know low-light images with the Exclusively Dark dataset,
CVIU(178), 2019, pp. 30-42.
Elsevier DOI
Dataset, Low Light.


Aizawa, K., Fujimoto, A., Otsubo, A., Ogawa, T., Matsui, Y., Tsubota, K., Ikuta, H.,
Building a Manga Dataset 'Manga109' With Annotations for Multimedia Applications,
MultMedMag(27), No. 2, April 2020, pp. 8-18.
IEEE DOI
Dataset, Manga. Machine learning, Visualization, Character recognition, Art, Machine learning algorithms, Task analysis


Kuznetsova, A.[Alina], Rom, H.[Hassan], Alldrin, N.[Neil], Uijlings, J.[Jasper], Krasin, I.[Ivan], Pont-Tuset, J.[Jordi], Kamali, S.[Shahab], Popov, S.[Stefan], Malloci, M.[Matteo], Kolesnikov, A.[Alexander], Duerig, T.[Tom], Ferrari, V.[Vittorio],
The Open Images Dataset V4,
IJCV(128), No. 7, July 2020, pp. 1956-1981.
Springer DOI
Dataset, Object Detection. 9.2M images with unified annotations.
HTML Version.


SynthCity: A Large-Scale Synthetic Point Cloud,
2019.
WWW Link. Dataset, Point Clouds. Synthetic point clouds and RGB data from a detailed city model.


WHU Datasets,
2020.
WWW Link. Dataset, Buildings. Several datasets.
See also Whuan University.


VisDrone Datasets,
2019.
WWW Link. Dataset, Drone Images. Several datasets related to annual challenges..


Song, D.[Dan], Nie, W.Z.[Wei-Zhi], Li, W.H.[Wen-Hui], Kankanhalli, M.[Mohan], Liu, A.A.[An-An],
Monocular Image-Based 3-D Model Retrieval: A Benchmark,
Cyber(53), To be published.
IEEE DOI Dataset, MI3DOR.
WWW Link. Dataset, 3D Objects. Monocular image based 3D object retrieval


Tan, X.[Xin], Xu, K.[Ke], Cao, Y.[Ying], Zhang, Y.H.[Yi-Heng], Ma, L.Z.[Li-Zhuang], Lau, R.W.H.[Rynson W. H.],
Night-Time Scene Parsing With a Large Real Dataset,
IP(30), 2021, pp. 9085-9098.
IEEE DOI
Dataset, NightCity. Streaming media, Urban areas, Image segmentation, Annotations, Semantics, Computer science, Automobiles, Autonomous driving, adverse conditions


Deschaud, J.E.[Jean-Emmanuel], Duque, D.[David], Richa, J.P.[Jean Pierre], Velasco-Forero, S.[Santiago], Marcotegui, B.[Beatriz], Goulette, F.[François],
Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging Tasks in 3D Mapping,
RS(13), No. 22, 2021, pp. xx-yy.
DOI Link
Dataset, Point Cloud.


Pham, K.[Khoi], Kafle, K.[Kushal], Lin, Z.[Zhe], Ding, Z.H.[Zhi-Hong], Cohen, S.[Scott], Tran, Q.[Quan], Shrivastava, A.[Abhinav],
Learning to Predict Visual Attributes in the Wild,
CVPR21(13013-13023)
IEEE DOI
WWW Link. Dataset, VAW. Geometry, Visualization, Shape, Image color analysis, Annotations, Prediction algorithms


Zhou, Q.[Qiang], Wang, S.Y.[Shi-Yin], Wang, Y.T.[Yi-Tong], Huang, Z.L.[Zi-Long], Wang, X.G.[Xing-Gang],
Human De-occlusion: Invisible Perception and Recovery for Humans,
CVPR21(3690-3700)
IEEE DOI
WWW Link. Dataset, Human Occlusion. Annotations, Aggregates, Refining, Predictive models, Pattern recognition, Task analysis


Changpinyo, S.[Soravit], Sharma, P.[Piyush], Ding, N.[Nan], Soricut, R.[Radu],
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts,
CVPR21(3557-3567)
IEEE DOI
Dataset, Image Captioning. Conceptual 12M (CC12M), a dataset with 12 million image-text pairs. Visualization, Image recognition, Pipelines, Benchmark testing, Data collection, Knowledge discovery


Anderson, C.[Connor], Teuscher, A.[Adam], Anderson, E.[Elizabeth], Larsen, A.[Alysia], Shirley, J.[Josh], Farrell, R.[Ryan],
Have Fun Storming the Castle(s)!,
WACV21(3702-3711)
IEEE DOI
WWW Link.
Dataset, Castles. 2400 individual castles, palaces and fortresses from more than 90 countries, contains more than 770K images. Visualization, Image recognition, Geology, Computational modeling, Image retrieval


Figueiredo, A.[Augusto], Brayan, J.[Johnata], Reis, R.O.[Renan Oliveira], Prates, R.[Raphael], Schwartz, W.R.[William Robson],
MoRe: A Large-Scale Motorcycle Re-Identification Dataset,
WACV21(4033-4042)
IEEE DOI
WWW Link.
Dataset, Vehicles. Training, Deep learning, Computational modeling, Surveillance, Motorcycles, Traffic control


Le, H.A.[Hoang-An], Mensink, T.[Thomas], Das, P.[Partha], Karaoglu, S.[Sezer], Gevers, T.[Theo],
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes,
WACV21(1578-1588)
IEEE DOI
WWW Link.
Dataset, Outdoor Scenes. Deep learning, Image segmentation, Image color analysis, Computational modeling, Semantics


Scheck, T.[Tobias], Seidel, R.[Roman], Hirtz, G.[Gangolf],
Learning from THEODORE: A Synthetic Omnidirectional Top-View Indoor Dataset for Deep Transfer Learning,
WACV20(932-941)
IEEE DOI
Dataset, Fisheye Images. Cameras, Image segmentation, Object detection, Semantics, Solid modeling, Rendering (computer graphics)


Behley, J., Garbade, M., Milioto, A., Quenzel, J., Behnke, S., Stachniss, C., Gall, J.,
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences,
ICCV19(9296-9306)
IEEE DOI
Dataset, LiDAR. distance measurement, image segmentation, optical radar, stereo image processing, LiDAR sequences, Lasers


Wang, X., Wu, J., Chen, J., Li, L., Wang, Y., Wang, W.Y.,
VaTeX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research,
ICCV19(4580-4590)
IEEE DOI
WWW Link.
Dataset, . language translation, linguistics, natural language processing, video signal processing, unified multilingual model, Social network services


Gu, S., Lugmayr, A., Danelljan, M., Fritsche, M., Lamour, J., Timofte, R.,
DIV8K: DIVerse 8K Resolution Image Dataset,
AIM19(3512-3516)
IEEE DOI
Dataset, High Resolution. convolutional neural nets, image resolution, learning (artificial intelligence), CNN, image processing


Mauceri, C.[Cecilia], Palmer, M.[Martha], Heckman, C.[Christoffer],
SUN-Spot: An RGB-D Dataset With Spatial Referring Expressions,
CLVL19(1883-1886)
IEEE DOI
Dataset, Recognition. image colour analysis, object detection, SLAM (robots), spatial referring expressions, SUN-Spot, objects localization, multimodal


Sølund, T.[Thomas], Buch, A.G.[Anders Glent], Krüger, N.[Norbert], Aanæs, H.[Henrik],
A Large-Scale 3D Object Recognition Dataset,
3DV16(73-82)
IEEE DOI
Dataset, Object Recognition.
WWW Link. object recognition


Hua, B.S.[Binh-Son], Pham, Q.H.[Quang-Hieu], Nguyen, D.T.[Duc Thanh], Tran, M.K.[Minh-Khoi], Yu, L.F.[Lap-Fai], Yeung, S.K.[Sai-Kit],
SceneNN: A Scene Meshes Dataset with aNNotations,
3DV16(92-101)
IEEE DOI
Dataset, RGB-D.
WWW Link. Cameras


Rotman, D.[Daniel], Gilboa, G.[Guy],
A Depth Restoration Occlusionless Temporal Dataset,
3DV16(176-184)
IEEE DOI
Dataset, RGB-D.


Zhang, J.J.[Jun-Jie], Zhang, J.[Jian], Lu, J.F.[Jian-Feng], Shen, C.H.[Chun-Hua], Curr, K.[Kate], Phua, R.[Robin], Neville, R.[Richard], Edmonds, E.[Elise],
SLNSW-UTS: A Historical Image Dataset for Image Multi-Labeling and Retrieval,
DICTA16(1-6)
IEEE DOI
Dataset, Object Recognition. 29713 images, 119 labels.


Xiang, Y.[Yu], Kim, W.[Wonhui], Chen, W.[Wei], Ji, J.W.[Jing-Wei], Choy, C.[Christopher], Su, H.[Hao], Mottaghi, R.[Roozbeh], Guibas, L.J.[Leonidas J.], Savarese, S.[Silvio],
ObjectNet3D: A Large Scale Database for 3D Object Recognition,
ECCV16(VIII: 160-176).
Springer DOI
Dataset, Object Recognition.
WWW Link.


Lin, T.Y.[Tsung-Yi], Maire, M.[Michael], Belongie, S.J.[Serge J.], Hays, J.[James], Perona, P.[Pietro], Ramanan, D.[Deva], Dollár, P.[Piotr], Zitnick, C.L.[C. Lawrence],
Microsoft COCO: Common Objects in Context,
ECCV14(V: 740-755).
Springer DOI
Dataset, Objects.
WWW Link.


Flickr30k Dataset,
From image descriptions to visual denotations. WWW Link.
Dataset, Visual Question Answering. Extension of Flickr 8k dataset.


Plummer, B.A.[Bryan A.], Wang, L.W.[Li-Wei], Cervantes, C.M.[Chris M.], Caicedo, J.C.[Juan C.], Hockenmaier, J.[Julia], Lazebnik, S.[Svetlana],
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models,
IJCV(123), No. 1, May 2017, pp. 74-93.
Springer DOI

Earlier: ICCV15(2641-2649)
IEEE DOI
Dataset, Object Recognition. Benchmark testing


Fanello, S.R.[Sean Ryan], Ciliberto, C.[Carlo], Santoro, M.[Matteo], Natale, L.[Lorenzo], Metta, G.[Giorgio], Rosasco, L.[Lorenzo], Odone, F.[Francesca],
iCub World: Friendly Robots Help Building Good Vision Data-Sets,
GT13(700-705)
IEEE DOI
Dataset, Object Recognition. Human Robot Interaction; Object Categorization Dataset; iCub


Ponomarenko, N.[Nikolay], Ieremeiev, O.[Oleg], Lukin, V.[Vladimir], Jin, L.[Lina], Egiazarian, K.O.[Karen O.],
A New Color Image Database TID2013: Innovations and Results,
ACIVS13(402-413).
Springer DOI
Dataset, Color Images.


Ponce, J., Berg, T.L., Everingham, M.R., Forsyth, D.A., Hebert, M., Lazebnik, S.[Svetlana], Marszalek, M., Schmid, C., Russell, B.C., Torralba, A., Williams, C.K.I., Zhang, J., Zisserman, A.,
Dataset Issues in Object Recognition,
CLOR06(29-48).
Springer DOI
Dataset, Discussion.


Campbell, R., and Flynn, P.J.,
A WWW-Accessible 3D Image and Model Database for Computer Vision Research,
EEMCV98(148-154).
And: EEMTV98(xx) Dataset, 3-D Data.
HTML Version.


Nene, S.A., Nayar, S.K.[Shree K.], Murase, H.[Hiroshi],
Columbia Object Image Library (COIL-100),
ColumbiaTechnical Report CUCS-006-96, February 1996.
PS File. Also:
WWW Link. Also the COIL-20 database.
WWW Link. Dataset, Objects.


Section, Multiple Entries: 13.6.3.3 Dataset Distillation, Dataset Summary, Dataset Quantization Chapter Contents (Back)
Dataset Distillation. Dataset Summarization.


Zhai, W.[Wei], Luo, H.C.[Hong-Chen], Zhang, J.[Jing], Cao, Y.[Yang], Tao, D.C.[Da-Cheng],
One-Shot Object Affordance Detection in the Wild,
IJCV(130), No. 10, October 2022, pp. 2472-2500.
Springer DOI
Dataset, Affordance.
WWW Link. Affordance: potential action possibilities of objects in the scene.


Bileschi, S.M.[Stanley M.],
CBCL StreetScenes Challenge Framework,
Online2007.
WWW Link. Dataset, Object Detection. Primarily for Cars, people, and street scenes. Data is labeled.


Hoiem, D.[Derek], Efros, A.A.[Alexei A.], Hebert, M.[Martial],
Recovering Surface Layout from an Image,
IJCV(75), No. 1, October 2007, pp. 151-172.
Springer DOI

Earlier:
Geometric Context from a Single Image,
ICCV05(I: 654-661).
IEEE DOI
Dataset, Recognition. The example data is available:
HTML Version. Kanade issue. Coarse properties (ground plane, sky, planar regions) from one image. Probabilistic approach to estimate 3D geometry so that not every possible view is needed.


Fu, H.[Huan], Jia, R.F.[Rong-Fei], Gao, L.[Lin], Gong, M.M.[Ming-Ming], Zhao, B.Q.[Bin-Qiang], Maybank, S.J.[Steve J.], Tao, D.C.[Da-Cheng],
3D-FUTURE: 3D Furniture Shape with TextURE,
IJCV(129), No. 12, December 2021, pp. 3313-3337.
Springer DOI
Dataset Furniture.
WWW Link.


Medical Dataset Archive,
2006. Dataset, Medical Images.
WWW Link. Variety of medical data. CT dataset available from related web site.


Visible Human Project,
1994. Dataset, Medical Images.
HTML Version. Complete data in MRI, CT, slices.


MOTA Object Tracking Benchmark,
2021 for workshop.
WWW Link. Dataset, Cell Tracking.


CR Chisto Labeled Nuclei Dataset,
Online2016
WWW Link. Dataset, Nuclei.
Dataset of colorectal cancer histology images consisting of nearly 30,000 dotted nuclei with over 22,000 labeled with the type of cell they belong to.


FIRE Fundus Image Registration Dataset,
2016
WWW Link. Dataset, Retinal. Dataset, Registration.
134 retinal image pairs and ground truth for registration.


Kauppi, T., Kalesnykiene, V., Kamarainen, J.K., Lensu, L., Sorri, I., Raninen, A., Voutilainen, R., Uusitalo, H., Kalviainen, H., Pietila, J.,
The DIARETDB1 diabetic retinopathy database and evaluation protocol,
BMVC07(xx-yy).
PDF File.
Dataset, Retina.


MiniMammographic Database,
1995
WWW Link. Dataset, Mammography.


DDSM: Digital Database for Screening Mammography,
2000, USF.
HTML Version. Dataset, Mammography.


Developing Human Connectome Project (dHCP),
2017
WWW Link. Dataset, fMRI. The imaging data includes structural imaging, structural connectivity data (diffusion MRI) and functional connectivity data (resting-state fMRI).


Andreopoulos, A.[Alexander], Tsotsos, J.K.[John K.],
Cardiac MRI dataset,
Online2008.
WWW Link. Dataset, Cardiac MRI.


CoronARe: A Coronary Artery Reconstruction Challenge,
2017. Dataset, Angiography.
WWW Link. 3D Reconstrucion challange dataset.


Zimmermann, K.[Karel], Matas, J.G.[Jirí G.], Svoboda, T.[Thomáš],
Tracking by an Optimal Sequence of Linear Predictors,
PAMI(31), No. 4, April 2009, pp. 677-692.
IEEE DOI
Code, Tracking. Dataset, Tracking.
Earlier: A1, A3, A2:
Simultaneous learning of motion and appearance,
MLMotion08(xx-yy).

Earlier: A1, A3, A2:
Adaptive Parameter Optimization for Real-time Tracking,
NRTL07(1-8).
IEEE DOI

Earlier: A1, A3, A2:
Multiview 3D Tracking with an Incrementally Constructed 3D Model,
3DPVT06(488-495).
IEEE DOI
Learning approach to tracking. Estimation of the pose given the pose of the previous frame. Matlab implementation available.
WWW Link.


Huang, Y.[Yan], Essa, I.A.[Irfan A.],
Tracking Multiple Objects through Occlusions,
CVPR05(II: 1051-1058).
IEEE DOI
WWW Link. Dataset, Actions.

And: CVPR05(II: 1182).
IEEE DOI
See also Georgia Tech.


Hopkins 155,
Motion Dataset Online2007.
WWW Link. Dataset, Motion. Testing feature based motion segmentation algorithms.
See also Johns Hopkins University.


Tracking Any Object, TAO, Dataset,
Motion Dataset Online
WWW Link. Dataset, Tracking. 2,907 high resolution videos, captured in diverse environments.


OTCBVS Benchmark Dataset Collection,
2001
WWW Link. Dataset, Tracking. Dataset, Face Recognition. Collection of datasets for benchmarking realted to the related conferences. Includes face dataset.


UCF Parking Lot Tracking,
2012
WWW Link. Dataset, Tracking. Tracking multiple people in parking lot.
See also Part-based multiple-person tracking with partial occlusion handling.
See also GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs.


Dubuisson, S.[Séverine], Gonzales, C.[Christophe],
A survey of datasets for visual tracking,
MVA(27), No. 1, January 2016, pp. 23-52.
WWW Link.
Survey, Tracking. Dataset, Tracking.


Huang, L.H.[Liang-Hua], Zhao, X.[Xin], Huang, K.Q.[Kai-Qi],
GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild,
PAMI(43), No. 5, May 2021, pp. 1562-1577.
IEEE DOI
WWW Link. Dataset, Tracking. Training, Object tracking, Databases, Protocols, Benchmark testing, Servers, Object tracking, benchmark dataset, performance evaluation


Bondi, E., Jain, R., Aggrawal, P., Anand, S., Hannaford, R., Kapoor, A., Piavis, J., Shah, S., Joppa, L., Dilkina, B., Tambe, M.,
BIRDSAI: A Dataset for Detection and Tracking in Aerial Thermal Infrared Videos,
WACV20(1736-1745)
IEEE DOI
Dataset, Tracking.
WWW Link. Videos, Cameras, Surveillance, Animals, Task analysis, Benchmark testing


Dave, A.[Achal], Khurana, T.[Tarasha], Tokmakov, P.[Pavel], Schmid, C.[Cordelia], Ramanan, D.[Deva],
TAO: A Large-scale Benchmark for Tracking Any Object,
ECCV20(V:436-454).
Springer DOI
Dataset, Tracking.


Lukezic, A., Kart, U., Käpylä, J., Durmush, A., Kamarainen, J., Matas, J.G., Kristan, M.,
CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark,
ICCV19(10012-10021)
IEEE DOI
Dataset, Tracking. image colour analysis, image sequences, object detection, object tracking, pose estimation, most diverse dataset, Robot sensing systems


Müller, M.[Matthias], Bibi, A.[Adel], Giancola, S.[Silvio], Alsubaihi, S.[Salman], Ghanem, B.[Bernard],
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild,
ECCV18(I: 310-327).
Springer DOI
Dataset, Tracking.


Valmadre, J.[Jack], Bertinetto, L.[Luca], Henriques, J.F.[João F.], Tao, R.[Ran], Vedaldi, A.[Andrea], Smeulders, A.W.M.[Arnold W. M.], Torr, P.H.S.[Philip H. S.], Gavves, E.[Efstratios],
Long-Term Tracking in the Wild: A Benchmark,
ECCV18(III: 692-707).
Springer DOI
Dataset, Tracking.


Zhang, S.[Shu], Staudt, E.[Elliot], Faltemier, T.[Tim], Roy-Chowdhury, A.K.[Amit K.],
A Camera Network Tracking (CamNeT) Dataset and Performance Baseline,
WACV15(365-372)
IEEE DOI
Dataset, Camera Tracking.
WWW Link. Cameras; Legged locomotion; Lighting; Target tracking; Trajectory; Videos


Jaynes, C., Kale, A., Sanders, N., Grossmann, E.,
The Terrascope Dataset: Scripted Multi-Camera Indoor Video Surveillance with Ground-truth,
PETS05(309-316).
IEEE DOI
WWW Link.
Dataset, Surveillance.


Visual Object Tracking Challenges, VOT,
Tracking Challenges and datasets. Online
HTML Version. Dataset, Tracking. Various VOT workshop datasets.
See also Visual Object Tracking Challenge.


Li, A., Lin, M., Wu, Y., Yang, M., Yan, S.,
NUS-PRO: A New Visual Tracking Challenge,
PAMI(38), No. 2, February 2016, pp. 335-349.
IEEE DOI
Dataset, Tracking. Airplanes


Dendorfer, P.[Patrick], Osep, A.[Aljosa], Milan, A.[Anton], Schindler, K.[Konrad], Cremers, D.[Daniel], Reid, I.D.[Ian D.], Roth, S.[Stefan], Leal-Taixé, L.[Laura],
MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking,
IJCV(129), No. 4, April 2021, pp. 845-881.
Springer DOI
WWW Link. Dataset, Motion Tracking. There are a series of related datasets for annual challenges.


PETS Benchmark Datasets,
Online2006 Dataset:
HTML Version. Dataset, Surveillance. 2014 Dataset:
HTML Version. 2015 Dataset:
HTML Version. 2016 Dataset:
HTML Version.


The KITTI Vision Benchmark Suite,
Online2013
WWW Link. Dataset, Road Scenes. Award, Everingham Prize. Stereo, Lidar, GPS, etc.
See also Vision meets robotics: The KITTI dataset.


Per, J.[Janez], Kenk, V.S.[Vildana Sulic], Mandeljc, R.[Rok], Kristan, M.[Matej], Kovacic, S.[Stanislav],
Dana36: A Multi-camera Image Dataset for Object Identification in Surveillance Scenarios,
AVSS12(64-69).
IEEE DOI
Dataset,Surveillance.


LHI Surveillance Dataset,
Annotated surveillance images. Online2008
HTML Version. Dataset, Segmentation. Subset of larger dataset.
See also Lotus Hill Institute.


CLEAR: Classification of Events, Activities and Relationships,
MTPH07
WWW Link. Dataset, Activity Recogniton.


i-LIDS: Bag and vehicle detection challenge,
Online2007 AVSBS07
HTML Version. Dataset, Activity Recogniton. Data used at Advanced Video and Signal Based Surveillance, 2007.


Multimedia Event Detection,
Series of Event and Activity Detection evaluations.
WWW Link.
WWW Link.
WWW Link. Dataset, Activity Recogniton. MED13, MED12, MED11.


Multiview Extended Video with Activities,
MEVA Test 3:
WWW Link. Information also:
WWW Link. Dataset, Activity Recogniton. Dataset, MEVA. 333 hours of ground-camera and UAV videos and 28 hours of MEVA training Annotations dataset.
See also MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection.


PETS 2006 Benchmark Data,
Online2006 PETS06
HTML Version. Dataset, Activity Recogniton. Data used at International Workshop on Performance Evaluation of Tracking and Surveillance 2006.


PETS 2001 Benchmark Data,
Online2001 PETS01
WWW Link. Dataset, Activity Recogniton. Data used at International Workshop on Performance Evaluation of Tracking and Surveillance 2001.


OTCBVS Benchmark Dataset Collection,
OTCBVS072007
WWW Link. Dataset, Activity Recogniton. Beyound the Visual Spectrum (IR especially). Data for various OTCBVS workshops.


YouTube-8M Dataset,
Labed video dataset.
WWW Link.
WWW Link. Dataset, Video Database. 4700+ visual entities. Introduced in:
See also YouTube-8M: A Large-Scale Video Classification Benchmark.


Fisher, R.B.[Robert B.],
CAVIAR Test Case Scenarios,
Online BookOctober 2004.
WWW Link. Dataset, Video. From the EC funded CAVIAR project (Context Aware Vision using Image-based Active Recognition). The sequences are labelled (in XML) with both the tracked persons and a semantic description of their activities. 81 video sequences comprising about 90K frames. These sequences include indoor plaza and shopping center observations of individuals and small groups of people walking, browsing, window shopping, fighting, meeting, leaving packages behind, collapsing, entering and exiting shops, etc.


Optic Flow Data,
Edinburgh2007. Smoothed flow sequences for the Waverly train station scene.
WWW Link. Dataset, Video. Behavior, pedestrian analysis.


BEHAVE Interactions Test Case Scenarios,
Edinburgh2007. Two views of various scenarios of people acting out various interactions.
WWW Link. Dataset, Video. Behavior, pedestrian analysis. Includes ground truth bounding boxes for much of the data.


Sigal, L.[Leonid], Balan, A.O.[Alexandru O.], Black, M.J.[Michael J.],
HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion,
IJCV(87), No. 1-2, March 2010, pp. xx-yy.
Springer DOI
Dataset, Human Motion.
Earlier: A1, A3, Only:
HumanEva: Synchronized Video and Motion Capture Dataset for Evaluation of Articulated Human Motion,
BrownTechnical Report CS-06-08, September 2006.
HTML Version. For the dataset:
HTML Version. Calibrated video sequences synchronized with motion capture data.


Bolten, T.[Tobias], Pohle-Fröhlich, R.[Regina], Tönnies, K.D.[Klaus D.],
DVS-OUTLAB: A Neuromorphic Event-Based Long Time Monitoring Dataset for Real-World Outdoor Scenarios,
EventVision21(1348-1357)
IEEE DOI
Dataset, Surveilance. Privacy, Rain, Power demand, Neuromorphics, Noise reduction, Pipelines, Vision sensors


Li, L.Z.[Long-Zhen], Nawaz, T.[Tahir], Ferryman, J.M.,
PETS 2015: Datasets and challenge,
AVSS15(1-6)
IEEE DOI
Dataset, PETS 2015. object detection


Oh, S.M.[Sang-Min], Hoogs, A.J.[Anthony J.], Perera, A.[Amitha], Cuntoor, N.[Naresh], Chen, C.C.[Chia-Chih], Lee, J.T.[Jong Taek], Mukherjee, S.[Saurajit], Aggarwal, J.K., Lee, H.T.[Hyung-Tae], Davis, L.S.[Larry S.], Swears, E.[Eran], Wang, X.Y.[Xiao-Yang], Ji, Q.A.[Qi-Ang], Reddy, K.K.[Kishore K.], Shah, M.[Mubarak], Vondrick, C.[Carl], Pirsiavash, H.[Hamed], Ramanan, D.[Deva], Yuen, J.[Jenny], Torralba, A.B.[Antonio B.], Song, B.[Bi], Fong, A.[Anesco], Roy-Chowdhury, A.K.[Amit K.], Desai, M.[Mita],
A large-scale benchmark dataset for event recognition in surveillance video,
CVPR11(3153-3160).
IEEE DOI

And: AVSBS11(527-528).
IEEE DOI
Dataset, Action Recognition. Dataset, Event Recognition.


Harvey, A.[Adam], LaPlace, J.[Jules],
Exposing.ai,
Online2021.
WWW Link.
Dataset, Duke MTMC Dataset. Privacy issues in re-identification research and the use of large datasets.


MIT Car Database MITC,
Online2000
HTML Version. Dataset, Vehicles.


PKU-VD Dataset,
2017 HTML Version.
Dataset, Vehicles. VD1: 1,097,649 images. 1,232 vehicle models and 11 colors. VD2: 807,260 images. 1,112 vehicle models and 11 colors. Reference:
See also Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles.


PKU VehicleID Dataset,
2016 HTML Version.
Dataset, Vehicles. 10319 vehicles, 90196 images. Reference:
See also Deep Relative Distance Learning: Tell the Difference between Similar Vehicles.


Struwe, M.[Marvin], Hasler, S.[Stephan], Bauer-Wersing, U.[Ute],
Rendered Benchmark Data Set for Evaluation of Occlusion-Handling Strategies of a Parts-Based Car Detector,
PSIVT15(99-110).
Springer DOI
Dataset, Vehicle Detection.


Racing Bicycle Detection/Tracking from UAV Footage, UAV Detection,
Motion Datasets Online2019.
HTML Version. Dataset, Vehicle Tracking. Dataset, Drone Detection. Multiple datasets. UAV detection against variety of backgrounds.
See also MULTIDRONE.


Stanford Cars Dataset,
2019. A dataset for understanding human actions in still images WWW Link.
HTML Version.
Dataset, Vehicles. 196 classes of cars, 16,185 images.
See also Leveraging the Wisdom of the Crowd for Fine-Grained Recognition.
See also Stanford University, Computer Science Departent.


Behrendt, K.,
Boxy Vehicle Detection in Large Images,
CVRSUAD19(840-846)
IEEE DOI
Dataset, Vehicles.
WWW Link. cameras, image resolution, image segmentation, object detection, road vehicles, traffic engineering computing, individual teams, dataset


UA-DETRAC Benchmark Suite,
2016.
WWW Link. Dataset, Traffic.

See also UA-DETRAC 2017: Report of AVSS2017 IWT4S Challenge on Advanced Traffic Monitoring.


Neuhold, G.[Gerhard], Ollmann, T.[Tobias], Bulò, S.R.[Samuel Rota], Kontschieder, P.[Peter],
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes,
ICCV17(5000-5009)
IEEE DOI
Dataset, Traffic. 25,000 images, 66 categories. computational geometry, data visualisation, image annotation, image resolution, image segmentation, road traffic, Visualization


Koschorrek, P.[Philipp], Piccini, T.[Tommaso], Oberg, P.[Per], Felsberg, M.[Michael], Nielsen, L.[Lars], Mester, R.[Rudolf],
A Multi-sensor Traffic Scene Dataset with Omnidirectional Video,
GT13(727-734)
IEEE DOI
Dataset, Traffic. automotive


da Cruz, S.D.[Steve Dias], Wasenmüller, O.[Oliver], Beise, H.P.[Hans-Peter], Stifter, T.[Thomas], Stricker, D.[Didier],
SVIRO: Synthetic Vehicle Interior Rear Seat Occupancy Dataset and Benchmark,
WACV20(962-971)
IEEE DOI
Dataset, Vehicle Surveilance.
WWW Link. Task analysis, Benchmark testing, Training, Automobiles, Cameras, Lightning


Massoz, Q., Langohr, T., Francois, C., Verly, J.G.,
The ULg multimodality drowsiness database (called DROZY) and examples of use,
WACV16(1-7)
IEEE DOI
Dataset, Driver Monitoring. Cameras


Xu, Z.B.[Zhen-Bo], Yang, W.[Wei], Meng, A.[Ajin], Lu, N.X.[Nan-Xue], Huang, H.[Huan], Ying, C.C.[Chang-Chun], Huang, L.S.[Liu-Sheng],
Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline,
ECCV18(XIII: 261-277).
Springer DOI
Dataset, License Plates.


Eraqi, H.M.[Hesham M.], Abouelnaga, Y.[Yehya], Saad, M.H.[Mohamed H.], Moustafa, M.N.[Mohamed N.],
Distracted Driver Dataset,
WWW Link.
Dataset, Driver Monitoring. Includes Distracted Driver V1 and Distracted Driver V2.


MIT Pedestrian Database MITP,
Online2000
HTML Version. Dataset, Surveillance.


UCF Action Recogniton Dataset 101,
Online2012
WWW Link.

Earlier: UCF Action Recogniton Dataset 50,
Online2010
WWW Link. Dataset, Surveillance.
101 action categories, consisting of realistic videos taken from youtube. UCF 101 is an extension of UCF 50. Categories include: Baseball Pitch, Basketball Shooting, Bench Press, Biking, Biking, Billiards Shot,Breaststroke, Clean and Jerk, Diving, Drumming, Fencing, Golf Swing, Playing Guitar, High Jump, Horse Race, Horse Riding, Hula Hoop, Javelin Throw, Juggling Balls, Jump Rope, Jumping Jack, Kayaking, Lunges, Military Parade, Mixing Batter, Nun chucks, Playing Piano, Pizza Tossing, Pole Vault, Pommel Horse, Pull Ups, Punch, Push Ups, Rock Climbing Indoor, Rope Climbing, Rowing, Salsa Spins, Skate Boarding, Skiing, Skijet, Soccer, Juggling, Swing, Playing Tabla, TaiChi, Tennis Swing, Trampoline Jumping, Playing Violin, Volleyball Spiking, Walking with a dog, and Yo Yo. The printed reference:
See also UCF101: A Dataset of 101 Human Action Classes from Videos in The Wild.


UCF-iPhone,
Online2012
WWW Link. Dataset, Surveillance.
Aerobic actions using the Inertial Measurement Unit (IMU) on an Apple iPhone. Biking, Climbing Stairs, Descending Stairs, Gym Biking, Jump Roping, Running, Standing, Treadmill Walking and Walking.
See also Macro-Class Selection for Hierarchical K-NN Classification of Inertial Sensor Data. for the paper.


Hollywood2 Human Actions and Scenes Dataset,
Online2016
WWW Link. Dataset, Surveillance.
Part originally from:
See also Actions in context.


HMDB: a large human motion database,
Online2016
WWW Link. Dataset, Surveillance. Award, ICCV, Helmholtz.
51 actions.
See also HMDB: A large video database for human motion recognition.


TRECVID Workshop DAta,
Online2017
HTML Version. Dataset, Surveillance.
Surveillance datasets from 2001 to 2017.


Privacy-Preserving Visual Recognition PA-HMDB51,
Online2019.
WWW Link. Dataset, Actions. Dataset, Privacy. The dataset contains 592 videos selected from the HMDB51 dataset (
See also HMDB: A large video database for human motion recognition. ). For each video, we provide with frame-level annotation of five privacy attributes: skin color, gender, face, nudity, and relationship.
See also Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot Study.


HVU Dataset,
Online2021
WWW Link. Dataset, Action. For Holistic Video Understanding workshop


EPIC-KITCHENS,
Online2018
WWW Link. Dataset, Action. Dataset, Daily Activities. First-person (egocentric) vision; multi-faceted non-scripted recordings in native environments - i.e. the wearers' homes, capturing all daily activities in the kitchen over multiple days.
See also EPIC-KITCHENS Dataset: Collection, Challenges and Baselines, The.


Egocentric Live 4D Perception (Ego4D) Dataset: A large-scale first-person video dataset, supporting research in multi-modal machine perception for daily life activity,
Online2021
WWW Link. Dataset, Action. Dataset, Egocentric. The Ego4D Consortium. A large-scale first-person video dataset, supporting research in multi-modal machine perception for daily life activity.


Kay, W.[Will], Carreira, J.[Joao], Simonyan, K.[Karen], Zhang, B.[Brian], Hillier, C.[Chloe], Vijayanarasimhan, S.[Sudheendra], Viola, F.[Fabio], Green, T.[Tim], Back, T.[Trevor], Natsev, P.[Paul], Suleyman, M.[Mustafa], Zisserman, A.[Andrew],
The Kinetics Human Action Video Dataset,
Online2019.
WWW Link.
WWW Link. Dataset, Actions. Dataset, Human Action.


Tenorth, M.[Moritz], Bandouch, J.[Jan], Beetz, M.[Michael],
The TUM Kitchen Data Set of everyday manipulation activities for motion tracking and action recognition,
THEMIS09(1089-1096).
IEEE DOI
Dataset, Activity Recognition.


Guerra-Filho, G.[Gutemberg], Biswas, A.[Arnab],
The human motion database: A cognitive and parametric sampling of human motion,
IVC(30), No. 3, March 2012, pp. 251-261.
Elsevier DOI

Earlier: FG11(103-110).
IEEE DOI
Dataset, Activity Recognition. Human motion database; Quantitative evaluation; Parametric and cognitive sampling; Motion synthesis and analysis


Chaquet, J.M.[Jose M.], Carmona, E.J.[Enrique J.], Fernandez-Caballero, A.[Antonio],
A survey of video datasets for human action and activity recognition,
CVIU(117), No. 6, June 2013, pp. 633-659.
Elsevier DOI
Survey, Activity Recognition. Dataset, Activity Recognition. Human action recognition; Human activity recognition; Database; Dataset; Review; Survey


Chavarriaga, R.[Ricardo], Sagha, H.[Hesam], Calatroni, A.[Alberto], Digumarti, S.T.[Sundara Tejaswi], Tröster, G.[Gerhard], del R. Millán, J.[José], Roggen, D.[Daniel],
The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition,
PRL(34), No. 15, 2013, pp. 2033-2042.
Elsevier DOI
Dataset, Activity Recognition. Activity recognition


Barrett, D.P.[Daniel Paul], Xu, R.[Ran], Yu, H.N.[Hao-Nan], Siskind, J.M.[Jeffrey Mark],
Collecting and annotating the large continuous action dataset,
MVA(27), No. 7, October 2016, pp. 983-995.
Springer DOI
Dataset, Actions. LCA Dataset.


Hadfield, S.[Simon], Lebeda, K.[Karel], Bowden, R.[Richard],
Hollywood 3D: What are the Best 3D Features for Action Recognition?,
IJCV(121), No. 1, January 2017, pp. 95-110.
Springer DOI

Earlier: A1, A3, Only:
Hollywood 3D: Recognizing Actions in 3D Natural Scenes,
CVPR13(3398-3405)
IEEE DOI
Dataset, Attion Recognition. Hollywood3D dataset. 3.5d


Monfort, M.[Mathew], Andonian, A.[Alex], Zhou, B.L.[Bo-Lei], Ramakrishnan, K.[Kandan], Bargal, S.A.[Sarah Adel], Yan, T.[Tom], Brown, L.[Lisa], Fan, Q.F.[Quan-Fu], Gutfreund, D.[Dan], Vondrick, C.[Carl], Oliva, A.[Aude],
Moments in Time Dataset: One Million Videos for Event Understanding,
PAMI(42), No. 2, February 2020, pp. 502-508.
IEEE DOI
WWW Link. Dataset, Action. Videos, Visualization, Feature extraction, Vocabulary, Animals, Convolution, Video dataset, event recognition


Patino, L.[Luis], Ferryman, J.M.[James M.],
PETS 2014: Dataset and challenge,
AVSS14(355-360)
IEEE DOI
Dataset, Surveillance. Cameras


Liu, C.[Ce], Freeman, W.T.[William T.], Adelson, E.H.[Edward H.], Weiss, Y.[Yair],
Human-assisted motion annotation,
CVPR08(1-8).
IEEE DOI
Dataset, Motion.
WWW Link. Motion annotation then applied to datasets to provide ground truth.


Shi, Y.F.[Yi-Fan], Huang, Y.[Yan], Minnen, D., Bobick, A.F., Essa, I.A.,
Propagation networks for recognition of partially ordered sequential action,
CVPR04(II: 862-869).
IEEE DOI
HTML Version. Dataset, Actions.
See also Georgia Tech.


Crowd Detection/Recognition/Segmentation from UAV/Drone-Captured Images/Videos,
2022.
WWW Link. Dataset, Crowd Detection. Under the auspices of the European Union's "Horizon 2020" research framework programme. It is a collection of datasets suitable for research on autonomous UAV/drone vision.
See also Aristotle University of Thessaloniki.


VIPeR: Viewpoint Invariant Pedestrian Recognition,
Pedestrian dataset. 2007. WWW Link.
Dataset, Pedestrians.


Akshatha, K.R., Karunakar, A.K., Shenoy, B.S.[B. Satish], Pavan, K.P.[K. Phani], Dhareshwar, C.V.[Chinmay V.], Johnson, D.G.[Dennis George],
Manipal-UAV person detection dataset: A step towards benchmarking dataset and algorithms for small object detection,
PandRS(195), 2023, pp. 77-89.
Elsevier DOI
Dataset, UAV Human Detection. Small object detection, Unmanned aerial vehicles, Convolutional neural networks, Deep learning, Computer vision


Wang, D.[Dan], Zhang, C.Y.[Chong-Yang], Cheng, H.[Hao], Shang, Y.F.[Yan-Feng], Mei, L.[Lin],
SPID: Surveillance Pedestrian Image Dataset and Performance Evaluation for Pedestrian Detection,
BEST16(III: 463-477).
Springer DOI
Dataset, Pedestrians.


Stanford 40 Actions,
A dataset for understanding human actions in still images HTML Version.
Dataset, Action Recognition.


People Playing Musical Instrument (PPMI),
A dataset of human and object interaction activities HTML Version.
Dataset, Action Recognition.


Kliper-Gross, O.[Orit], Hassner, T.[Tal], Wolf, L.B.[Lior B.],
The Action Similarity Labeling Challenge,
PAMI(34), No. 3, March 2012, pp. 615-621.
IEEE DOI
Dataset, Action Recognition. Labeled dataset. Same/not-same rather than recognition.


Distante, C.[Cosimo], Diraco, G.[Giovanni], Leone, A.[Alessandro],
Active Range Imaging Dataset for Indoor Surveillance,
BMVA(2010), No. 3, 2010, pp. 1-14.
PDF File.
Dataset, Action Recognition.


Blunsden, S.[Scott], Fisher, R.B.[Robert B.],
The BEHAVE video dataset: Ground truthed video for multi-person behavior classification,
BMVA(2010), No. 4, 2010, pp. 1-12.
PDF File.
Dataset, Action Recognition.


Hwang, S.[Soonmin], Park, J.[Jaesik], Kim, N.[Namil], Choi, Y.[Yukyung], Kweon, I.S.[In So],
Multispectral pedestrian detection: Benchmark dataset and baseline,
CVPR15(1037-1045)
IEEE DOI
Dataset, Pedestrian Detection.


Wallraven, C.[Christian], Schultze, M.[Michael], Mohler, B.[Betty], Vatakis, A.[Argiro], Pastra, K.[Katerina],
The POETICON enacted scenario corpus: A tool for human and computational experiments on action understanding,
FG11(484-491).
IEEE DOI
Dataset, Actions.


Munder, S., Gavrila, D.M.[Dariu M.],
An Experimental Study on Pedestrian Classification,
PAMI(28), No. 11, November 2006, pp. 1863-1868.
IEEE DOI
PDF File.
Dataset available:
HTML Version. Dataset, Pedestrians. DaimlerChrysler Res. Investigate global versus local and adaptive versus nonadaptive features. PCA coefficients, Haar wavelets, and local receptive fields (LRFs). SVM, Neural Nets, K-NN classifiers. Combination of SVMs with LRF features performs best. And boosted cascade of Haar wavelets is close.


Daimler Pedestrian Detection Benchmark,
2009.
HTML Version. Dataset, Pedestrian Detection. Dataset, Surveillance.
See also Daimler. Training set: 15,560 pedestrian and non-pedestrian samples. 6744 additional images. Test set: a sequence with more than 21,790 images with 56,492 pedestrian labels. From a vehicle in 27 minutes of urban driving. VGA resolution. Dataset used in:
See also Monocular Pedestrian Detection: Survey and Experiments.


Edinburgh Informatics Forum Pedestrian Database,
2010.
WWW Link. Dataset, Human Tracking. Dataset, Surveillance. Overhead views, of a building atrium. Several months of observations, with trajectories (computed).


Dalal, N.[Navneet],
INRIA Person Dataset,
Online2005
WWW Link. Dataset, Human Motion. The collected dataset for the above paper, from various sources.


Wu, Y.[Yang], Liu, Y.L.[Yuan-Liu], Yuan, Z.J.[Ze-Jian], Zheng, N.N.[Nan-Ning],
IAIR-CarPed: A psychophysically annotated dataset with fine-grained and layered semantic labels for object recognition,
PRL(33), No. 2, 15 January 2012, pp. 218-226.
Elsevier DOI
Dataset, Pedestrian Detection. Object recognition; Image database; Object detection; Pedestrian detection; Psychophysical experiments


García-Martín, Á.[Álvaro], Martínez, J.M.[José M.], Bescós, J.[Jesús],
A corpus for benchmarking of people detection algorithms,
PRL(33), No. 2, 15 January 2012, pp. 152-156.
Elsevier DOI
Dataset, Person Detection. People detection; Ground-truth; Corpus; Dataset; Surveillance video


Wang, Q.[Qi], Gao, J.Y.[Jun-Yu], Lin, W.[Wei], Li, X.L.[Xue-Long],
NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization,
PAMI(43), No. 6, June 2021, pp. 2141-2149.
IEEE DOI
WWW Link.
WWW Link.
Dataset, Crowd Counting. Benchmark testing, Task analysis, Head, Surveillance, Cameras, Magnetic heads, Internet, Crowd counting, crowd localization, benchmark website


Sindagi, V., Yasarla, R., Patel, V.,
Pushing the Frontiers of Unconstrained Crowd Counting: New Dataset and Benchmark Method,
ICCV19(1221-1231)
IEEE DOI
Dataset, Crowd Counting. feature extraction, image classification, learning (artificial intelligence), object detection, Error analysis


Rasouli, A., Kotseruba, I., Kunic, T., Tsotsos, J.K.[John K.],
PIE: A Large-Scale Dataset and Models for Pedestrian Intention Estimation and Trajectory Prediction,
ICCV19(6261-6270)
IEEE DOI
Dataset, Pedestrians.
WWW Link. intelligent transportation systems, pedestrians, large-scale dataset, pedestrian intention estimation, Vehicle dynamics


Zheng, L.[Liang], Bie, Z.[Zhi], Sun, Y.F.[Yi-Fan], Wang, J.D.[Jing-Dong], Su, C.[Chi], Wang, S.J.[Sheng-Jin], Tian, Q.[Qi],
MARS: A Video Benchmark for Large-Scale Person Re-Identification,
ECCV16(VI: 868-884).
Springer DOI
Dataset, Re-Identification.


Yan, C.[Cheng], Pang, G.S.[Guan-Song], Wang, L.[Lei], Jiao, J.[Jile], Feng, X.T.[Xue-Tao], Shen, C.H.[Chun-Hua], Li, J.J.[Jing-Jing],
BV-Person: A Large-scale Dataset for Bird-view Person Re-identification,
ICCV21(10923-10932)
IEEE DOI
Dataset, Re-Identification. Computational modeling, Benchmark testing, Cameras, Video surveillance, Search problems, Birds, Image and video retrieval


Figueira, D.[Dario], Taiana, M.[Matteo], Nambiar, A.[Athira], Nascimento, J.C.[Jacinto C.], Bernardino, A.[Alexandre],
The HDA+ Data Set for Research on Fully Automated Re-identification Systems,
Re-Id14(241-255).
Springer DOI
Dataset, Re-Identification.


Ragheb, H.[Hossein], Velastin, S.A.[Sergio A.], Remagnino, P.[Paolo], Ellis, T.[Tim],
Human action recognition using robust power spectrum features,
ICIP08(753-756).
IEEE DOI

And:
ViHASi: Virtual human action silhouette data for the performance evaluation of silhouette-based action recognition methods,
ICDSC08(1-10).
IEEE DOI

And: VNBA08(77-84).
DOI Link

And:
A Novel Approach for Fast Action Recognition using Simple Features,
VS08(xx-yy).
Dataset, Action Recognition. Silhouette based action recognition.


Chakraborty, A.[Anirban], Das, A.[Abir], Roy-Chowdhury, A.K.[Amit K.],
Network Consistent Data Association,
PAMI(38), No. 9, September 2016, pp. 1859-1871.
IEEE DOI

Earlier: A2, A1, A3:
Consistent Re-identification in a Camera Network,
ECCV14(II: 330-345).
Springer DOI
Dataset, Re-Identification.
WWW Link.


Bialkowski, A., Denman, S., Sridharan, S., Fookes, C., Lucey, P.,
A Database for Person Re-Identification in Multi-Camera Surveillance Networks,
DICTA12(1-8).
IEEE DOI
Dataset, Re-Identification.


Gou, M., Karanam, S., Liu, W., Camps, O., Radke, R.J.,
DukeMTMC4ReID: A Large-Scale Multi-camera Person Re-identification Dataset,
Re-Id17(1425-1434)
IEEE DOI
Dataset, Re-Identification. Airports, Benchmark testing, Cameras, Detectors, Feature extraction, Measurement, Surveillance


MoCA: Moving Camouflaged Animals dataset,
Online2020.
WWW Link. Dataset, Animals.
See also Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation.


Indoor pig behavior RGBD video dataset,
Online2021.
WWW Link. Dataset, Animals. There are approximately 3.5 million data frames in 1905 clips, each 5 minutes long, for a total of about 160 hours of video.
See also Extracting Accurate Long-Term Behavior Changes from a Large Pig Dataset.


Truong, C.[Charles], Barrois-Müller, R.[Rémi], Moreau, T.[Thomas], Provost, C.[Clément], Vienne-Jumeau, A.[Aliénor], Moreau, A.[Albane], Vidal, P.P.[Pierre-Paul], Vayatis, N.[Nicolas], Buffat, S.[Stéphane], Yelnik, A.[Alain], Ricard, D.[Damien], Oudre, L.[Laurent],
A Data Set for the Study of Human Locomotion with Inertial Measurements Units,
IPOL(9), 2019, pp. 381-390.
DOI Link
Dataset, Gait. Data set of 1020 multivariate gait signals collected with two inertial measurement units, from 230 subjects undergoing a fixed protocol: standing still, walking 10 m, turning around, walking back and stopping. In total, 8.5~h of gait time series are distributed.


Song, C.F.[Chun-Feng], Huang, Y.Z.[Yong-Zhen], Wang, W.N.[Wei-Ning], Wang, L.[Liang],
CASIA-E: A Large Comprehensive Dataset for Gait Recognition,
PAMI(45), No. 3, March 2023, pp. 2801-2815.
IEEE DOI
Dataset, Gait Recognition. Videos, Gait recognition, Legged locomotion, Face recognition, Training, Lighting, Benchmark testing, Deep learning, gait dataset, soft biometrics


Zhu, Z.[Zheng], Guo, X.D.[Xian-Da], Yang, T.[Tian], Huang, J.J.[Jun-Jie], Deng, J.K.[Jian-Kang], Huang, G.[Guan], Du, D.L.[Da-Long], Lu, J.W.[Ji-Wen], Zhou, J.[Jie],
Gait Recognition in the Wild: A Benchmark,
ICCV21(14769-14779)
IEEE DOI
Dataset, Gait Recognition.
WWW Link. Biometrics, Datasets and evaluation, Emergency Reviewer


Baseline Algorithm and Performance for Gait Based Human ID Challenge Problem,
2004, USF.
WWW Link. Dataset, Gait. Code, Gait.


Seely, R.D.[Richard D.], Samangooei, S.[Sina], Middleton, L.[Lee], Carter, J.N.[John N.], Nixon, M.S.[Mark S.],
The University of Southampton Multi-Biometric Tunnel and introducing a novel 3D gait dataset,
BTAS08(1-6).
IEEE DOI
Dataset, Gait Recognition.


CMU Graphics Lab Motion Capture Database,
2004.
WWW Link. Dataset, Motion Capture. Code, Motion Capture. 2000+ examples of motion capture data. Includes some software.


Human3.6M,
Online2014.
WWW Link. Or the original:
WWW Link. Dataset, Motion Capture. Dataset, Human Actions. 3.6 Million human poses, various people, various actions. For the description:
See also Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.


Voisard, C.[Cyril], de l'Escalopier, N.[Nicolas], Moreau, A.[Albane], Vienne-Jumeau, A.[Alienor], Ricard, D.[Damien], Oudre, L.[Laurent],
A Reference Data Set for the Study of Healthy Subject Gait with Inertial Measurements Units,
IPOL(13), 2023, pp. 314-320.
DOI Link
Dataset, Gait.


Hofmann, M.[Martin], Geiger, J.[Jürgen], Bachmann, S.[Sebastian], Schuller, B.[Björn], Rigoll, G.[Gerhard],
The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits,
JVCIR(25), No. 1, 2014, pp. 195-206.
Elsevier DOI
Dataset, Gait. Gait recognition


Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.,
HMDB: A large video database for human motion recognition,
ICCV11(2556-2563).
IEEE DOI
Dataset, Action Recognition. The internet has billions of videos, most recognition datasets have a dozen. The dataset itself:
See also HMDB: a large human motion database.


Edinburgh Ceilidh Overhead Video Data,
Dataset, Dance.
WWW Link.
16 ground-truthed dances viewed from overhead, where the 10 dancers follow a structured dance pattern (2 different dances). The dances are in the Scottish Ceilidh style (somewhat similar to American Square Dancing).


Gorelick, L.[Lena], Blank, M.[Moshe], Shechtman, E.[Eli], Irani, M.[Michal], Basri, R.[Ronen],
Actions as Space-Time Shapes,
PAMI(29), No. 12, December 2007, pp. 2247-2253.
IEEE DOI
Dataset, Actions.
HTML Version.
Earlier: A2, A1, A3, A4, A5: ICCV05(II: 1395-1402).
IEEE DOI Award, Helmholtz Prize.
Human action as 3-D shapes induced by silhouettes in the spacetime volume.


Li, R.H.[Rong-Hui], Zhao, J.[Junfan], Zhang, Y.[Yachao], Su, M.Y.[Ming-Yang], Ren, Z.[Zeping], Zhang, H.[Han], Tang, Y.S.[Yan-Song], Li, X.[Xiu],
FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation,
ICCV23(10200-10209)
IEEE DOI
Dataset, Dance.


de la Torre-Frade, F.[Fernando], Hodgins, J.K.[Jessica K.], Bargteil, A.W.[Adam W.], Artal, X.M.[Xavier Martin], Macey, J.C.[Justin C.], Collado I Castells, A.[Alexandre], and Beltran, J.[Josep],
Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database,
CMU-RI-TR-08-22, April, 2008.
WWW Link. Dataset, Activity Recognition.


Liu, J.[Jun], Shahroudy, A.[Amir], Perez, M.[Mauricio], Wang, G.[Gang], Duan, L.Y.[Ling-Yu], Kot, A.C.[Alex C.],
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding,
PAMI(42), No. 10, October 2020, pp. 2684-2701.
IEEE DOI
WWW Link. Or:
WWW Link.
Dataset, Human Activity. Benchmark testing, Cameras, Deep learning, Semantics, Lighting, Skeleton, Activity understanding, large-scale benchmark


Shahroudy, A.[Amir], Liu, J., Ng, T.T.[Tian-Tsong], Wang, G.,
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis,
CVPR16(1010-1019)
IEEE DOI
Dataset, Human Activity.


FCVID: Fudan-Columbia Video Dataset,

WWW Link. Dataset, Activity Recognition. 90,000+ videos, manually annotated for 239 categories. Human activities.


Ben-Shabat, Y.Z.[Yi-Zhak], Yu, X.[Xin], Saleh, F.[Fatemeh], Campbell, D.[Dylan], Rodriguez-Opazo, C.[Cristian], Li, H.D.[Hong-Dong], Gould, S.[Stephen],
The IKEA ASM Dataset: Understanding People Assembling Furniture through Actions, Objects and Pose,
WACV21(846-858)
IEEE DOI
WWW Link.
Dataset, Activity Recognition. Deep learning, Annotations, Pose estimation, Object segmentation, Benchmark testing


Corona, K.[Kellie], Osterdahl, K.[Katie], Collins, R.[Roderic], Hoogs, A.J.[Anthony J.],
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection,
WACV21(1059-1067)
IEEE DOI
Dataset, Activity Detection. Solid modeling, Visualization, Annotations, NIST, Cameras
See also Multiview Extended Video with Activities.


Damen, D.[Dima], Doughty, H.[Hazel], Farinella, G.M.[Giovanni Maria], Furnari, A.[Antonino], Kazakos, E.[Evangelos], Ma, J.[Jian], Moltisanti, D.[Davide], Munro, J.[Jonathan], Perrett, T.[Toby], Price, W.[Will], Wray, M.[Michael],
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100,
IJCV(130), No. 1, January 2022, pp. 33-55.
Springer DOI
Dataset, Egocentric Actions.


Damen, D.[Dima], Doughty, H.[Hazel], Farinella, G.M.[Giovanni Maria], Fidler, S.[Sanja], Furnari, A.[Antonino], Kazakos, E.[Evangelos], Moltisanti, D.[Davide], Munro, J.[Jonathan], Perrett, T.[Toby], Price, W.[Will], Wray, M.[Michael],
Scaling Egocentric Vision: The Epic Kitchens Dataset,
ECCV18(II: 753-771).
Springer DOI
Dataset, Egocentric Actions.


Delgado, K.[Kevin], Origgi, J.M.[Juan Manuel], Hasanpoor, T.[Tania], Yu, H.[Hao], Allessio, D.[Danielle], Arroyo, I.[Ivon], Lee, W.[William], Betke, M.[Margrit], Woolf, B.[Beverly], Bargal, S.A.[Sarah Adel],
Student Engagement Dataset,
ABAW21(3621-3629)
IEEE DOI
Dataset, Classrooms. Training, Deep learning, Visualization, Head, Distance learning, Time series analysis


Edinburgh office monitoring video dataset,
2021.
WWW Link. Dataset, Office Monitor.
This dataset consists of video, image frames, and ground truth for 20 days of monitoring people in 4 different offices. The data is acquired using a fixed camera as a set of 1280*720 pixel color images captured at an average of about 1 FPS. This dataset is interesting because there are about 450K labeled frames of people doing standard office activities. The ground truth is the position of each person in each image with a bounding box, plus their behavior. Four behaviors are annotated (standing/walking, sitting, two or three people are talking, or the person in room has fallen). Paper to appear CVPR21.


Zhang, J.[Jing], Li, W.Q.[Wan-Qing], Ogunbona, P.O.[Philip O.], Wang, P.[Pichao], Tang, C.[Chang],
RGB-D-based action recognition datasets: A survey,
PR(60), No. 1, 2016, pp. 86-105.
Elsevier DOI
Dataset, Action Recognition. Action recognition


Laptev, I.[Ivan], Caputo, B.[Barbara], Schuldt, C.[Christian], Lindeberg, T.[Tony],
Local velocity-adapted motion events for spatio-temporal recognition,
CVIU(108), No. 3, December 2007, pp. 207-229.
Elsevier DOI

Earlier: A3, A1, A2, Only:
Recognizing human actions: a local SVM approach,
ICPR04(III: 32-36).
IEEE DOI
Dataset, Actions.
WWW Link. Motion; Local features; Motion descriptors; Matching; Velocity adaptation; Action recognition; Learning; SVM


Penn Action Dataset,
2013.
WWW Link. Dataset, Facial Landmarks.
See also From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding.


Li, W.H.[Wen-Hui], Wong, Y.K.[Yong-Kang], Liu, A.A.[An-An], Li, Y.[Yang], Su, Y.T.[Yu-Ting], Kankanhalli, M.[Mohan],
Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking,
WACV17(187-196)
IEEE DOI
Dataset, Action Recognition.
HTML Version. Multi-Camera Action Dataset (MCAD). Benchmark testing, Cameras, Heuristic algorithms, Internet, Robustness, Surveillance


Barekatain, M., Martí, M., Shih, H.F., Murray, S., Nakayama, K., Matsuo, Y., Prendinger, H.,
Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection,
PETS17(2153-2160)
IEEE DOI
Dataset, Okutama-Action. Cameras, Data collection, Mobile communication, Surveillance, Training, Video, sequences


Zhao, H., Torralba, A., Torresani, L., Yan, Z.,
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization,
ICCV19(8667-8677)
IEEE DOI
WWW Link. Dataset, Human Actions. image classification, image motion analysis, image segmentation, learning (artificial intelligence), video signal processing, YouTube


Kong, Q., Wu, Z., Deng, Z., Klinkigt, M., Tong, B., Murakami, T.,
MMAct: A Large-Scale Dataset for Cross Modal Human Action Understanding,
ICCV19(8657-8666)
IEEE DOI
Dataset, Human Actions. image colour analysis, image motion analysis, image recognition, video signal processing, RGB videos, Task analysis


Ofli, F., Chaudhry, R., Kurillo, G., Vidal, R., Bajcsy, R.,
Berkeley MHAD: A comprehensive Multimodal Human Action Database,
WACV13(53-60).
IEEE DOI
Dataset, Human Actions.


Ji, Y.L.[Yan-Li], Yang, Y., Shen, F., Shen, H.T., Zheng, W.S.,
Arbitrary-View Human Action Recognition: A Varying-View RGB-D Action Dataset,
CirSysVideo(31), No. 1, January 2021, pp. 289-300.
IEEE DOI
Dataset, Action Recognition. Skeleton, Sensors, Videos, Dictionaries, Robots, HRI


Vaquette, G., Orcesi, A., Lucat, L., Achard, C.,
The DAily Home LIfe Activity Dataset: A High Semantic Activity Dataset for Online Recognition,
FG17(497-504)
IEEE DOI
Dataset, Smart Home. Cameras, Databases, Protocols, Semantics, Sensors, Skeleton, Videos


Ragusa, F.[Francesco], Furnari, A.[Antonino], Livatino, S.[Salvatore], Farinella, G.M.[Giovanni Maria],
The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain,
WACV21(1568-1577)
IEEE DOI
WWW Link.
Dataset, Interactions. Taxonomy, Motorcycles, Object detection, Tools, Object recognition


UDIVA Dataset,
2021
WWW Link. Dataset, Social Interaction. Non-acted datasetof face-to-face dyadic interactions. WACV Paper.
See also Context-Aware Personality Inference in Dyadic Scenarios: Introducing the UDIVA Dataset.


Gong, W.J.[Wen-Juan], Gonzàlez, J.[Jordi], Tavares, J.M.R.S.[João Manuel R.S.], Xavier Roca, F.,
A New Image Dataset on Human Interactions,
AMDO12(204-209).
Springer DOI
Dataset, Action Recognition.


Burger, S.[Susanne],
The CHIL RT07 Evaluation Data,
MTPH07(xx-yy).
Springer DOI
Dataset, Activity Recogniton.


Truong, C.[Charles], Atiq, M.[Mounir], Minvielle, L.[Ludovic], Serra, R.[Renan], Mougeot, M.[Mathilde], Vayatis, N.[Nicolas],
A Data Set for Fall Detection with Smart Floor Sensors,
IPOL(13), 2023, pp. 183-197.
DOI Link
Dataset, Fall Detection.


Fouhey, D.F., Kuo, W., Efros, A.A., Malik, J.,
From Lifestyle VLOGs to Everyday Interactions,
CVPR18(4991-5000)
IEEE DOI
Dataset, Action.
HTML Version. Videos, YouTube, Task analysis, Cameras, Internet, Benchmark testing


CVBASE Annotated Video Data,
2006.
HTML Version. Dataset, Video.


Olympic Sports Dataset,
2010
WWW Link. Dataset, Sports. The Olympic Sports Dataset contains videos of athletes practicing different sports. We have obtained all video sequences from YouTube and annotated their class label with the help of Amazon Mechanical Turk. Refer to:
See also Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification.


UCF Sports Action Dataset,
2008.
WWW Link. Details:
WWW Link. A large set of sports actions. Dataset, Sports. Note that many of the other non UCF links to data on that page are out of date.


LHI Sports Activity Dataset,
Subset of larger dataset. Online2008
HTML Version. Dataset, Sports.
See also Lotus Hill Institute.


MEXaction2 action detection and localization dataset,
2015.
WWW Link. Dataset, Actions. The aim of the MEXaction2 dataset is to support the development and evaluation of methods for spotting instances of short actions in a relatively large video database. Actions: BullChargeCape (1324) and HorseRiding (651).


Zhang, W.C.[Wei-Chen], Liu, Z.G.[Zhi-Guang], Zhou, L.Y.[Liu-Yang], Leung, H.[Howard], Chan, A.B.[Antoni B.],
Martial Arts, Dancing and Sports dataset: A challenging stereo and multi-view dataset for 3D human pose estimation,
IVC(61), No. 1, 2017, pp. 22-39.
Elsevier DOI
Dataset, Human Activities. Human pose estimation


Zalluhoglu, C.[Cemil], Ikizler-Cinbis, N.[Nazli],
Collective Sports: A multi-task dataset for collective activity recognition,
IVC(94), 2020, pp. 103870.
Elsevier DOI
Dataset, Sports. Collective activity recognition, Action recognition, Convolutional neural networks, Multi-task learning, LSTM


Li, Y.X.[Yi-Xuan], Chen, L.[Lei], He, R.[Runyu], Wang, Z.Z.[Zhen-Zhi], Wu, G.S.[Gang-Shan], Wang, L.M.[Li-Min],
MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions,
ICCV21(13516-13525)
IEEE DOI
Dataset, Sports. Location awareness, Annotations, Error analysis, Benchmark testing, Complexity theory, Standards, Action and behavior recognition, Datasets and evaluation


Zhang, C.L.[Chuan-Lei], Liu, L.X.[Li-Xin], Yao, M.[Minda], Chen, W.[Wei], Chen, D.F.[Du-Feng], Wu, Y.L.[Yu-Liang],
HSiPu2: A New Human Physical Fitness Action Dataset for Recognition and 3D Reconstruction Evaluation,
VOCVALC21(3166-3175)
IEEE DOI
Dataset, Physical Fitness. Support vector machines, Solid modeling,


Setti, F.[Francesco], Conigliaro, D.[Davide], Rota, P.[Paolo], Bassetti, C.[Chiara], Conci, N.[Nicola], Sebe, N.[Nicu], Cristani, M.[Marco],
The S-Hock dataset: A new benchmark for spectator crowd analysis,
CVIU(159), No. 1, 2017, pp. 47-58.
Elsevier DOI
Dataset, Crowd Analysis.
Earlier: A2, A3, A1, A4, A5, A6, A7:
The S-HOCK dataset: Analyzing crowds at the stadium,
CVPR15(2039-2047)
IEEE DOI
Spectator, monitoring


Ali, S.[Saad], Shah, M.[Mubarak],
Floor Fields for Tracking in High Density Crowd Scenes,
ECCV08(II: 1-14).
Springer DOI
PDF File.
Dataset, Tracking.
WWW Link.


Ali, S.[Saad], Shah, M.[Mubarak],
A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis,
CVPR07(1-6).
IEEE DOI
PDF File. Dataset, Surveillance. The dataset for this paper is available:
WWW Link. UCF Lists:
WWW Link. But no link to data.


Ali, S.[Saad],
Crowd Flow Segmentation and Stability Analysis,
Online2007
HTML Version. The more general discussion of the issues of the other papers. Includes a more complete dataset and pointers to other useful code. Dataset, Surveillance.
WWW Link.


Multimodal Meme Classification Identifying Offensive Content in Image and Text,
2019.
WWW Link. Dataset, Offensive Images. MultOFF Dataset.


Cheng, M.[Ming], Cai, K.J.[Kun-Jing], Li, M.[Ming],
RWF-2000: An Open Large Scale Video Database for Violence Detection,
ICPR21(4183-4190)
IEEE DOI
Dataset, Violence. Image motion analysis, Databases, Surveillance, Logic gates, Cameras


Ntalampiras, S.[Stavros], Arsic, D.[Dejan], Hofmann, M.[Martin], Andersson, M.[Maria], Ganchev, T.[Todor],
PROMETHEUS: heterogeneous sensor database in support of research on human behavioral patterns in unrestricted environments,
SIViP(8), No. 7, October 2014, pp. 1211-1231.
Springer DOI
Dataset, Human Activity.


IAUFD: A 100k images dataset for automatic football image/video analysis,
2022.
WWW Link. Dataset, Event Detection. Dataset, Sports.


Penate-Sanchez, A.[Adrian], Freire-Obregón, D.[David], Lorenzo-Melián, A.[Adrián], Lorenzo-Navarro, J.[Javier], Castrillón-Santana, M.[Modesto],
TGC20ReId: A dataset for sport event re-identification in the wild,
PRL(138), 2020, pp. 355-361.
Elsevier DOI
Dataset, Sports. Sport, Re-identification, Dataset


Abrams, A.[Austin], Tucek, J.[Jim], Little, J.[Joshua], Jacobs, N.[Nathan], Pless, R.[Robert],
LOST: Longterm Observation of Scenes (with Tracks),
WACV12(297-304).
IEEE DOI
Using the data, same half hour every day. Dataset, Surveillance.


Rebecq, H.[Henri], Ranftl, R.[René], Koltun, V.[Vladlen], Scaramuzza, D.[Davide],
High Speed and High Dynamic Range Video with an Event Camera,
PAMI(43), No. 6, June 2021, pp. 1964-1980.
IEEE DOI

Earlier:
Events-To-Video: Bringing Modern Computer Vision to Event Cameras,
CVPR19(3852-3861).
IEEE DOI
Code, HDR. Dataset, HDR. Dataset, E2VID.
HTML Version. Image reconstruction, Cameras, Streaming media, Dynamic range, Brightness, Heuristic algorithms, high dynamic range


DAVIS: Densely Annotated VIdeo Segmentation,
WWW Link.
2017. Dataset, Video Segmentation. For the competition at CVPR 2017.


Video Instance Segmentation - YouTube-VOS,
WWW Link.
Dataset, Video Segmentation. Dataset for video instance segmentation.


Video Instance Segmentation - YouTube-VOS,
WWW Link.
Dataset, Video Segmentation. Dataset for video instance segmentation. And related to Youtube-VIS.


Qi, J.Y.[Ji-Yang], Gao, Y.[Yan], Hu, Y.[Yao], Wang, X.G.[Xing-Gang], Liu, X.Y.[Xiao-Yu], Bai, X.[Xiang], Belongie, S.[Serge], Yuille, A.L.[Alan L.], Torr, P.H.S.[Philip H.S.], Bai, S.[Song],
OVIS: Occluded Video Instance Segmentation,
Online2021. WWW Link.
Dataset, Video Segmentation. Designed with the philosophy of perceiving object occlusions in videos, which could reveal the complexity and the diversity of real-world scenes.


Perazzi, F.[Federico], Pont-Tuset, J., McWilliams, B., Van Gool, L.J., Gross, M., Sorkine-Hornung, A.[Alexander],
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation,
CVPR16(724-732)
IEEE DOI
Dataset, Video Segmentation.


Change Detection Benchmark Website,
2012 Dataset, Motion Detection.
WWW Link. Dataset for the 2012 Change Detection workshop at CVPR.


Scene Background Initialization (SBI) Dataset,
2016
HTML Version. Dataset, Background.
14 sequences with ground truth.
See also Towards Benchmarking Scene Background Initialization.


Mahmood, M.H.[Muhammad Habib], Díez, Y.[Yago], Salvi, J.[Joaquim], Lladó, X.[Xavier],
A collection of challenging motion segmentation benchmark datasets,
PR(61), No. 1, 2017, pp. 1-14.
Elsevier DOI
Dataset, Motion Segmentation. Motion segmentation


Mahmood, M.H.[Muhammad Habib], Zappella, L.[Luca], Díez, Y.[Yago], Salvi, J.[Joaquim], Lladó, X.[Xavier],
A New Trajectory Based Motion Segmentation Benchmark Dataset (UdG-MS15),
IbPRIA15(463-470).
Springer DOI
Dataset, Motion Segmentation.


Cuevas, C.[Carlos], Yáñez, E.M.[Eva María], García, N.[Narciso],
Labeled dataset for integral evaluation of moving object detection algorithms: LASIESTA,
CVIU(152), No. 1, 2016, pp. 103-117.
Elsevier DOI
Dataset, Foreground Detection. Database


Vacavant, A.[Antoine], Chateau, T.[Thierry], Wilhelm, A.[Alexis], Lequièvre, L.[Laurent],
A Benchmark Dataset for Outdoor Foreground/Background Extraction,
BMC12(I:291-300).
Springer DOI
Dataset, Foreground Extraction. Surveillance applications.


Image Stitching Database,
2010
HTML Version. Dataset, Image Stitching.


Richter, S.R.[Stephan R.], Hayder, Z.[Zeeshan], Koltun, V.[Vladlen],
Playing for Benchmarks,
ICCV17(2232-2241)
IEEE DOI
Dataset, Video. image annotation, image resolution, image segmentation, image sequences, object detection, object tracking,


Stottinger, J.[Julian], Zambanini, S.[Sebastian], Khan, R.[Rehanullah], Hanbury, A.[Allan],
FeEval A Dataset for Evaluation of Spatio-temporal Local Features,
ICPR10(499-502).
IEEE DOI
Dataset, Motion.


Avola, D., Cinque, L., Foresti, G.L., Martinel, N., Pannone, D., Piciarelli, C.,
A UAV Video Dataset for Mosaicking and Change Detection From Low-Altitude Flights,
SMCS(50), No. 6, June 2020, pp. 2139-2149.
IEEE DOI
Dataset, Change Detection. Video sequences, Change detection algorithms, Cameras, Detection algorithms, Task analysis, Telemetry, unmanned aerial vehicle (UAV)


SuperTex136,
2016
WWW Link. Dataset, Superresolution. Refer to:
See also Jointly Optimized Regressors for Image Super-resolution.


Set5, Set14, Urban 100, BSD 100, Sun-Hays 80 Datasets,
Dataset, Super Resolution. Linkd from:
WWW Link.


Wang, Y.Q.[Ying-Qian], Wang, L.G.[Long-Guang], Yang, J.G.[Jun-Gang], An, W.[Wei], Guo, Y.L.[Yu-Lan],
Flickr1024: A Large-Scale Dataset for Stereo Image Super-Resolution,
CLI19(3852-3857)
IEEE DOI
Dataset, Flickr. Dataset, Super Resolution.
WWW Link. cameras, data acquisition, image resolution, stereo image processing, large-scale stereo dataset, super resolution


Tulyakov, S.[Stepan], Gehrig, D.[Daniel], Georgoulis, S.[Stamatios], Erbach, J.[Julius], Gehrig, M.[Mathias], Li, Y.[Yuanyou], Scaramuzza, D.[Davide],
Time Lens: Event-based Video Frame Interpolation,
CVPR21(16150-16159)
IEEE DOI
HTML Version.
Code, Frame Interpolation. Dataset, Frame Interpolation. Interpolation, Visualization, Image color analysis, Benchmark testing, Cameras, Sensors, Pattern recognition


Xiao, J.X.[Jian-Xiong], Owens, A.[Andrew], Torralba, A.B.[Antonio B.],
SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels,
ICCV13(1625-1632)
IEEE DOI
Dataset, Scene Understanding.
WWW Link. RGB-D Video dataset. Camera pose and object labels. Interactive reconstruction process.


Shugrina, M.[Maria], Liang, Z.H.[Zi-Heng], Kar, A.[Amlan], Li, J.[Jiaman], Singh, A.[Angad], Singh, K.[Karan], Fidler, S.[Sanja],
Creative Flow+ Dataset,
CVPR19(5379-5388).
IEEE DOI
Dataset, Optical Flow.
WWW Link. Video dataset richly labeled with per-pixel optical flow, occlusions, correspondences, segmentation labels, normals, and depth.


Mayer, N.[Nikolaus], Ilg, E.[Eddy], Häusser, P.[Philip], Fischer, P.[Philipp], Cremers, D.[Daniel], Dosovitskiy, A.[Alexey], Brox, T.[Thomas],
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation,
CVPR16(4040-4048)
IEEE DOI
Dataset, Optical Flow.


Baker, S.[Simon], Scharstein, D.[Daniel], Lewis, J.P., Roth, S.[Stefan], Black, M.J.[Michael J.], Szeliski, R.S.[Richard S.],
A Database and Evaluation Methodology for Optical Flow,
IJCV(92), No. 1, March 2011, pp. 1-31.
Springer DOI

Earlier: A1, A4, A2, A5, A3, A6: ICCV07(1-8).
IEEE DOI
Dataset, Optical Flow.
WWW Link.


Song, H.O., Xiang, Y., Jegelka, S.[Stefanie], Savarese, S.[Silvio],
Deep Metric Learning via Lifted Structured Feature Embedding,
CVPR16(4004-4012)
IEEE DOI
Stanford Online Products. Dataset, Products.


Nascimento, S.M.C., Ferreira, F., and Foster, D.H.,
Statistics of spatial cone-excitation ratios in natural scenes,
JOSA-A(19), No. 8, August 2002, pp. 1484-1490.
PDF File. Dataset, Hyperspectral.
HTML Version.


Foster, D.H., Nascimento, S.M.C., Amano, K.,
Information limits on neural identification of coloured surfaces in natural scenes,
Visual Neuroscience(21), 2004, pp. 331-336.
PDF File. Dataset, Hyperspectral.
HTML Version.


Cerra, D.[Daniele], Pato, M.[Miguel], Alonso, K.[Kevin], Köhler, C.[Claas], Schneider, M.[Mathias], de los Reyes, R.[Raquel], Carmona, E.[Emiliano], Richter, R.[Rudolf], Kurz, F.[Franz], Reinartz, P.[Peter], Müller, R.[Rupert],
DLR HySU: A Benchmark Dataset for Spectral Unmixing,
RS(13), No. 13, 2021, pp. xx-yy.
DOI Link
Dataset, Unmixing.


Bossard, L.[Lukas], Guillaumin, M.[Matthieu], Van Gool, L.J.[Luc J.],
Food-101: Mining Discriminative Components with Random Forests,
ECCV14(VI: 446-461).
Springer DOI
Dataset, Food. 101 food categories, with 101’000 images recognizing pictured dishes.


Wang, X.H.[Xiao-Han], Eliott, F.M.[Fernanda M.], Ainooson, J.[James], Palmer, J.H.[Joshua H.], Kunda, M.[Maithilee],
An Object is Worth Six Thousand Pictures: The Egocentric, Manual, Multi-image (EMMI) Dataset,
Egocentric17(2364-2372)
IEEE DOI
WWW Link.
Dataset, Learning. Egocentric, Manual, Multi-Image (EMMI) Dataset. Automobiles, Cameras, Manuals, Object recognition, Toy manufacturing industry, Training, Visualization


Agarwal, S.[Shivani], Awan, A.[Aatif], and Roth, D.[Dan],
Learning to Detect Objects in Images via a Sparse, Part-Based Representation,
PAMI(26), No. 11, November 2004, pp. 1475-1490.
IEEE Abstract. Or:
PDF File.
WWW Link.
Dataset, Vehicles. Detecting specific object classes (e.g. cars).


Messikommer, N.[Nico], Gehrig, D.[Daniel], Loquercio, A.[Antonio], Scaramuzza, D.[Davide],
Event-based Asynchronous Sparse Convolutional Networks,
ECCV20(VIII:415-431).
Springer DOI
WWW Link. Code, Semantic Segmentation.
WWW Link. Dataset, Semantic Segmentation.


Borji, A.[Ali], Izadi, S.[Saeed], Itti, L.[Laurent],
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning,
CVPR16(2221-2230)
IEEE DOI
Dataset, Learning.


300 Videos in the Wild,
2015 Dataset, Faces.
WWW Link. Used for the ICCV 2015 workshop challenge.


WIDER Attribute dataset,
2016.
WWW Link. Dataset, Faces.
See also Human Attribute Recognition by Deep Hierarchical Contexts.


Description of the Collection of Facial Images,
2007 Dataset, Faces.
HTML Version. Essex collection of faces. 395 people, 20 images each.


Annotated Facial Dataset,
2007 Dataset, Faces.
WWW Link.


The CMU Multi-PIE Face Database,
2010 Dataset, Faces.
WWW Link. It contains 337 subjects, captured under 15 view points and 19 illumination conditions in four recording sessions for a total of more than 750,000 images.


FaceScrub Annotated Face Dataset,
2014 Dataset, Faces.
HTML Version. 100,000 images of 530 people. Acquired from internet search with rejection of pictures that do not match.
See also data-driven approach to cleaning large face datasets, A.


GVVPerfcapEva Repository of Evaluation Data Sets,
2015 Dataset, Faces. Dataset, Human Motion. Dataset, Hand Tracking.
WWW Link. A set of dataset including:
GVVPerfCapEva: IDT - Full body skeletal motion capture results from from 
 body-worn inertial sensor data and depth camera recordings
GVVPerfCapEva: Dexter 1: Evaluation data set for 3D hand tracking with 
 depth and multi-view video data
GVVPerfCapEva: PDT 2013: Body shape estimation and real-time motion 
 capture with a depth camera
GVVPerfcapEva: BinoCap - Dense 3D full-body performance capture with 
 handheld stereo cameras (single + multiple person(s))
GVVPerfcapEva: MonFacecCap - Monocular dense face performance capture
GVVPerfCapEva: MVIC - markerless multi-view performance capture of 
 multiple interacting characters
GVVPerfCapEva: HKIC: Performance capture of interacting characters with 
 handheld Kinects


MPII Human Shape,
2015 Dataset, Human Pose.
WWW Link. Expressive 3D human body shape models and tools for human shape space building.


UB KinFace Database,
2011 Dataset, Faces.
HTML Version.


Yale Face Database,
Online2006. First is 165 images.
HTML Version. And 5760 single light source images of 10 subjects each seen under 576 viewing conditions
HTML Version. Dataset, Faces.


The University of Oulu Physics-Based Face Database,
2000. 125 different faces each in 16 different camera calibration and illumination conditions.
WWW Link. Dataset, Faces.


The University of Oulu Face Video Database,
2002.
WWW Link. Dataset, Faces.


The CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations,
2004. 9,594 images of 1040 individuals (595 males and 445 females) with varying Pose, Expression, Accessory, and Lighting
HTML Version. Dataset, Faces.


MIT Face Recognition Database,
Online2000 Fi Dataset, Faces.
HTML Version.
HTML Version. First one is small (19X19) images. Second one has training and test data.


The UMIST Face Database,
1998. Face Recognition.
HTML Version. Dataset, Faces.


NIST Mugshot Identification Database,
2002.
HTML Version. Dataset, Faces.


IARPA Janus Benchmark A (IJB-A) dataset,
2017.
WWW Link. Dataset, Faces.


The ORL Database of Faces,
1992-1994. More recently called the AT&T database.
HTML Version. Dataset, Faces.


PubFig: Public Figures Face Database,
2015 Dataset, Faces.
WWW Link. 58,797 images of 200 people collected from the internet. Refer to:
See also Attribute and simile classifiers for face verification.


Peer, P.[Peter],
CVL Face Database,
Online1999. Dataset, Faces.
HTML Version. 114 people, 7 images each.


POSTECH Face Database,
2001 Dataset, Faces. Dataset, Expressions. Dataset, Gesture.
HTML Version. A variety of datasets for face recognition, expression recognition, gesture recognition, and video surveillance.
See also POSTECH face database (PF07) and performance evaluation, The.


Face Recognition Vendor Test 2006,
Online2006.
WWW Link. Dataset, Faces.
WWW Link. Results in February 2007.


FacePix Database,
Online2009.
WWW Link. Dataset, Faces. 181 poses 1 degree apart plus lighting (direction) changes.
See also Arizona State University.


YouTube Faces DB,
2015 Dataset, Faces.
WWW Link. A database of face videos designed for studying the problem of unconstrained face recognition in videos. The data set contains 3,425 videos of 1,595 different people.


Oxford Town Center,
2009 Dataset, Human Tracking.
WWW Link. Pedestrian detection and tracking.


CHUK Datasets,
2009 Dataset, Pedestrian Tracking. Dataset, Crowd Analysis. Dataset, Pedestrian Detection. Dataset, Re-Identification.
HTML Version. Person search, re-identification


A View From Somewhere (AVFS),
2023 Dataset, Face Similarity.
WWW Link. A dataset of 638,180 human judgments of face similarity.


Jain, V.[Vidit], Learned-Miller, E.G.[Erick G.],
FDDB: Face Detection Data Set and Benchmark,
UMass2010, Technical Report 2010-009.
WWW Link. Dataset, Faces. annotations for 5171 faces in a set of 2845 images. Subset of
See also Labeled faces in the wild: A database for studying face recognition in unconstrained environments.


Huang, G.B., Ramesh, M., Berg, T.L., Learned-Miller, E.G.,
Labeled faces in the wild: A database for studying face recognition in unconstrained environments,
UMass2007, Technical Report 07-49. annotated faces captured from news articles on the web. Dataset, Faces.
WWW Link. Detected using:
See also Robust Real-Time Face Detection.


Phillips, P.J., Moon, H.J., Rizvi, S.A., Rauss, P.J.,
The FERET Evaluation Methodology for Face-Recognition Algorithms,
PAMI(22), No. 10, October 2000, pp. 1090-1104.
IEEE DOI Evaluation, Faces. Dataset, Faces.

Earlier: A1, A2, A4, A3: CVPR97(137-143).
IEEE DOI
PDF File.
Evaluation; data.


Phillips, P.J.[P. Jonathon], Wechsler, H.[Harry], Huang, J.[Jeffery], Rauss, P.J.[Patrick J.],
The FERET Database and Evaluation Procedure for Face-Recognition Algorithms,
IVC(16), No. 5, April 27 1998, pp. 295-306.
Elsevier DOI
Evaluation, Faces. Dataset, Faces.


The FERET Database,
NIST1993.
WWW Link. Dataset, Faces. Old version. For Color --
See also Color FERET Database, The.
See also National Institute of Standards and Technology (NIST) Intelligent Systems Division.


The Color FERET Database,
NISTJanuary 2008.
WWW Link. Dataset, Faces.


Wong, Y.W.[Yee Wan], Ch'ng, S.I.[Sue Inn], Seng, K.P.[Kah Phooi], Ang, L.M.[Li-Minn], Chin, S.W.[Siew Wen], Chew, W.J.[Wei Jen], Lim, K.H.[King Hann],
A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities,
PRL(32), No. 13, 1 October 2011, pp. 1503-1510.
Elsevier DOI
Dataset, Faces. Audio-visual database; Face recognition; Speech recognition; Visual variation


Mavadati, S.M.[S. Mohammad], Mahoor, M.H.[Mohammad H.], Bartlett, K.[Kevin], Trinh, P.[Philip], Cohn, J.F.[Jeffrey F.],
DISFA: A Spontaneous Facial Action Intensity Database,
AffCom(4), No. 2, 2013, pp. 151-160.
IEEE DOI
Dataset, Facial Action. Databases


Zhang, X.[Xing], Yin, L.J.[Li-Jun], Cohn, J.F.[Jeffrey F.], Canavan, S.[Shaun], Reale, M.[Michael], Horowitz, A.[Andy], Liu, P.[Peng], Girard, J.M.[Jeffrey M.],
BP4D-Spontaneous: a high-resolution spontaneous 3D dynamic facial expression database,
IVC(32), No. 10, 2014, pp. 692-706.
Elsevier DOI

Earlier: A1, A2, A3, A4, A5, A6, A7, Only:
A high-resolution spontaneous 3D dynamic facial expression database,
FG13(1-6)
IEEE DOI
Dataset, Facial Expressions. emotion recognition 3D facial expression


Yin, L.J.[Li-Jun], Chen, X.C.[Xiao-Chen], Sun, Y.[Yi], Worm, T.[Tony], Reale, M.[Michael],
A high-resolution 3D dynamic facial expression database,
FG08(1-6).
IEEE DOI
Dataset, Facial Expressions.


Cheema, U.[Usman], Moon, S.[Seungbin],
Sejong face database: A multi-modal disguise face database,
CVIU(208-209), 2021, pp. 103218.
Elsevier DOI
Dataset, Face Recognition. Biometrics, Disguise recognition, Face database, Face recognition, Multi-modal


Poster, D.[Domenick], Thielke, M.[Matthew], Nguyen, R.[Robert], Rajaraman, S.[Srinivasan], Di, X.[Xing], Fondje, C.N.[Cedric Nimpa], Patel, V.M.[Vishal M.], Short, N.J.[Nathaniel J.], Riggan, B.S.[Benjamin S.], Nasrabadi, N.M.[Nasser M.], Hu, S.[Shuowen],
A Large-Scale, Time-Synchronized Visible and Thermal Face Dataset,
WACV21(1558-1567)
IEEE DOI
PDF File.
Dataset, Face Recognition. Heating systems, Protocols, Thermal lensing, Photothermal effects, Cameras, Thermal analysis, Task analysis


Cao, J., Li, Y., Zhang, Z.,
Celeb-500K: A Large Training Dataset for Face Recognition,
ICIP18(2406-2410)
IEEE DOI
Dataset, Face Recognition. Training, Face, Face recognition, Measurement, Learning systems, Performance gain, Face detection, face recognition, face dataset, convolutional neural networks


Whitelam, C., Taborsky, E., Blanton, A., Maze, B., Adams, J., Miller, T., Kalka, N., Jain, A.K., Duncan, J.A., Allen, K., Cheney, J., Grother, P.,
IARPA Janus Benchmark-B Face Dataset,
Biometrics17(592-600)
IEEE DOI
Dataset, Faces. Benchmark testing, Face, Face detection, Face recognition, Media, Protocols, Videos
See also IARPA Janus Benchmark A (IJB-A) dataset.


Kemelmacher-Shlizerman, I., Seitz, S.M., Miller, D., Brossard, E.,
The MegaFace Benchmark: 1 Million Faces for Recognition at Scale,
CVPR16(4873-4882)
IEEE DOI
Dataset, Face Recognition.


Guo, Y.D.[Yan-Dong], Zhang, L.[Lei], Hu, Y.X.[Yu-Xiao], He, X.D.[Xiao-Dong], Gao, J.F.[Jian-Feng],
MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition,
ECCV16(III: 87-102).
Springer DOI
Dataset, Face Recognition.
WWW Link.


Klare, B.F.[Brendan F.], Klein, B.[Ben], Taborsky, E.[Emma], Blanton, A.[Austin], Cheney, J.[Jordan], Allen, K.[Kristen], Grother, P.[Patrick], Mah, A.[Alan], Burge, M.[Mark], Jain, A.K.[Anil K.],
Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus Benchmark A,
CVPR15(1931-1939)
IEEE DOI
Dataset, Face Recognition.


McDuff, D.J.[Daniel J.], el Kaliouby, R.[Rana], Senechal, T.[Thibaud], Amr, M.[May], Cohn, J.F.[Jeffrey F.], Picard, R.W.[Rosalind W.],
Affectiva-MIT Facial Expression Dataset (AM-FED): Naturalistic and Spontaneous Facial Expressions Collected 'In-the-Wild',
AMFG13(881-888)
IEEE DOI
Dataset, Facial Expressions. Facial expressions;dataset


Toderici, G.[George], Evangelopoulos, G.[Georgios], Fang, T.H.[Tian-Hong], Theoharis, T.[Theoharis], Kakadiaris, I.A.[Ioannis A.],
UHDB11 Database for 3D-2D Face Recognition,
PSIVT13(73-86).
Springer DOI
Dataset, Faces.


Colombo, A.[Alessandro], Cusano, C.[Claudio], Schettini, R.[Raimondo],
UMB-DB: A database of partially occluded 3D faces,
BenchFace11(2113-2119).
IEEE DOI
Dataset, Faces.


Somanath, G.[Gowri], Rohith, M.V., Kambhamettu, C.[Chandra],
VADANA: A dense dataset for facial image analysis,
BenchFace11(2175-2182).
IEEE DOI
Dataset, Faces.


Özcan, M.[Mert], Jie, L.[Luo], Ferrari, V.[Vittorio], Caputo, B.[Barbara],
A Large-Scale Database of Images and Captions for Automatic Face Naming,
BMVC11(xx-yy).
HTML Version.
Dataset, Faces.


Gupta, S.[Shalini], Castleman, K.R.[Kenneth R.], Markey, M.K.[Mia K.], Bovik, A.C.[Alan C.],
Texas 3D Face Recognition Database,
Southwest10(97-100).
IEEE DOI
Dataset, Faces.


Bastanfard, A.[Azam], Nik, M.A.[Melika Abbasian], Dehshibi, M.M.[Mohammad Mahdi],
Iranian Face Database with age, pose and expression,
ICMV07(50-55).
IEEE DOI
Dataset, Faces.


Denes, L.J., Metes, P., Liu, Y.,
Hyperspectral Face Database,
CMU-RI-TR-02-25, October, 2002.
WWW Link.
Dataset, Faces.


Kärkkäinen, K.[Kimmo], Joo, J.[Jungseock],
FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age for Bias Measurement and Mitigation,
WACV21(1547-1557)
IEEE DOI
Dataset, Face Recognition.
WWW Link. Training, Social networking (online), Computational modeling, Multimedia Web sites, Decision making, Media


Face Recogniton Home Page,
Online2006.
WWW Link. Code, Face Recognition. Dataset, Faces. Listing of research groups, databases, and vendors.


Face Detection Home Page,
Online2007.
WWW Link. Code, Face Detection. Dataset, Faces. Listing of research groups, databases, and vendors.


BioID Face Database,
2006. Dataset, Faces.
WWW Link.
See also HumanScan, BioID.


Mian, A.S.[Ajmal S.], Bennamoun, M.[Mohammed], Owens, R.A.[Robyn A.],
Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes,
PAMI(28), No. 10, October 2006, pp. 1584-1601.
IEEE DOI
Dataset, 3-D Data.
HTML Version. And
HTML Version.
Earlier:
3D Recognition and Segmentation of Objects in Cluttered Scenes,
WACV05(I: 8-13).
IEEE DOI

And:
Region-based Matching for Robust 3D Face Recognition,
BMVC05(xx-yy).
HTML Version.

And:
Matching Tensors for Pose Invariant Automatic 3D Face Recognition,
SafeSecur05(III: 120-120).
IEEE DOI

Earlier:
Performance analysis of an improved tensor based correspondence algorithm for automatic 3d modeling,
ICIP04(III: 1951-1954).
IEEE DOI

And:
Matching Tensors for Automatic Correspondence and Registration,
ECCV04(Vol II: 495-505).
Springer DOI
Model range data with tensors. Match stored tensor representations.


Min, R.[Rui], Kose, N., Dugelay, J.L.,
KinectFaceDB: A Kinect Database for Face Recognition,
SMCS(44), No. 11, November 2014, pp. 1534-1548.
IEEE DOI
Dataset, Faces, 3-D. face recognition


Equinox: Human Identification at a Distance,
HID. 2006. IR images available. Face Recognition.
HTML Version. Dataset, Faces.
See also Equinox Corporation.


Moschoglou, S., Papaioannou, A., Sagonas, C., Deng, J.K.[Jian-Kang], Kotsia, I., Zafeiriou, S.P.[Stefanos P.],
AgeDB: The First Manually Collected, In-the-Wild Age Database,
FaceWild17(1997-2005)
IEEE DOI
Dataset, Face Age. Databases, Estimation, Face, Face recognition, Machine learning, Protocols


Yu, J.H.[Jian-Hui], Zhu, H.[Hao], Jiang, L.M.[Li-Ming], Loy, C.C.[Chen Change], Cai, W.D.[Wei-Dong], Wu, W.[Wayne],
CelebV-Text: A Large-Scale Facial Text-Video Dataset,
CVPR23(14805-14814)
IEEE DOI
Dataset, Facial Features.


Zhu, H.[Hao], Wu, W.[Wayne], Zhu, W.T.[Wen-Tao], Jiang, L.M.[Li-Ming], Tang, S.W.[Si-Wei], Zhang, L.[Li], Liu, Z.W.[Zi-Wei], Loy, C.C.[Chen Change],
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset,
ECCV22(VII:650-667).
Springer DOI
Dataset, Facial Features.


Jalal, A.[Ahsan], Tariq, U.[Usman],
The LFW-Gender Dataset,
CV4AC16(III: 531-540).
Springer DOI
Dataset, Gender.


Dago-Casas, P.[Pablo], Gonzalez-Jimenez, D.[Daniel], Yu, L.L.[Long Long], Alba-Castro, J.L.[Jose Luis],
Single- and cross- database benchmarks for gender classification under unconstrained settings,
BenchFace11(2152-2159).
IEEE DOI
Dataset, Faces.


MAFL: Multi-Attribute Facial Landmark,
2014.
HTML Version. Dataset, Facial Landmarks.
See also Learning Deep Representation for Face Alignment with Auxiliary Attributes.


Yang, S.[Shuo], Luo, P.[Ping], Loy, C.C.[Chen Change], Tang, X.[Xiaoou],
Faceness-Net: Face Detection through Deep Facial Part Responses,
PAMI(40), No. 8, August 2018, pp. 1845-1859.
IEEE DOI
Detectors, Face, Face detection, Mouth, Neural networks, Proposals, Training, Face detection, convolutional neural network, deep learning
Earlier:
WIDER FACE: A Face Detection Benchmark,
CVPR16(5525-5533)
IEEE DOI
Dataset, Face Detection.
Earlier:
From Facial Parts Responses to Face Detection: A Deep Learning Approach,
ICCV15(3676-3684)
IEEE DOI
Detectors; Face; Face detection; Hair; Mouth; Nose; Proposals


Kostinger, M.[Martin], Wohlhart, P.[Paul], Roth, P.M.[Peter M.], Bischof, H.[Horst],
Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark localization,
BenchFace11(2144-2151).
IEEE DOI
Dataset, Faces, Features.


Schneiderman, H.[Henry], Kanade, T.[Takeo],
A Statistical Method for 3D Object Detection Applied to Faces and Cars,
CVPR00(I: 746-751).
IEEE DOI

And:
A Histogram-based Method for Detection of Faces and Cars,
ICIP00(Vol III: 504-507).
IEEE DOI

And:
Frontal Face Images,
WWW Link. Dataset, Faces. Combined CMU MIT face dataset.


CMU Profile Face Images,
2000.
HTML Version. Dataset, Faces.


Frejlichowski, D.[Dariusz], Tyszkiewicz, N.[Natalia],
The West Pomeranian University of Technology Ear Database: A Tool for Testing Biometric Algorithms,
ICIAR10(II: 227-234).
Springer DOI
Dataset, Biometrics.


O'Toole, A.J.[Alice J.], Harms, J.[Joshua], Snow, S.L.[Sarah L.], Hurst, D.R.[Dawn R.], Pappas, M.R.[Matthew R.], Ayyad, J.H.[Janet H.], Abdi, H.[Herve],
A Video Database of Moving Faces and People,
PAMI(27), No. 5, May 2005, pp. 812-816.
IEEE Abstract.
Dataset, Faces. Face database.


Pandey, P.[Prashant], Tyagi, A.K.[Aayush Kumar], Ambekar, S.[Sameer], Prathosh, A.P.,
Unsupervised Domain Adaptation for Semantic Segmentation of NIR Images Through Generative Latent Search,
ECCV20(VI:413-429).
Springer DOI
Dataset, Segmentation.
WWW Link.


Gu, Q., Wang, G., Chiu, M.T., Tai, Y., Tang, C.,
LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup,
ICCV19(10480-10489)
IEEE DOI
Dataset, Faces.
WWW Link. face recognition, feature extraction, LADN, local adversarial disentangling network, facial makeup, Mouth


Li, S.Z.[Stan Z.], Yi, D.[Dong], Lei, Z.[Zhen], Liao, S.C.[Sheng-Cai],
The CASIA NIR-VIS 2.0 Face Database,
PBVS13(348-353)
IEEE DOI
Dataset, Face Recognition. IR dataset.


Dynamic 2D/3D Speaking Face Dataset with Synchronized Audio,
2019.
HTML Version. Dataset, Lip Reading. Refer to:
See also 3D Visual passcode: Speech-driven 3D facial dynamics for behaviometrics.


Language Independent Lip Reading,
2007.
HTML Version. Dataset, Lip Reading.


OuluVS database,
2009.
WWW Link. Dataset, Lip Reading.


OnMapGaze: A new gaze dataset for map perception modeling,
2024.
WWW Link.
WWW Link. Dataset, Gaze. Code, Gaze. Gaze data collected during the observation of different cartographic backgrounds used in five online map services,


Xu, T.[Tao], Wu, B.[Bo], Bai, Y.Q.[Yu-Qiong], Zhou, Y.[Yun],
RavenGaze: A Dataset for Gaze Estimation Leveraging Psychological Experiment Through Eye Tracker,
FG23(1-6)
IEEE DOI
Dataset, Gaze Tracking. Visualization, Target tracking, Estimation, Psychology, Gesture recognition, Visual databases


He, Q.H.[Qiu-Hai], Hong, X.P.[Xiao-Peng], Chai, X.J.[Xiu-Juan], Holappa, J.[Jukka], Zhao, G.Y.[Guo-Ying], Chen, X.L.[Xi-Lin], Pietikäinen, M.[Matti],
OMEG: Oulu Multi-Pose Eye Gaze Dataset,
SCIA15(418-427).
Springer DOI
Dataset, Gaze.


Hadizadeh, H., Enriquez, M.J., Bajic, I.V.,
Eye-Tracking Database for a Set of Standard Video Sequences,
IP(21), No. 2, February 2012, pp. 898-903.
IEEE DOI
Dataset, Eye Tracking.


Fox, N.A.[Niall A.], O'Mullane, B.A.[Brian A.], Reilly, R.B.[Richard B.],
VALID: A New Practical Audio-Visual Database, and Comparative Results,
AVBPA05(777).
Springer DOI
WWW Link.
Dataset, Faces.


Sharma, P.[Prag], Reilly, R.B.[Richard B.],
The UCD Colour Face Image Database for Face Detection,
Online1998.
WWW Link. Dataset, Faces.


Mollahosseini, A., Hasani, B., Mahoor, M.H.,
AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild,
AffCom(10), No. 1, January 2019, pp. 18-31.
IEEE DOI
Dataset, Facial Expressions. Databases, Computational modeling, Face, Face recognition, Affective computing, Magnetic heads, arousal


Papaioannou, A.[Athanasios], Gecer, B.[Baris], Cheng, S.[Shiyang], Chrysos, G.[Grigorios], Deng, J.K.[Jian-Kang], Fotiadou, E.[Eftychia], Kampouris, C.[Christos], Kollias, D.[Dimitrios], Moschoglou, S.[Stylianos], Songsri-In, K.[Kritaphat], Ploumpis, S.[Stylianos], Trigeorgis, G.[George], Tzirakis, P.[Panagiotis], Ververas, E.[Evangelos], Zhou, Y.X.[Yu-Xiang], Ponniah, A.[Allan], Roussos, A.[Anastasios], Zafeiriou, S.P.[Stefanos P.],
MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis,
ECCV22(VIII:467-484).
Springer DOI
Dataset, Facial Expressions.


CMU Facial Expression Database,
1999 Dataset, Faces. Dataset, Facial Expression.
HTML Version. Includes annotation.


Matuszewski, B.J.[Bogdan J.], Quan, W.[Wei], Shark, L.K.[Lik-Kwan],
High-resolution comprehensive 3-D dynamic database for facial articulation analysis,
BenchFace11(2128-2135).
IEEE DOI
Dataset, Facial Expressions.


Lucey, P.[Patrick], Cohn, J.F.[Jeffrey F.], Prkachin, K.M.[Kenneth M.], Solomon, P.E.[Patricia E.], Matthews, I.[Iain],
Painful data: The UNBC-McMaster shoulder pain expression archive database,
FG11(57-64).
IEEE DOI
Dataset, Facial Expression.


McDuff, D.J.[Daniel J.], Amr, M., el Kaliouby, R.[Rana],
AM-FED+: An Extended Dataset of Naturalistic Facial Expressions Collected in Everyday Settings,
AffCom(10), No. 1, January 2019, pp. 7-17.
IEEE DOI
Dataset, Facial Expressions. Videos, Encoding, Face recognition, Training, Lighting, Task analysis, Databases, Facial expressions, facial action coding system, corpora


Lyons, M.J., Akamatsu, S., Kamachi, M., Gyoba, J.,
Coding Facial Expressions with Gabor Wavelets,
AFGR98(200-205).
IEEE DOI Dataset, Facial Expressions.
HTML Version. 213 images of 7 facial expressions, 10 Japanese female subjects.


Children Spontaneous Facial Expression Video Database (LIRIS-CSE),
2019. Dataset, Facial Expressions.
WWW Link. spontaneous / natural facial expressions of 12 children in diverse settings with variable recording scenarios showing six universal or prototypic emotional expressions (happiness, sadness, anger, surprise, disgust and fear).
See also novel database of children's spontaneous facial expressions (LIRIS-CSE), A.


Yan, W.J.[Wen-Jing], Wu, Q.[Qi], Liu, Y.J.[Yong-Jin], Wang, S.J.[Su-Jing], Fu, X.L.[Xiao-Lan],
CASME database: A dataset of spontaneous micro-expressions collected from neutralized faces,
FG13(1-7)
IEEE DOI
Dataset, Facial Expressions. computer vision


Oulu-CASIA NIR&VIS facial expression database,
2008.
WWW Link. Dataset, Facial Expressions. 6 typical expressions from 80 subjects.


BU-3DFE (Binghamton University 3D Facial Expression) Database,
Dataset, Facial Expressions.
HTML Version.


The AR Face Database,
1998.
HTML Version. Or:
HTML Version. Dataset, Faces.


Sim, T.[Terence], Baker, S., Bsat, M.,
The CMU Pose, Illumination, and Expression Database,
PAMI(25), No. 12, December 2003, pp. 1615-1618.
IEEE Abstract.
Dataset, Faces.
Earlier:
The CMU Pose, Illumination, and Expression (PIE) Database of Human Faces,
AFGR02(46-51).
IEEE DOI
HTML Version.

And: CMU-RI-TR-01-02, January, 2001.
HTML Version.
PDF File.
PS File.
HTML Version.


Gross, R.[Ralph], Matthews, I.[Iain], Cohn, J.F.[Jeffrey F.], Kanade, T.[Takeo], Baker, S.[Simon],
Multi-PIE,
IVC(28), No. 5, May 2010, pp. 807-813.
Elsevier DOI
Dataset, Faces.
Earlier: FG08(1-8).
IEEE DOI
Face database; Face recognition across pose; Face recognition across illumination; Face recognition across expression
See also CMU Pose, Illumination, and Expression Database, The.


Kanade, T.[Takeo], Cohn, J.F.[Jeffrey F.], Tian, Y.L.[Ying-Li],
Comprehensive Database for Facial Expression Analysis,
AFGR00(46-53).
IEEE DOI
Dataset, Faces. Dataset, Expressions.


Wang, S., Liu, Z., Lv, S., Lv, Y., Wu, G., Peng, P., Chen, F., Wang, X.,
A Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference,
MultMed(12), No. 7, 2010, pp. 682-691.
IEEE DOI
Dataset, Facial Expressions.


Matuszewski, B.J.[Bogdan J.], Quan, W.[Wei], Shark, L.K.[Lik-Kwan], McLoughlin, A.S.[Alison S.], Lightbody, C.E.[Catherine E.], Emsley, H.C.A.[Hedley C.A.], Watkins, C.L.[Caroline L.],
Hi4D-ADSIP 3-D dynamic facial articulation database,
IVC(30), No. 10, October 2012, pp. 713-727.
Elsevier DOI
Dataset, Facial Expressions. Facial articulation database; Expression recognition; Facial; Dysfunctions; Facial expression validation


Wang, S.F.[Shang-Fei], Liu, Z.L.[Zhi-Lei], Wang, Z.Y.[Zhao-Yu], Wu, G.B.[Guo-Bing], Shen, P.J.[Pei-Jia], He, S.[Shan], Wang, X.[Xufa],
Analyses of a Multimodal Spontaneous Facial Expression Database,
AffCom(4), No. 1, January 2013, pp. 34-46.
IEEE DOI
Dataset, Expression Recognition.


Baveye, Y., Dellandrea, E., Chamaret, C., Chen, L.M.[Li-Ming],
LIRIS-ACCEDE: A Video Database for Affective Content Analysis,
AffCom(6), No. 1, January 2015, pp. 43-55.
IEEE DOI
Dataset, Affective. copyright


Kossaifi, J.[Jean], Walecki, R.[Robert], Panagakis, Y.[Yannis], Shen, J.[Jie], Schmitt, M.[Maximilian], Ringeval, F.[Fabien], Han, J.[Jing], Pandit, V.[Vedhas], Toisoul, A.[Antoine], Schuller, B.[Björn], Star, K.[Kam], Hajiyev, E.[Elnar], Pantic, M.[Maja],
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild,
PAMI(43), No. 3, March 2021, pp. 1022-1040.
IEEE DOI
Dataset, Emotion. Databases, Tools, Computational modeling, Biological system modeling, Sensors, Affective computing, facial action units


Zhang, Z., Girard, J.M., Wu, Y., Zhang, X., Liu, P., Ciftci, U., Canavan, S., Reale, M., Horowitz, A., Yang, H., Cohn, J.F., Ji, Q., Yin, L.,
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis,
CVPR16(3438-3446)
IEEE DOI
Dataset, Emotion.


Aubrey, A.J.[Andrew J.], Marshall, D.[David], Rosin, P.L.[Paul L.], Vendeventer, J.[Jason], Cunningham, D.W.[Douglas W.], Wallraven, C.[Christian],
Cardiff Conversation Database (CCDb): A Database of Natural Dyadic Conversations,
LV13(277-282)
IEEE DOI
Dataset, Facial Expressions. Conversations; Database; Facial Expressions


Lucey, P.[Patrick], Cohn, J.F.[Jeffrey F.], Kanade, T.[Takeo], Saragih, J.M.[Jason M.], Ambadar, Z.[Zara], Matthews, I.[Iain],
The Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression,
CVPR4HB10(94-101).
IEEE DOI
Dataset, Facial Expressions.


Sapinski, T.[Tomasz], Kaminska, D.[Dorota], Pelikant, A.[Adam], Ozcinar, C.[Cagri], Avots, E.[Egils], Anbarjafari, G.[Gholamreza],
Multimodal Database of Emotional Speech, Video and Gestures,
MIPPSNA18(153-163).
Springer DOI
Dataset, Emotions.


Sneddon, I., McRorie, M., McKeown, G., Hanratty, J.,
The Belfast Induced Natural Emotion Database,
AffCom(3), No. 1, 2012, pp. 32-41.
IEEE DOI
Dataset, Emotions.


OMG-Emotion (One-Minute Gradual-Emotional Behavior),
2018
WWW Link. Dataset, Emotion Recognition. Developed for a challenge. 500+ 1 minute emotion videos.


Petridis, S.[Stavros], Martinez, B.[Brais], Pantic, M.[Maja],
The MAHNOB Laughter database,
IVC(31), No. 2, February 2013, pp. 186-202.
Elsevier DOI
Dataset, Laughter. Laughter; Audiovisual; Thermal; Database; Audiovisual automatic laughter-speech discrimination


Abadi, M.K., Subramanian, R., Kia, S.M., Avesani, P., Patras, I., Sebe, N.,
DECAF: MEG-Based Multimodal Database for Decoding Affective Physiological Responses,
AffCom(6), No. 3, July 2015, pp. 209-222.
IEEE DOI
Dataset, Affective Responses. Databases


Provost, E.M.[Emily Mower], Yuan, S.G.[Shang-Guan], Busso, C.[Carlos],
UMEME: University of Michigan Emotional McGurk Effect Data Set,
AffCom(6), No. 4, October 2015, pp. 395-409.
IEEE DOI
Dataset, Emotion Recognition. Emotion recognition


Yan, J.J.[Jing-Jie], Wang, B.[Bei], Liang, R.Y.[Rui-Yu],
A Novel Bimodal Emotion Database from Physiological Signals and Facial Expression,
IEICE(E101-D), No. 7, July 2018, pp. 1976-1979.
WWW Link.
Dataset, Emotions.


Lee, J.Y.[Ji-Young], Kim, S.R.[Seung-Ryong], Kim, S.[Sunok], Park, J.[Jungin], Sohn, K.H.[Kwang-Hoon],
Context-Aware Emotion Recognition Networks,
ICCV19(10142-10151)
IEEE DOI
Dataset, Emotion Recognition.
WWW Link. emotion recognition, face recognition, feature extraction, image fusion, neural nets, visual scene, boosting manner, Adaptive systems


Ong, D.C.[Desmond C.], Wu, Z.X.[Zheng-Xuan], Tan, Z.X.[Zhi-Xuan], Reddan, M.[Marianne], Kahhale, I.[Isabella], Mattek, A.[Alison], Zaki, J.[Jamil],
Modeling Emotion in Complex Stories: The Stanford Emotional Narratives Dataset,
AffCom(12), No. 3, July 2021, pp. 579-594.
IEEE DOI
Dataset, Emotion. Computational modeling, Hidden Markov models, Affective computing, Biological system modeling, Videos, emotional corpora


Vicol, P.[Paul], Tapaswi, M.[Makarand], Castrejón, L.[Lluís], Fidler, S.[Sanja],
MovieGraphs: Towards Understanding Human-Centric Situations from Videos,
CVPR18(8581-8590)
IEEE DOI
WWW Link. Dataset, Gestures. Videos of social situations to teach robots to understand people. Videos, Motion pictures, Semantics, Natural languages, Face, Automobiles, Legged locomotion


Nguyen, H.[Hung], Kotani, K.[Kazunori], Chen, F.[Fan], Le, B.[Bac],
A Thermal Facial Emotion Database and Its Analysis,
PSIVT13(397-408).
Springer DOI
Dataset, Facial Expression.


Liu, H.Y.[Hai-Yang], Zhu, Z.H.[Zi-Hao], Iwamoto, N.[Naoya], Peng, Y.C.[Yi-Chen], Li, Z.Q.[Zheng-Qing], Zhou, Y.[You], Bozkurt, E.[Elif], Zheng, B.[Bo],
BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis,
ECCV22(VII:612-630).
Springer DOI
Dataset, Emotions.


Wei, H.L.[Hao-Lin], Monaghan, D.S.[David S.], O'Connor, N.E.[Noel E.], Scanlon, P.[Patricia],
A New Multi-modal Dataset for Human Affect Analysis,
HBU14(42-51).
Springer DOI
Dataset, Human Affect.


Miranda-Correa, J.A.[Juan Abdon], Abadi, M.K.[Mojtaba Khomami], Sebe, N.[Nicu], Patras, I.[Ioannis],
AMIGOS: A Dataset for Affect, Personality and Mood Research on Individuals and Groups,
AffCom(12), No. 2, April 2021, pp. 479-493.
IEEE DOI
Dataset, Emotion. Videos, Databases, Mood, Physiology, Electroencephalography, Brain modeling, Electrocardiography, Emotion classification, EEG, affective computing


VGG Pose Datasets,
2013 Dataset, Human Pose.
HTML Version. A collection of several human pose datasets, BBC Pose, YouTube Pose, ChaLearn Pose.


Extended BBC Pose Dataset,
2013 Dataset, Human Pose.
WWW Link. Original BBC Pose plus more. 92 Videos.


FLIC: Frames Labelled in Cinema,
2013 Dataset, Human Pose.
HTML Version.
See also MODEC: Multimodal Decomposable Models for Human Pose Estimation.


Verma, M., Kumawat, S., Nakashima, Y., Raman, S.,
Yoga-82: A New Dataset for Fine-grained Classification of Human Poses,
VUHCS20(4472-4479)
IEEE DOI
Dataset, Homan Pose. Legged locomotion, Wheels, Pose estimation, Visualization, Skeleton, Image resolution


Bourdev, L.[Lubomir], and Malik, J.[Jitendra],
H3D Dataset,
2009. Dataset, Humans.
WWW Link. Annotated human images.


3DHumans: Dataset for Human Body Models,
2023 Dataset, Human Shapes.
WWW Link. he 3DHumans dataset provides around 180 meshes of people in diverse body shapes in various garments styles and sizes.
See also Indian Institute of Technology, Hyderabad.


Nibali, A.[Aiden], Millward, J.[Joshua], He, Z.[Zhen], Morgan, S.[Stuart],
ASPset: An outdoor sports pose video dataset with 3D keypoint annotations,
IVC(111), 2021, pp. 104196.
Elsevier DOI
Dataset, Human Pose. Markerless motion capture, Human pose estimation, Triangulation, Camera calibration


van der Aa, N.P., Luo, X., Giezeman, G.J., Tan, R.T., Veltkamp, R.C.,
UMPM benchmark: A multi-person dataset with synchronized video and motion capture data for evaluation of articulated human motion and interaction,
HICV11(1264-1269).
IEEE DOI
Dataset, Human Pose.


Combettes, S.W.[Sylvain W.], Boniol, P.[Paul], Mazarguil, A.[Antoine], Wang, D.P.[Dan-Ping], Vaquero-Ramos, D.[Diego], Chauveau, M.[Marion], Oudre, L.[Laurent], Vayatis, N.[Nicolas], Vidal, P.P.[Pierre-Paul], Roren, A.[Alexandra], Lefèvre-Colau, M.M.[Marie-Martine],
Arm-CODA: A Data Set of Upper-limb Human Movement During Routine Examination,
IPOL(14), 2024, pp. 1-13.
DOI Link
Dataset, Upper Body Motion.


HandNet Hand Images,
2015 Dataset, Gestures.
WWW Link.
More than 214971 images of 10 different particpants' hands captured by a RealSense RGBD sensor performing random articulations. Annotations include: per pixel classes, 6D fingertip pose, heatmap. Recorded at GIP Lab, Technion.


Aristotle University of Thessaloniki UAV Gesture Dataset,
2022
WWW Link. Dataset, Gestures. Public video dataset for gesture recognition in human-UAV/drone interaction. AUTH UAV Gesture Dataset consists of 4930 videos (resolution: 1920 x 1080), distributed along 6 classes, at 30 frames per second. Both indoors and outdoors settings are included, while 58 different human subjects have been employed for filming the sequences.


Fanelli, G., Gall, J., Romsdorfer, H., Weise, T., Van Gool, L.J.,
A 3-D Audio-Visual Corpus of Affective Communication,
MultMed(12), No. 6, 2010, pp. 591-598.
IEEE DOI
Dataset, Gestures.


Molina, J.[Javier], Pajuelo, J.A.[José A.], Escudero-Viñolo, M.[Marcos], Bescós, J.[Jesús], Martínez, J.M.[José M.],
A natural and synthetic corpus for benchmarking of hand gesture recognition systems,
MVA(25), No. 4, May 2014, pp. 943-954.
Springer DOI
Dataset, Hand Gestures.


Guyon, I.[Isabelle], Athitsos, V.[Vassilis], Jangyodsuk, P.[Pat], Escalante, H.J.[Hugo Jair],
The ChaLearn gesture dataset (CGD 2011),
MVA(25), No. 8, November 2014, pp. 1929-1951.
Springer DOI
Dataset, Gesture.


Materzynska, J., Berger, G., Bax, I., Memisevic, R.,
The Jester Dataset: A Large-Scale Video Dataset of Human Gestures,
Hands19(2874-2882)
IEEE DOI
Dataset, Gestures. convolutional neural nets, gesture recognition, human computer interaction, video signal processing, deep learning


Myanganbayar, B.[Battushig], Mata, C.[Cristina], Dekel, G.[Gil], Katz, B.[Boris], Ben-Yosef, G.[Guy], Barbu, A.[Andrei],
Partially Occluded Hands: A Challenging New Dataset for Single-Image Hand Pose Estimation,
ACCV18(V:85-98).
Springer DOI
Dataset, Hand Pose.
WWW Link.


Bloom, V.[Victoria], Argyriou, V.[Vasileios], Makris, D.[Dimitrios],
Linear latent low dimensional space for online early action recognition and prediction,
PR(72), No. 1, 2017, pp. 532-547.
Elsevier DOI

Earlier: A1, A3, A2:
G3D: A gaming action dataset and real time action recognition evaluation framework,
CVCG12(7-12).
IEEE DOI
Dataset, Gesture Recognition. Action, recognition


Moon, G.[Gyeongsik], Yu, S.I.[Shoou-I], Wen, H.[He], Shiratori, T.[Takaaki], Lee, K.M.[Kyoung Mu],
Interhand2.6m: A Dataset and Baseline for 3d Interacting Hand Pose Estimation from a Single RGB Image,
ECCV20(XX:548-564).
Springer DOI
Dataset, Hand Pose.


Buehler, P.[Patrick], Everingham, M.R.[Mark R.], Huttenlocher, D.P.[Daniel P.], Zisserman, A.[Andrew],
Upper Body Detection and Tracking in Extended Signing Sequences,
IJCV(95), No. 2, November 2011, pp. 180-197.
WWW Link.

Earlier:
Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts,
BMVC08(xx-yy).
PDF File.
PDF File. Data available. Dataset, Sign Language.
HTML Version.


The BANCA Database,
2007.
WWW Link. Dataset, Biometrics.


Soft-Biometric in Surveillance (SoBiS) Dataset,
2017
WWW Link. Dataset, Biometrics. Recorded at Fraunhofer IOSB.


Ortega-Garcia, J., Fierrez-Aguilar, J., Simon, D., Gonzalez, J., Faundez-Zanuy, M., Espinosa, V., Satue, A., Hernaez, I., Igarza, J.J., Vivaracho, C., Escudero, D., Moro, Q.I.,
MCYT baseline corpus: a bimodal biometric database,
VISP(150), No. 6, December 2003, pp. 395-401.
IEEE Abstract.
Dataset, Biometrics.


Fierrez-Aguilar, J.[Julian], Ortega-Garcia, J.[Javier], Toledano, D.T.[Doroteo Torre], Gonzalez-Rodriguez, J.[Joaquin],
Biosec baseline corpus: A multimodal biometric database,
PR(40), No. 4, April 2007, pp. 1389-1392.
Elsevier DOI
Dataset, Biometrics. Multimodal; Biometrics; Authentication; Verification; Database; Performance; Fingerprint; Iris; Face; Voice


Ortega-Garcia, J.[Javier], Fierrez, J.[Julian], Alonso-Fernandez, F.[Fernando], Galbally, J.[Javier], Freire, M.R.[Manuel R.], Gonzalez-Rodriguez, J.[Joaquin], Garcia-Mateo, C.[Carmen], Alba-Castro, J.L.[Jose-Luis], Gonzalez-Agulla, E.[Elisardo], Otero-Muras, E.[Enrique], Garcia-Salicetti, S.[Sonia], Allano, L.[Lorene], Ly-Van, B.[Bao], Dorizzi, B.[Bernadette], Kittler, J.V.[Josef V.], Bourlai, T.[Thirimachos], Poh, N.[Norman], Deravi, F.[Farzin], Ng, M.N.R.[Ming N. R.], Fairhurst, M.C.[Michael C.], Hennebert, J.[Jean], Humm, A.[Andreas], Tistarelli, M.[Massimo], Brodo, L.[Linda], Richiardi, J.[Jonas], Drygajlo, A.[Andrezj], Ganster, H.[Harald], Sukno, F.M.[Federico M.], Pavani, S.K.[Sri-Kaushik], Frangi, A.[Alejandro], Akarun, L.[Lale], Savran, A.[Arman],
The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB),
PAMI(32), No. 6, June 2010, pp. 1097-1111.
IEEE DOI
Dataset, Biometrics. Withing the Europens BioSecure framework. 600 individuals. Acquired over internet, in office, and indoor/outdoor with portable hardware. Audio/Video data, signature, fingerprint.


Santos, G., Fiadeiro, P.T., Proença, H.,
BioHDD: a dataset for studying biometric identification on heavily degraded data,
IET-Bio(4), No. 1, 2015, pp. 1-9.
DOI Link
Dataset, Biometrics. biometrics (access control)


Zhang, Y.H.[Yuan-Han], Yin, Z.F.[Zhen-Fei], Li, Y.D.[Yi-Dong], Yin, G.J.[Guo-Jun], Yan, J.J.[Jun-Jie], Shao, J.[Jing], Liu, Z.W.[Zi-Wei],
Celeba-Spoof: Large-scale Face Anti-spoofing Dataset with Rich Annotations,
ECCV20(XII: 70-85).
Springer DOI
Dataset, Face Anti-Spoofing.


Oliveira, H.P.[Hélder P.], Magalhães, F.[Filipe],
Two Unconstrained Biometric Databases,
ICIAR12(II: 11-19).
Springer DOI
Dataset, Biometrics.


Zafeiriou, S.P.[Stefanos P.], Hansen, M.[Mark], Atkinson, G.A.[Gary A.], Argyriou, V.[Vasileios], Petrou, M.[Maria], Smith, M.L.[Melvyn L.], Smith, L.N.[Lyndon N.],
The Photoface database,
Biometrics11(132-139).
IEEE DOI
Dataset, Faces.


Nizami, H., Adkins-Hill, J.P., Zhang, Y.[Yong], Sullins, J.R., McCullough, C., Canavan, S., Yin, L.J.[Li-Jun],
A biometric database with rotating head videos and hand-drawn face sketches,
BTAS09(1-6).
IEEE DOI
Dataset, Biometrics.


Li, S.Z.[Stan Z.], Lei, Z.[Zhen], Ao, M.[Meng],
The HFB Face Database for Heterogeneous Face Biometrics research,
OTCBVS09(1-8).
IEEE DOI
Dataset, Faces.


Martinho-Corbishley, D., Nixon, M.S.[Mark S.], Carter, J.N.[John N.],
Soft Biometric Retrieval to Describe and Identify Surveillance Images,
ISBA16(xx-xx).
IEEE DOI Dataset, Soft Biometrics. SoBiR Dataset
WWW Link.


Messer, K., Matas, J.G., Kittler, J.V., Luettin, J., Maitre, G.,
XM2VTSDB: The Extended M2VTS Database,
AVBPA99(xx-yy).
WWW Link. Dataset, Biometrics. 4 versions of 295 subjects.


Donida Labati, R., Genovese, A.[Angelo], Piuri, V.[Vincenzo], Scotti, F.[Fabio], Vishwakarma, S.[Sarvesh],
I-SOCIAL-DB: A labeled database of images collected from websites and social media for Iris recognition,
IVC(105), 2021, pp. 104058.
Elsevier DOI
Dataset, Iris. Biometrics, Iris, Web images


UBIRIS database,
2007, Department of Computer Science, University of Beira Interior, Portugal.
WWW Link. Dataset, Iris Images. The enhanced version is available only for the Iris Segmentation Contest. 241 subjects, 1877 images.


CASIA Iris Image Database,
2007, Chinese Academy of Sciences.
HTML Version. Dataset, Iris Images. Various versions. Version 3. 60 subjects, 2400 images.


NIST ICE Iris Image Database,
2007, NIST.
WWW Link. Dataset, Iris Images. 132 subjects, 2953 images. For most recent info:
See also NIST IREX, Iris Exchange Datasets. and also
See also Iris Recognition Database.


Iris Recognition Database,
2007
HTML Version. Dataset, Iris Images. Derived from University of Bath
See also University of of Bath. in association with Smart Sensors Ltd.
See also Smart Sensors Limited. High resolution images, 20 each eye for 800 people.


Iris Recognition Database,
2009
HTML Version. Dataset, Iris Images. ND-IRIS-0405. A superset of ICE2005 and ICE2006 datasets. (
See also NIST ICE Iris Image Database. ) 64,980 iris images from 712 irises of 356 human subjects. From the Notre Dame group.
See also University of Notre Dame. For more updates:
See also NIST IREX, Iris Exchange Datasets.


UTIRIS: University of Tehran IRIS Image Repository,
Online2014
WWW Link. Dataset, Iris Images.
Visible and Infrared.


NIST IREX, Iris Exchange Datasets,
2020
WWW Link. Dataset, Iris.
See also Iris Recognition Database.


Dobeš, M.[Michal], and Machala, L.[Libor],
Iris Database,
Online2006
WWW Link. Dataset, Iris Images. The database used for:
See also Human eye localization using the modified Hough transform.
See also Human Eye Iris Recognition Using the Mutual Information.


Proenca, H.[Hugo], Filipe, S.[Silvio], Santos, R.[Ricardo], Oliveira, J.[Joao], Alexandre, L.A.[Luis A.],
The UBIRIS.v2: A Database of Visible Wavelength Iris Images Captured On-the-Move and At-a-Distance,
PAMI(32), No. 8, August 2010, pp. 1529-1535.
IEEE DOI
Dataset, Iris Recognition.
WWW Link. Visible wavelength, 4-8 meters distance, people moving.


Omelina, L.[Lubos], Goga, J.[Jozef], Pavlovicova, J.[Jarmila], Oravec, M.[Milos], Jansen, B.[Bart],
A survey of iris datasets,
IVC(108), 2021, pp. 104109.
Elsevier DOI
Survey, Iris Reognition. Dataset, Iris Recognition. Biometrics, Iris recognition, Iris datasets, Human iris


Petrovska-Delacretaz, D., Lelandais, S., Colineau, J., Chen, L.M., Dorizzi, B., Ardabilian, M., Krichen, E., Mellakh, M.A., Chaari, A., Guerfi, S., d'Hose, J., Ben Amor, B.[Boulbaba],
The IV2 Multimodal Biometric Database (Including Iris, 2D, 3D, Stereoscopic, and Talking Face Data), and the IV2-2007 Evaluation Campaign,
BTAS08(1-7).
IEEE DOI
Dataset, Iris Recognition.


Maltoni, D.[Davide], Maio, D.[Dario], Jain, A.K.[Anil K.], Prabhakar, S.[Salil],
Handbook of Fingerprint Recognition,
Springer2009. ISBN: 978-1-84882-253-5 Second Edition.
WWW Link.

Earlier: Springer-VerlagNew York, 2003
WWW Link. Survey, Fingerprints. Dataset, Fingerprints. The new edition is greatly expanded. Algorithms, evaluations, sensors, standards, security. Buy this book: Handbook of Fingerprint Recognition


Maio, D., Maltoni, D.[Davide], Cappelli, R.[Raffaele], Wayman, J.L., Jain, A.K.,
FVC2000: Fingerprint Verification Competition,
PAMI(24), No. 3, March 2002, pp. 402-412.
IEEE DOI
Dataset, Fingerprints.
Earlier:
Invited Paper: FVC2000: Fingerprint Verification Competition,
ICPR00(Vol IV: No paper).


Wilson, C.L., Watson, C.I.,
NIST Special Database 4, Fingerprint Database,
NISTIRMarch 1992.
WWW Link. Dataset, Fingerprints.


Wang, Q., Li, S.Y.,
Database of human segmented images and its application in boundary detection,
IET-IPR(6), No. 3, 2012, pp. 222-229.
DOI Link
Dataset, Segmentation.


ADE20K Dataset,
2017. Dataset, Segmentation.
WWW Link. Annotated data,


LHI Segmentation Dataset,
Subset of larger dataset. Online2008
HTML Version. Dataset, Segmentation.
See also Lotus Hill Institute.


The PASCAL Visual Object Classes Challenge 2012,
Online2012 Dataset, Segmentation.
WWW Link. Various PASCAL datasets for different years
See also Pascal: Pattern Analysis, Statistical Modelling and Computational Learning.


COCO: Common Objects in Context,
Online Dataset, Segmentation.
WWW Link. Large-scale object detection, segmentation, and captioning dataset. Used for ECCV 2018 challange:
HTML Version.


DIS5K,
2022 Dataset, Segmentation.
WWW Link. 5,470 high-resolution (e.g., 2K, 4K or larger) images covering camouflaged, salient, or meticulous objects in various backgrounds.
See also Highly Accurate Dichotomous Image Segmentation.


Barnard, K.[Kobus], Fan, Q.F.[Quan-Fu], Swaminathan, R.[Ranjini], Hoogs, A.[Anthony], Collins, R.[Roderic], Rondot, P.[Pascale], Kaufhold, J.[John],
Evaluation of Localized Semantics: Data, Methodology, and Experiments,
IJCV(77), No. 1-3, May 2008, pp. 199-217.
Springer DOI
Dataset, Segmentation. Dataset with hand segmentations.
WWW Link.


Kirillov, A.[Alexander], Mintun, E.[Eric], Ravi, N.[Nikhila], Mao, H.Z.[Han-Zi], Rolland, C.[Chloe], Gustafson, L.[Laura], Xiao, T.[Tete], Whitehead, S.[Spencer], Berg, A.C.[Alexander C.], Lo, W.Y.[Wan-Yen], Dollár, P.[Piotr], Girshick, R.[Ross],
Segment Anything,
ICCV23(3992-4003)
IEEE DOI
WWW Link.
Dataset, Segmentation.


Qi, L.[Lu], Kuen, J.[Jason], Shen, T.C.[Tian-Cheng], Gu, J.X.[Jiu-Xiang], Li, W.B.[Wen-Bo], Guo, W.D.[Wei-Dong], Jia, J.Y.[Jia-Ya], Lin, Z.[Zhe], Yang, M.H.[Ming-Hsuan],
High Quality Entity Segmentation,
ICCV23(4024-4033)
IEEE DOI Code:
WWW Link.
Dataset, Segmentation.


Upchurch, P.[Paul], Niu, R.[Ransen],
A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing,
ECCV22(VIII:450-466).
Springer DOI
Dataset, Segmentation.


Follmann, P.[Patrick], Böttger, T.[Tobias], Härtinger, P.[Philipp], König, R.[Rebecca], Ulrich, M.[Markus],
MVTec D2S: Densely Segmented Supermarket Dataset,
ECCV18(X: 581-597).
Springer DOI
Dataset, Segmentation.


Kampel, M.[Martin], Hanbury, A.[Allan], Blauensteiner, P.[Philipp], Wildenauer, H.[Horst],
Improved motion segmentation based on shadow detection,
ELCVIA(6), No. 3, December 2007, pp. 1-12.
DOI Link
Includes Test Data: Dataset, Shadow Detection.


Semantic Boundaries Dataset and Benchmark,
Online2011. Dataset, Segmentation.
HTML Version. or:
HTML Version.
See also Semantic contours from inverse detectors. Related to:
See also Berkeley Segmentation Dataset and Benchmark, The.
See also PASCAL Visual Object Classes Challenge 2012, The.


Arbelaez, P.[Pablo], Fowlkes, C.C.[Charless C.], and Martin, D.R.[David R.],
The Berkeley Segmentation Dataset and Benchmark,
Online2007. Dataset, Segmentation. Dataset, BSDS. Code, Segmentation.
WWW Link. The updated code and data for the earlier paper.
See also Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics, A.


Martin, D.R.[David R.], Fowlkes, C.C.[Charless C.], Tal, D.[Doron], Malik, J.[Jitendra],
A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms and Measuring Ecological Statistics,
ICCV01(II: 416-423).
IEEE DOI
Award, Helmholtz Prize.
And:
A Database of Human Segmented Natural Images and its Application to Evaluating Segmentation Algorithms,
PercOrg01(xx-yy). Dataset, Human Segmentation. BSDS300 DAtaset Multiple human segmentations and a segmentation consistency measure. Human-human are consistent with the measure, different images are not consistent. Promised online availability. 1000 images with hand segmentations. Multiple hand segmentations.


Anke, B.[Bellmann], Olaf, H.[Hellwich], Volker, R.[Rodehorst], Ulas, Y.[Yilmaz],
A Benchmark Dataset for Performance Evaluation of Shape-from-X Algorithms,
ISPRS08(B3b: 67 ff).
PDF File.
Dataset, Shape from X.


Aksoy, Y.[Yagiz], Kim, C.[Changil], Kellnhofer, P.[Petr], Paris, S.[Sylvain], Elgharib, M.[Mohamed], Pollefeys, M.[Marc], Matusik, W.[Wojciech],
A Dataset of Flash and Ambient Illumination Pairs from the Crowd,
ECCV18(IX: 644-660).
Springer DOI
Dataset, Illumination.


Narasimhan, S.G.[Srinivasa G.], Wang, C.[Chi], Nayar, S.K.[Shree K.],
All the Images of an Outdoor Scene,
ECCV02(III: 148 ff.).
Springer DOI
PDF File.
Dataset, Outdoor Scene. A database of the same location every hour for 5 months. Registered and calibrated.
WWW Link. for the database.


Shi, B.X.[Bo-Xin], Mo, Z.P.[Zhi-Peng], Wu, Z.[Zhe], Duan, D.L.[Ding-Long], Yeung, S.K.[Sai-Kit], Tan, P.[Ping],
A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo,
PAMI(41), No. 2, February 2019, pp. 271-284.
IEEE DOI

Earlier: A1, A3, A2, A4, A5, A6: CVPR16(3707-3716)
IEEE DOI
Dataset, Photometric Stereo. Lighting, Taxonomy, Benchmark testing, Shape, Brain modeling, Cameras, Heuristic algorithms, Photometric stereo, benchmark, dataset, uncalibrated


Recurrent Asynchronous Multimodal Networks + Events, Frames, Semantic labels, and Depth maps recorded in CARLA simulator,
2021
HTML Version. Code, Recurrent Networks. Code, Monocular Depth. Dataset, Monocular Depth.


Grosse, R.[Roger], Johnson, M.K.[Micah K.], Adelson, E.H.[Edward H.], Freeman, W.T.[William T.],
Ground truth dataset and baseline evaluations for intrinsic image algorithms,
ICCV09(2335-2342).
IEEE DOI
Dataset, Shading. For shading and reflectance computations.


Scharstein, D.[Daniel], Szeliski, R.S.[Richard S.],
A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms,
IJCV(47), No. 1-3, April-June 2002, pp. 7-42.
DOI Link
Code, Stereo. Dataset, Stereo. The data sets and code are also available:
WWW Link. Award, Everingham. for 2015


di Rita, M., Nascetti, A., Crespi, M.,
FOSS4G Date Assessment On the Isprs Optical Stereo Satellite Data: A Benchmark for DSM Generation,
Hannover17(635-638).
DOI Link
Dataset, Stereo. benchmark dataset with several stereo data sets from space borne stereo sensors


Scharstein, D.[Daniel], Hirschmüller, H.[Heiko], Kitajima, Y.[York], Krathwohl, G.[Greg], Nešic, N.[Nera], Wang, X.[Xi], Westling, P.[Porter],
High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth,
GCPR14(31-42).
Springer DOI
Dataset, Stereo. Award, GCPR.


Haeusler, R.[Ralf], Kondermann, D.[Daniel],
Synthesizing Real World Stereo Challenges,
GCPR13(164-173).
Springer DOI
Dataset, Stereo.


Janoch, A.[Allison], Karayev, S.[Sergey], Jia, Y.Q.[Yang-Qing], Barron, J.T.[Jonathan T.], Fritz, M.[Mario], Saenko, K.[Kate], Darrell, T.J.[Trevor J.],
A category-level 3-D object dataset: Putting the Kinect to work,
ConDepth11(1168-1174).
IEEE DOI
Dataset, Stereo. Color and depth pairs.


Browatzki, B.[Bjorn], Fischer, J.[Jan], Graf, B.[Birgit], Bulthoff, H.H.[Heinrich H.], Wallraven, C.[Christian],
Going into depth: Evaluating 2D and 3D cues for object classification on a new, large-scale object dataset,
ConDepth11(1189-1195).
IEEE DOI
Dataset, Stereo.


Haeusler, R.[Ralf], Klette, R.[Reinhard],
Analysis of KITTI Data for Stereo Analysis with Stereo Confidence Measures,
UnOptFlow12(II: 158-167).
Springer DOI

And:
Disparity Confidence Measures on Engineered and Outdoor Data,
CIARP12(624-631).
Springer DOI

Earlier:
Benchmarking Stereo Data (Not the Matching Algorithms),
DAGM10(383-392).
Springer DOI
Dataset, Stereo.


Janowski, A., Sawicki, P., Szulwic, J.,
Internet database for photogrammetric close range applications,
IEVM06(xx-yy).
PDF File.
Dataset, Photogrammetry.


CVLab dense multi-view stereo image database,
2010
HTML Version. Dataset, Stereo. Multiple views, ground level, of buildings


IS-3D: Data,
2008.
HTML Version. Dataset, Stereo. Multiple views of various structures.


Shao, S.[Shuai], Li, Z.M.[Ze-Ming], Zhang, T.Y.[Tian-Yuan], Peng, C.[Chao], Yu, G.[Gang], Zhang, X.Y.[Xiang-Yu], Li, J.[Jing], Sun, J.[Jian],
Objects365: A Large-Scale, High-Quality Dataset for Object Detection,
ICCV19(8429-8438)
IEEE DOI
Dataset, Object Detection. feature extraction, image annotation, image classification, image segmentation, learning (artificial intelligence), Clocks


DOTA: A Large-Scale Benchmark and Challenges for Object Detection in Aerial Images,
Online2021
WWW Link. Dataset, Aerial Objects.
2806 aerial images obtained from different sensors and platforms, including 15 classification categories (vehicle, track, storange tanks, sports fields, etc.)


TGRS-HRRSD-Dataset: High Resolution Remote Sensing Detection (HRRSD),
Online2017
WWW Link. Dataset, Aerial Objects.
21,761 images. in 13 categories.
See also Hierarchical and Robust Convolutional Neural Network for Very High-Resolution Remote Sensing Object Detection.


Wang, Q.[Qi], Zhu, G.K.[Guo-Kang], Yuan, Y.[Yuan],
Multi-spectral dataset and its application in saliency detection,
CVIU(117), No. 12, 2013, pp. 1748-1754.
Elsevier DOI
Dataset, Infrared. RGB+near infrared. Multi-spectral


Gauglitz, S.[Steffen], Höllerer, T.[Tobias], Turk, M.A.[Matthew A.],
Evaluation of Interest Point Detectors and Feature Descriptors for Visual Tracking,
IJCV(94), No. 3, September 2011, pp. 335-360.
WWW Link.
WWW Link.
Dataset, Tracking. Present a dataset with ground truth for evaluation. And evaluation of camera tracking.


Balntas, V.[Vassileios], Lenc, K.[Karel], Vedaldi, A.[Andrea], Tuytelaars, T.[Tinne], Matas, J.G.[Jiri G.], Mikolajczyk, K.[Krystian],
H-Patches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors,
PAMI(42), No. 11, November 2020, pp. 2825-2841.
IEEE DOI

Earlier: A1, A2, A3, A6, Only: CVPR17(3852-3861)
IEEE DOI
Dataset, Local Descriptors. HPatches dataset. Benchmark testing, Detectors, Protocols, Task analysis, Feature extraction, Training, Image matching, Local features, patch classification. Feature extraction, Protocols, Size, measurement


CUReT: Columbia-Utrecht Reflectance and Texture Database,
2006. Dataset, Texture.
WWW Link.


MIT Texture Data,
1995. Dataset, Texture.
HTML Version.


Texture Data,
2006. Dataset, Texture.
WWW Link.


Outex: New framework for empirical evaluation of texture analysis algorithms,
2006. Dataset, Texture.
WWW Link.


Texure Image Data,
2006. Dataset, Texture.
WWW Link. A variety of texture datasets. Includes Brodatz.


The KTH-TIPS and KTH-TIPS2 image databases,
2006. Dataset, Texture.
WWW Link. Textures under varying illumination, pose and scale. Extension of:
See also CUReT: Columbia-Utrecht Reflectance and Texture Database.


TILDA: Textile Texture Database,
1996. Dataset, Texture.
WWW Link.


Describable Textures Dataset (DTD),
2014 Dataset, Texture.
WWW Link.
See also Describing Textures in the Wild.


Hossain, S.[Shahera], Serikawa, S.[Seiichi],
Texture databases: A comprehensive survey,
PRL(34), No. 15, 2013, pp. 2007-2022.
Elsevier DOI
Dataset, Texture. Survey, Texture Datasets. Texture.


Xue, J.[Jia], Wadekar, P.[Paras], Zhang, H.[Hang], Teran, L.[Leizer], Dana, K.[Kristin], Nishino, K.[Ko],
Ground Terrain Database, GTOS,
Online2017.
HTML Version. Dataset, Texture.


Lee, S.K.[Seung-Kyu], Liu, Y.X.[Yan-Xi],
PSU Near-Regular Texture Database,
OnlinePSU, 2005.
WWW Link. Dataset, Texture.


Total found: 699

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis

Return to summary listing