Databases or Datasets for Computer Vision Applications and Testing

Generally, to avoid confusion, in this bibliography, the word database is used for database systems or research and would apply to image database query techniques rather than a database containing images for use in specific applications. I have chosen to use dataset to describe collections of images used by researchers in some domain. In the past test data was difficult, but the advent of modern digital cameras has simplified acquiring data. But in order to test and especially compare algorithms, a common dataset is essential.

Test data is available in bits and pieces and in several larger repositories, These listed datasets are selected from the references in the Computer Vision Bibliography. There are other datasets and often older ones get removed from web sites. The links on the Author and Journal references in the list point to entries in that database. Current research and applications are highlighted in various Computer Vision and Image Processing Conferences. Some of these have evaluation sessions with related datasets.

Computer Vision resources include:

Computer Vision Dataset (Database) References Ordered by Dataset Category

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis

Dataset
Definition:* A collections of images used by researchers to evaluate programs. This is distinct from the uses of database, which is usually used to describe database systems or research or image databases used for querys.
  * *LHI Object Datasets
  * *NEC Animal Dataset
  * *Oxford Image Examples
  * *PEIPA Computer Vision Software
  * *Princeton
  * *Washington Ground Truth Image Database
  * Radius CDROM Ground Truthed Data Set, The

Dataset, 3-D Data
  * *ISPRS Terrestrial laser scanning and 3D imaging Datasets
  * *NaturePix: Visual Cognitive Modeling Research
  * *Stanford 3D Scanning Repository, The
  * How to measure the pose robustness of object views
  * Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes
  * WWW-Accessible 3D Image and Model Database for Computer Vision Research, A

Dataset, 3-D Models
  * *Large Geometric Models Archive

Dataset, 3D Data
  * *CalTech Turntable Images
  * Farman Institute 3D Point Sets: High Precision 3D Data Sets

Dataset, Action Recognition
  * Action Similarity Labeling Challenge, The
  * Active Range Imaging Dataset for Indoor Surveillance
  * BEHAVE video dataset: Ground truthed video for multi-person behavior classification, The
  * Free viewpoint action recognition using motion history volumes
  * HMDB: A large video database for human motion recognition
  * large-scale benchmark dataset for event recognition in surveillance video, A
  * New Image Dataset on Human Interactions, A
  * Novel Approach for Fast Action Recognition using Simple Features, A

Dataset, Actions
  * Actions as Space-Time Shapes
  * POETICON enacted scenario corpus: A tool for human and computational experiments on action understanding, The
  * Propagation networks for recognition of partially ordered sequential action
  * Recognizing human actions: a local SVM approach
  * Tracking Multiple Objects through Occlusions

Dataset, Active Appearance Model
  * *Active Appearance Models

Dataset, Activity Recognition
  * Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database
  * human motion database: A cognitive and parametric sampling of human motion, The
  * Opportunity challenge: A benchmark database for on-body sensor-based activity recognition, The
  * survey of video datasets for human action and activity recognition, A
  * TUM Kitchen Data Set of everyday manipulation activities for motion tracking and action recognition, The

Dataset, Activity Recogniton
  * *CLEAR: Classification of Events, Activities and Relationships
  * *i-LIDS: Bag and vehicle detection challenge
  * *OTCBVS Benchmark Dataset Collection
  * *PETS 2001 Benchmark Data
  * *PETS 2006 Benchmark Data
  * CHIL RT07 Evaluation Data, The

Dataset, Aerial Images
  * *Aerial Image Dataset

Dataset, Aesthetic Analysis
  * AVA: A large-scale database for aesthetic visual analysis

Dataset, Arabic Text
  * KHATT: An open Arabic offline handwritten text database

Dataset, Arabic
  * *ERIM Arabic Document Database
  * QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification

Dataset, Attion Recognition
  * Hollywood 3D: Recognizing Actions in 3D Natural Scenes

Dataset, Bangla
  * CMATERdb1: A database of unconstrained handwritten Bangla and Bangla-English mixed script document image

Dataset, Biometrics
  * *BANCA Database, The
  * biometric database with rotating head videos and hand-drawn face sketches, A
  * Biosec baseline corpus: A multimodal biometric database
  * MCYT baseline corpus: a bimodal biometric database
  * Multiscenario Multienvironment BioSecure Multimodal Database (BMDB), The
  * Two Unconstrained Biometric Databases
  * West Pomeranian University of Technology Ear Database: A Tool for Testing Biometric Algorithms, The
  * XM2VTSDB: The Extended M2VTS Database

Dataset, Building Detection
  * *ISPRS benchmark on urban object detection and 3D building reconstruction

Dataset, Buildings
  * Calibrated, Registered Images of an Extended Urban Area
  * Digital Basic Geodata Sets Hausumringe and Hauskoordinaten: Characterization and Pre-processing for Building Stock Analysis, The
  * Object retrieval with large vocabularies and fast spatial matching

Dataset, Cardiac MRI
  * Cardiac MRI dataset

Dataset, Change Detection
  * CDnet 2014: An Expanded Change Detection Benchmark Dataset
  * Changedetection.net: A new change detection benchmark dataset

Dataset, Checks
  * new database for research on bank-check processing, A

Dataset, Chinese Characters
  * SCUT-COUCH Textline_NU: An Unconstrained Online Handwritten Chinese Text Lines Dataset

Dataset, Color Calibration
  * Camera characterization for color research

Dataset, Color Constancy
  * Data Set for Colour Research, A

Dataset, Color Images
  * New Color Image Database TID2013: Innovations and Results, A

Dataset, Comics
  * eBDtheque: A Representative Database of Comics

Dataset, Cultural Heritage
  * *CyArk

Dataset, Discussion
  * Dataset Issues in Object Recognition

Dataset, Document Analysis
  * Media Team Document Database II

Dataset, Document Images
  * IUPR Dataset of Camera-Captured Document Images, The

Dataset, Documents
  * *NIST OCR Databases
  * *Warped Documents, IUPR
  * UvA color document dataset, The

Dataset, Emotions
  * Belfast Induced Natural Emotion Database, The

Dataset, Event Recognition
  * Event Recognition in Photo Collections with a Stopwatch HMM
  * large-scale benchmark dataset for event recognition in surveillance video, A

Dataset, Expression Recognition
  * Analyses of a Multimodal Spontaneous Facial Expression Database

Dataset, Expressions
  * *POSTECH Face Database
  * Comprehensive Database for Facial Expression Analysis

Dataset, Eye Fixation
  * Eye Fixation Database for Saliency Detection in Images, An

Dataset, Eye Tracking
  * Eye-Tracking Database for a Set of Standard Video Sequences

Dataset, Face Recognition
  * *OTCBVS Benchmark Dataset Collection
  * CASIA NIR-VIS 2.0 Face Database, The

Dataset, Faces
  * *Annotated Facial Dataset
  * *AR Face Database, The
  * *BioID Face Database
  * *CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations, The
  * *CMU Facial Expression Database
  * *CMU Profile Face Images
  * *Color FERET Database, The
  * *CVL Face Database
  * *Description of the Collection of Facial Images
  * *Equinox: Human Identification at a Distance
  * *Face Detection Home Page
  * *Face Recognition Vendor Test 2006
  * *Face Recogniton Home Page
  * *FacePix Database
  * *FDDB: Face Detection Data Set and Benchmark
  * *FERET Database, The
  * *Frontal Face Images
  * *MIT Face Recognition Database
  * *NIST Mugshot Identification Database
  * *ORL Database of Faces, The
  * *POSTECH Face Database
  * *UB KinFace Database
  * *UCD Colour Face Image Database for Face Detection, The
  * *UMIST Face Database, The
  * *University of Oulu Face Video Database, The
  * *University of Oulu Physics-Based Face Database, The
  * *Yale Face Database
  * CMU Pose, Illumination, and Expression Database, The
  * Comprehensive Database for Facial Expression Analysis
  * FERET Database and Evaluation Procedure for Face-Recognition Algorithms, The
  * FERET Evaluation Methodology for Face-Recognition Algorithms, The
  * Hyperspectral Face Database
  * Iranian Face Database with age, pose and expression
  * Labeled faces in the wild: A database for studying face recognition in unconstrained environments
  * Large-Scale Database of Images and Captions for Automatic Face Naming, A
  * Multi-PIE
  * new multi-purpose audio-visual UNMC-VIER database with multiple variabilities, A
  * Photoface database, The
  * Single- and cross- database benchmarks for gender classification under unconstrained settings
  * Texas 3D Face Recognition Database
  * UHDB11 Database for 3D-2D Face Recognition
  * UMB-DB: A database of partially occluded 3D faces
  * VADANA: A dense dataset for facial image analysis
  * VALID: A New Practical Audio-Visual Database, and Comparative Results
  * Video Database of Moving Faces and People, A

Dataset, Faces, Features
  * Annotated Facial Landmarks in the Wild: A large-scale, real-world database for facial landmark localization

Dataset, Facial Action
  * DISFA: A Spontaneous Facial Action Intensity Database

Dataset, Facial Expression
  * *CMU Facial Expression Database
  * Painful data: The UNBC-McMaster shoulder pain expression archive database
  * Thermal Facial Emotion Database and Its Analysis, A

Dataset, Facial Expressions
  * *Oulu-CASIA NIR&VIS facial expression database
  * Affectiva-MIT Facial Expression Dataset (AM-FED): Naturalistic and Spontaneous Facial Expressions Collected In-the-Wild
  * Cardiff Conversation Database (CCDb): A Database of Natural Dyadic Conversations
  * CASME database: A dataset of spontaneous micro-expressions collected from neutralized faces
  * Coding Facial Expressions with Gabor Wavelets
  * Extended Cohn-Kanade Dataset (CK+): A complete dataset for action unit and emotion-specified expression, The
  * Hi4D-ADSIP 3-D dynamic facial articulation database
  * high-resolution 3D dynamic facial expression database, A
  * High-resolution comprehensive 3-D dynamic database for facial articulation analysis
  * high-resolution spontaneous 3D dynamic facial expression database, A
  * Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference, A

Dataset, Farsi Handwriting
  * New Large-Scale Multi-purpose Handwritten Farsi Database, A

Dataset, Fingerprints
  * FVC2000: Fingerprint Verification Competition
  * Handbook of Fingerprint Recognition
  * NIST Special Database 4, Fingerprint Database

Dataset, Fish
  * *Tropical Coral Reef Fish Detection, Tracking And Classification

Dataset, Flowers
  * Automated Flower Classification over a Large Number of Classes

Dataset, Foreground Extraction
  * Benchmark Dataset for Outdoor Foreground/Background Extraction, A

Dataset, Formula
  * MfrDB: Database of Annotated On-Line Mathematical Formulae

Dataset, Gait Recognition
  * University of Southampton Multi-Biometric Tunnel and introducing a novel 3D gait dataset, The

Dataset, Gait
  * *Baseline Algorithm and Performance for Gait Based Human ID Challenge Problem

Dataset, Gesture Recognition
  * G3D: A gaming action dataset and real time action recognition evaluation framework

Dataset, Gesture
  * *POSTECH Face Database

Dataset, Gestures
  * 3-D Audio-Visual Corpus of Affective Communication, A

Dataset, Hand Gestures
  * natural and synthetic corpus for benchmarking of hand gesture recognition systems, A

Dataset, Handwriting
  * *Ground Truthed Handwritten Word Images
  * *On-line Handwriting Database
  * *Unipen Project
  * *USPS Office of Advanced Technology Database of Handwritten Cities, States, ZIP Codes, Digits, and Alphabetic Characters
  * Database for Handwritten Text Recognition Research, A
  * MAYASTROUN: A Multilanguage Handwriting Database

Dataset, Handwritting, Arabic
  * KHATT: Arabic Offline Handwritten Text Database

Dataset, Human Actions
  * Berkeley MHAD: A comprehensive Multimodal Human Action Database

Dataset, Human Activity
  * PROMETHEUS: heterogeneous sensor database in support of research on human behavioral patterns in unrestricted environments

Dataset, Human Motion
  * HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion
  * INRIA Person Dataset

Dataset, Human Pose
  * UMPM benchmark: A multi-person dataset with synchronized video and motion capture data for evaluation of articulated human motion and interaction

Dataset, Human Tracking
  * *Edinburgh Informatics Forum Pedestrian Database

Dataset, Humans
  * *H3D Dataset

Dataset, Hyperspectral
  * Information limits on neural identification of coloured surfaces in natural scenes
  * Statistics of spatial cone-excitation ratios in natural scenes

Dataset, Image Matting
  * *Alpha Matting Evaluation Website

Dataset, Image Quality
  * CID:IQ: A New Image Quality Database

Dataset, Image Retrieval
  * *Large Scale Dataset for Cross-Model Multimedia Analysis
  * Evaluating Image Retrieval

Dataset, Image Stitching
  * *Image Stitching Database

Dataset, Images
  * *Abel Stock
  * *CalTech Archived Images
  * *OSU Datasets
  * *University of Southern California, Signal and Image Processing

Dataset, Infrared
  * Multi-spectral dataset and its application in saliency detection

Dataset, Iris Images
  * *CASIA Iris Image Database
  * *Iris Recognition Database
  * *Iris Recognition Database
  * *NIST ICE Iris Image Database
  * *UBIRIS database
  * *UTIRIS: University of Tehran IRIS Image Repository
  * Iris Database

Dataset, Iris Recognition
  * IV2 Multimodal Biometric Database (Including Iris, 2D, 3D, Stereoscopic, and Talking Face Data), and the IV2-2007 Evaluation Campaign, The
  * UBIRIS.v2: A Database of Visible Wavelength Iris Images Captured On-the-Move and At-a-Distance, The

Dataset, Landmarks
  * PKUBench: A context rich mobile visual search benchmark

Dataset, Laughter
  * MAHNOB Laughter database, The

Dataset, Lip Reading
  * *Language Independent Lip Reading
  * *OuluVS database

Dataset, Logos
  * *FlickrLogos-32

Dataset, Lumber
  * *Wood image database

Dataset, Mammography
  * *DDSM: Digital Database for Screening Mammography
  * *MiniMammographic Database

Dataset, Matching
  * Registration of Challenging Image Pairs: Initialization, Estimation, and Decision

Dataset, Medical Images
  * *Medical Dataset Archive
  * *Visible Human Project

Dataset, Motion Capture
  * *CMU Graphics Lab Motion Capture Database

Dataset, Motion Detection
  * *Change Detection Benchmark Website

Dataset, Motion
  * *CMU VASC Image Database
  * *Hopkins 155
  * FeEval A Dataset for Evaluation of Spatio-temporal Local Features
  * Human-assisted motion annotation

Dataset, Multi-View Data
  * *University of Illinois Datasets

Dataset, Natural Image Text
  * NEOCR: A Configurable Dataset for Natural Image Text Recognition

Dataset, Natural Scenes
  * *CalTech 100 Natural Scenes
  * *University of Illinois Datasets

Dataset, Navigation
  * Indoor RGB-D Dataset for the Evaluation of Robot Navigation Algorithms, An

Dataset, Object Detection
  * CBCL StreetScenes Challenge Framework

Dataset, Object Recognition
  * *LHI Object Datasets
  * *NEC Animal Dataset
  * *University of Illinois Datasets
  * *Xcavator.Net
  * iCub World: Friendly Robots Help Building Good Vision Data-Sets
  * Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

Dataset, Objects
  * *Animals with Attributes: A dataset for Attribute Based Classification
  * *CalTech 101 Objects Categories
  * *Image Net
  * *PASCAL Object Recognition Database Collection, The
  * *Video Objects: A Test Database for Video Object Recognition
  * Amsterdam Library of Object Images, The
  * Columbia Object Image Library (COIL-100)
  * Learning methods for generic object recognition with invariance to pose and lighting
  * Lost in quantization: Improving particular object retrieval in large scale image databases
  * Microsoft COCO: Common Objects in Context

Dataset, OCR
  * *ERIM Arabic Document Database
  * *Japanese Character Image Database
  * *NIST OCR Databases
  * *UMD Logo Database
  * CASIA Online and Offline Chinese Handwriting Databases
  * CASIA-OLHWDB1: A Database of Online Handwritten Chinese Characters
  * Creation of a Huge Annotated Database for Tamil and Kannada OHR
  * Empirical Evaluation on HIT-OR3C Database, An
  * FHT: An Unconstraint Farsi Handwritten Text Database
  * GERMANA Database, The
  * HAMEX: A Handwritten and Audio Dataset of Mathematical Expressions
  * HCL2000: A Large-scale Handwritten Chinese Character Database for Handwritten Character Recognition
  * IBM_UB_1: A Dual Mode Unconstrained English Handwriting Dataset

Dataset, Optical Flow
  * Database and Evaluation Methodology for Optical Flow, A

Dataset, Outdoor Scene
  * All the Images of an Outdoor Scene

Dataset, Pedestrian Detection
  * *Daimler Pedestrian Detection Benchmark
  * IAIR-CarPed: A psychophysically annotated dataset with fine-grained and layered semantic labels for object recognition

Dataset, Pedestrians
  * Experimental Study on Pedestrian Classification, An

Dataset, People
  * Accurate Object Localization with Shape Masks

Dataset, Perceptual Grouping
  * in-depth study of graph partitioning measures for perceptual organization, An

Dataset, Person Detection
  * corpus for benchmarking of people detection algorithms, A

Dataset, Photogrammetry
  * Internet database for photogrammetric close range applications

Dataset, Pottery
  * *Beazley Archive of Classical Art Pottery Database, The

Dataset, Re-Identification
  * Database for Person Re-Identification in Multi-Camera Surveillance Networks, A

Dataset, Recognition
  * Geometric Context from a Single Image

Dataset, Retina
  * DIARETDB1 diabetic retinopathy database and evaluation protocol, The

Dataset, Retrieval
  * *BBC Motion Gallery
  * *Washington Ground Truth Image Database
  * 80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
  * LableMe: The Open Annotation Tool
  * segmented and annotated IAPR TC-12 benchmark, The

Dataset, Scene Understanding
  * SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels

Dataset, Segmentation
  * *LHI Segmentation Dataset
  * *LHI Surveillance Dataset
  * *Lotus Hill Institute
  * *Validation and Verification of Neural Network Systems
  * Berkeley Segmentation Dataset and Benchmark, The
  * Database of human segmented images and its application in boundary detection
  * Evaluation of Localized Semantics: Data, Methodology, and Experiments

Dataset, Shading
  * Ground truth dataset and baseline evaluations for intrinsic image algorithms

Dataset, Shape from X
  * Benchmark Dataset for Performance Evaluation of Shape-from-X Algorithms, A

Dataset, Sign Language
  * Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts

Dataset, Signatures
  * SID Signature Database: A Tunisian Off-line Handwritten Signature Database

Dataset, SLAM
  * benchmarking tool for MAV visual pose estimation, A

Dataset, Spectral Imaging
  * *Spectral Imaging Data Base

Dataset, Sports
  * *LHI Sports Activity Dataset
  * *UCF Sports Action Dataset

Dataset, Steganalysis
  * Unseen Challenge data sets, The

Dataset, Stereo Data
  * *University of Illinois Datasets

Dataset, Stereo
  * *CVLab dense multi-view stereo image database
  * *IS-3D: Data
  * Benchmarking Stereo Data (Not the Matching Algorithms)
  * category-level 3-D object dataset: Putting the Kinect to work, A
  * Going into depth: Evaluating 2D and 3D cues for object classification on a new, large-scale object dataset
  * Synthesizing Real World Stereo Challenges
  * Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms, A

Dataset, Surface Reconstruction
  * Benchmarking Dataset for Performance Evaluation of Automatic Surface Reconstruction Algorithms, A

Dataset, Surveillance
  * *Daimler Pedestrian Detection Benchmark
  * *Edinburgh Informatics Forum Pedestrian Database
  * *MIT Pedestrian Database MITP
  * *PETS Benchmark Data
  * *UCF Action Recogniton Dataset 50
  * *UCF-ARG
  * *UCF-iPhone
  * Crowd Flow Segmentation and Stability Analysis
  * Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis, A
  * LOST: Longterm Observation of Scenes (with Tracks)
  * Terrascope Dataset: Scripted Multi-Camera Indoor Video Surveillance with Ground-truth, The

Dataset, Symmetry Images
  * Curved Glide-Reflection Symmetry Detection

Dataset, Text in Images
  * *Computer Vision Lab OCR DataBase: CVL OCR DB

Dataset, Text Retrieval
  * *Large Scale Dataset for Cross-Model Multimedia Analysis

Dataset, Texture
  * *CUReT: Columbia-Utrecht Reflectance and Texture Database
  * *KTH-TIPS and KTH-TIPS2 image databases, The
  * *MIT Texture Data
  * *Outex: New framework for empirical evaluation of texture analysis algorithms
  * *Texture Data
  * *Texure Image Data
  * *TILDA: Textile Texture Database
  * *University of Illinois Datasets
  * PSU Near-Regular Texture Database
  * Texture databases: A comprehensive survey

Dataset, Tracking
  * *OTCBVS Benchmark Dataset Collection
  * *UCF Parking Lot Tracking
  * Evaluation of Interest Point Detectors and Feature Descriptors for Visual Tracking
  * Floor Fields for Tracking in High Density Crowd Scenes
  * Tracking by an Optimal Sequence of Linear Predictors

Dataset, Traffic Signs
  * *Swedish Trafic Signs

Dataset, Traffic
  * Multi-sensor Traffic Scene Dataset with Omnidirectional Video, A

Dataset, Urdu Handwriting
  * New Large Urdu Database for Off-Line Handwriting Recognition, A

Dataset, Vehicles
  * *MIT Car Database MITC
  * Learning to Detect Objects in Images via a Sparse, Part-Based Representation
  * NYC3DCars: A Dataset of 3D Vehicles in Geographic Context

Dataset, Video Database
  * *Large Scale Video Database

Dataset, Video
  * *BBC Motion Gallery
  * *BEHAVE Interactions Test Case Scenarios
  * *CAVIAR Test Case Scenarios
  * *CVBASE Annotated Video Data
  * *Optic Flow Data
  * *University of Illinois Datasets

Dataset, Visual Hull
  * *University of Illinois Datasets

Dataset, Writer Identification
  * CVL-DataBase: An Off-Line Database for Writer Retrieval, Writer Identification and Word Spotting

Dataset,Surveillance
  * Dana36: A Multi-camera Image Dataset for Object Identification in Surveillance Scenarios

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis