Databases or Datasets for Computer Vision Applications and Testing

Test data is available in bits and pieces and in several larger repositories, These datasets are selected from references included in the Computer Vision Bibliography. The links on the Author and Journal references in the list point to entries in that database. Generally, to avoid confusion, in this bibliography database is used for database systems or research and would apply to image database query techniques rather than a database containing images for use in specific applications. I have thus chosen dataset to describe collections of images used by researchers in some domain. Current research and applications are highlighted in various Computer Vision and Image Processing Conferences. Some of these have evaluation sessions with related datasets.

Related resources include:


Dataset References by Dataset Use

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis

Dataset
Definition:* A collections of images used by researchers to evaluate programs. This is distinct from the uses of database, which is usually used to describe database systems or research or image databases used for querys.
      * *LHI Object Datasets
      * *Lotus Hill Institute
      * *NEC Animal Dataset
      * *Oxford Image Examples
      * *PEIPA Computer Vision Software
      * *Princeton
      * *Spectral Imaging Data Base
      * *Washington Ground Truth Image Database
      * Radius CDROM Ground Truthed Data Set, The

Dataset, 3-D Data
      * *ISPRS Terrestrial laser scanning and 3D imaging Datasets
      * *NaturePix: Visual Cognitive Modeling Research
      * *Stanford 3D Scanning Repository, The
      * How to measure the pose robustness of object views
      * Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes
      * WWW-Accessible 3D Image and Model Database for Computer Vision Research, A

Dataset, 3-D Models
      * *Large Geometric Models Archive

Dataset, Action Recognition
      * Free viewpoint action recognition using motion history volumes

Dataset, Actions
      * Actions as Space-Time Shapes
      * Propagation networks for recognition of partially ordered sequential action
      * Recognizing human actions: a local SVM approach
      * Tracking Multiple Objects through Occlusions

Dataset, Active Appearance Model
      * *Active Appearance Models

Dataset, Activity Recognition
      * Guide to the Carnegie Mellon University Multimodal Activity (CMU-MMAC) Database

Dataset, Activity Recogniton
      * *CLEAR: Classification of Events, Activities and Relationships
      * *i-LIDS: Bag and vehicle detection challenge
      * *OTCBVS Benchmark Dataset Collection
      * *PETS 2001 Benchmark Data
      * *PETS 2006 Benchmark Data
      * CHIL RT07 Evaluation Data, The

Dataset, Aerial Images
      * *Aerial Image Dataset

Dataset, Biometrics
      * *BANCA Database, The
      * Biosec baseline corpus: A multimodal biometric database
      * MCYT baseline corpus: a bimodal biometric database
      * XM2VTSDB: The Extended M2VTS Database

Dataset, Buildings
      * Calibrated, Registered Images of an Extended Urban Area
      * Object retrieval with large vocabularies and fast spatial matching

Dataset, Cardiac MRI
      * Cardiac MRI dataset

Dataset, Checks
      * new database for research on bank-check processing, A

Dataset, Color Calibration
      * Camera characterization for color research

Dataset, Color Constancy
      * Data Set for Colour Research, A

Dataset, Discussion
      * Dataset Issues in Object Recognition

Dataset, Documents
      * *NIST OCR Databases
      * *Warped Documents, IUPR
      * UvA color document dataset, The

Dataset, Expressions
      * *POSTECH Face Database
      * Comprehensive Database for Facial Expression Analysis

Dataset, Faces
      * *Annotated Facial Dataset
      * *AR Face Database, The
      * *BioID Face Database
      * *CAS-PEAL Large-Scale Chinese Face Database and Baseline Evaluations, The
      * *CMI Profile Face Images
      * *Color FERET Database, The
      * *CVL Face Database
      * *Description of the Collection of Facial Images
      * *Equinox: Human Identification at a Distance
      * *Face Detection Home Page
      * *Face Recognition Vendor Test 2006
      * *Face Recogniton Home Page
      * *FacePix Database
      * *FERET Database, The
      * *Frontal Face Images
      * *MIT Face Recognition Database
      * *NIST Mugshot Identification Database
      * *ORL Database of Faces, The
      * *POSTECH Face Database
      * *UCD Colour Face Image Database for Face Detection, The
      * *UMIST Face Database, The
      * *University of Oulu Physics-Based Face Database, The
      * *Yale Face Database
      * CMU Pose, Illumination, and Expression (PIE) Database of Human Faces, The
      * Comprehensive Database for Facial Expression Analysis
      * FERET Database and Evaluation Procedure for Face-Recognition Algorithms, The
      * FERET Evaluation Methodology for Face-Recognition Algorithms, The
      * Hyperspectral Face Database
      * Iranian Face Database with age, pose and expression
      * Labeled faces in the wild: A database for studying face recognition in unconstrained environments
      * VALID: A New Practical Audio-Visual Database, and Comparative Results
      * Video Database of Moving Faces and People, A

Dataset, Facial Expressions
      * Coding Facial Expressions with Gabor Wavelets

Dataset, Farsi Handwriting
      * New Large-Scale Multi-purpose Handwritten Farsi Database, A

Dataset, Fingerprints
      * FVC2000: Fingerprint Verification Competition
      * Handbook of Fingerprint Recognition
      * NIST Special Database 4, Fingerprint Database

Dataset, Flowers
      * Automated Flower Classification over a Large Number of Classes

Dataset, Gait Recognition
      * University of Southampton Multi-Biometric Tunnel and introducing a novel 3D gait dataset, The

Dataset, Gait
      * *Baseline Algorithm and Performance for Gait Based Human ID Challenge Problem

Dataset, Gesture
      * *POSTECH Face Database

Dataset, Handwriting
      * *Ground Truthed Handwritten Word Images
      * *On-line Handwriting Database
      * *Unipen Project
      * *USPS Office of Advanced Technology Database of Handwritten Cities, States, ZIP Codes, Digits, and Alphabetic Characters
      * Database for Handwritten Text Recognition Research, A

Dataset, Human Motion
      * HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion
      * INRIA Person Dataset

Dataset, Hyperspectral
      * Information limits on neural identification of coloured surfaces in natural scenes
      * Statistics of spatial cone-excitation ratios in natural scenes

Dataset, Image Retrieval
      * Evaluating Image Retrieval

Dataset, Images
      * *Abel Stock
      * *CalTech Archived Images
      * *Fotosearch
      * *OSU Datasets
      * *University of Southern California, Signal and Image Processing

Dataset, Iris Images
      * *CASIA Iris Image Database
      * *Iris Recognition Database
      * *Iris Recognition Database
      * *NIST ICE Iris Image Database
      * *UBIRIS database
      * Iris Database

Dataset, Iris Recognition
      * IV2 Multimodal Biometric Database (Including Iris, 2D, 3D, Stereoscopic, and Talking Face Data), and the IV2-2007 Evaluation Campaign, The

Dataset, Lip Reading
      * *Language Independent Lip Reading

Dataset, Mammography
      * *DDSM: Digital Database for Screening Mammography
      * *MiniMammographic Database

Dataset, Matching
      * Registration of Challenging Image Pairs: Initialization, Estimation, and Decision

Dataset, Medical Images
      * *Medical Dataset Archive
      * *Visible Human Project

Dataset, Motion Capture
      * *CMU Graphics Lab Motion Capture Database

Dataset, Motion
      * *CMU VASC Image Database
      * Human-assisted motion annotation

Dataset, Multi-View Data
      * *University of Illinois Datasets

Dataset, Natural Scenes
      * *CalTech 100 Natural Scenes
      * *University of Illinois Datasets

Dataset, Object Detection
      * CBCL StreetScenes Challenge Framework

Dataset, Object Recognition
      * *LHI Object Datasets
      * *NEC Animal Dataset
      * *University of Illinois Datasets
      * *Xcavator.Net
      * Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

Dataset, Objects
      * *CalTech 101 Objects Categories
      * *PASCAL Object Recognition Database Collection, The
      * *Video Objects: A Test Database for Video Object Recognition
      * Amsterdam Library of Object Images, The
      * Columbia Object Image Library (COIL-100)
      * Learning methods for generic object recognition with invariance to pose and lighting
      * Lost in quantization: Improving particular object retrieval in large scale image databases

Dataset, OCR
      * *ERIM Arabic Document Database
      * *Japanese Character Image Database
      * *NIST OCR Databases
      * *UMD Logo Database
      * CASIA-OLHWDB1: A Database of Online Handwritten Chinese Characters
      * FHT: An Unconstraint Farsi Handwritten Text Database
      * GERMANA Database, The
      * HCL2000: A Large-scale Handwritten Chinese Character Database for Handwritten Character Recognition

Dataset, Optical Flow
      * Database and Evaluation Methodology for Optical Flow, A

Dataset, Outdoor Scene
      * All the Images of an Outdoor Scene

Dataset, Pedestrians
      * Experimental Study on Pedestrian Classification, An

Dataset, People
      * Accurate Object Localization with Shape Masks

Dataset, Perceptual Grouping
      * in-depth study of graph partitioning measures for perceptual organization, An

Dataset, Photogrammetry
      * Internet database for photogrammetric close range applications

Dataset, Retina
      * DIARETDB1 diabetic retinopathy database and evaluation protocol, The

Dataset, Retrieval
      * *BBC Motion Gallery
      * *Washington Ground Truth Image Database
      * 80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
      * segmented and annotated IAPR TC-12 benchmark, The

Dataset, Segmentation
      * *LHI Segmentation Dataset
      * *LHI Surveillance Dataset
      * *Validation and Verification of Neural Network Systems
      * Berkeley Segmentation Dataset and Benchmark, The
      * Evaluation of Localized Semantics: Data, Methodology, and Experiments

Dataset, Shape from X
      * Benchmark Dataset for Performance Evaluation of Shape-from-X Algorithms, A

Dataset, Sign Language
      * Long Term Arm and Hand Tracking for Continuous Sign Language TV Broadcasts

Dataset, Sports
      * *LHI Sports Activity Dataset
      * *UCF Sports Action Dataset

Dataset, Steganalysis
      * Unseen Challenge data sets, The

Dataset, Stereo Data
      * *University of Illinois Datasets

Dataset, Stereo
      * Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms, A

Dataset, Surface Reconstruction
      * Benchmarking Dataset for Performance Evaluation of Automatic Surface Reconstruction Algorithms, A

Dataset, Surveillance
      * *Daimler Pedestrian Detection Benchmark
      * *MIT Pedestrian Database MITP
      * *PETS Benchmark Data
      * Crowd Flow Segmentation and Stability Analysis
      * Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis, A
      * Terrascope Dataset: Scripted Multi-Camera Indoor Video Surveillance with Ground-truth, The

Dataset, Texture
      * *CUReT: Columbia-Utrecht Reflectance and Texture Database
      * *KTH-TIPS and KTH-TIPS2 image databases, The
      * *MIT Texture Data
      * *Outex: New framework for empirical evaluation of texture analysis algorithms
      * *Texture Data
      * *TILDA: Textile Texture Database
      * *University of Illinois Datasets
      * PSU Near-Regular Texture Database

Dataset, Tracking
      * Tracking by an Optimal Sequence of Linear Predictors

Dataset, Urdu Handwriting
      * New Large Urdu Database for Off-Line Handwriting Recognition, A

Dataset, Vehicles
      * *MIT Car Database MITC
      * Learning to Detect Objects in Images via a Sparse, Part-Based Representation

Dataset, Video
      * *BBC Motion Gallery
      * *BEHAVE Interactions Test Case Scenarios
      * *CAVIAR Test Case Scenarios
      * *CVBASE Annotated Video Data
      * *Optic Flow Data
      * *University of Illinois Datasets

Dataset, Visual Hull
      * *University of Illinois Datasets

For more information on the topics, contact information, etc. see the annotated Computer Vision Bibliography or the Complete Conference Listing for Computer Vision and Image Analysis