The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of a larger set available from NIST.
It consists of four files that store data in a simple file format (idx format) which is documented in the MNIST database homepage. I wrote this C program to be able to extract the descriptions of handwritten digits of a certain value separately (eg. only the 0s or only the 1s). The text output is pretty simple and closely resembles the format of a PGM file. It is describes the grayscale pixel values of the image in decimal ASCII. Pixel values range from 0 to 255.
As a bonus, there is the opportunity to extract the images in portable graymap file format (PGM) image files.
This program was written with these guys mostly in mind.
[download mnist.c]
ok! eidame ton kwdika, katebasame ta arxeia apo to mnist, eidame kai to site tou wikipedia, apo ekei kai pera ti ginetai?
Παίρνεις τα data που σε ενδιαφέρουν (τα 0 και τα 1), τα βάζεις το MatLab …και κάνεις την άσκηση.
Γιώργο καλημέρα και καλή εβδομάδα !
Να υποθέσω ότι έμαθες για αυτό:
http://www.theregister.co.uk/2007/11/20/hmrc_loses_lots_data
Κανένα σχόλιο ;
@Dimitris Andrakakis:
Your wish is my command.
:)