Skip to content

Instantly share code, notes, and snippets.

@octaviomtz
Created March 26, 2015 20:47
Show Gist options
  • Save octaviomtz/d4d45a74fe972ec69536 to your computer and use it in GitHub Desktop.
Save octaviomtz/d4d45a74fe972ec69536 to your computer and use it in GitHub Desktop.
123 machine learning databases
Problem File Name Relation Name nRows TestMethod nTrain nTest nVars nTargets
abalone/abalone.arff abalone 4177 test-set cross-validation 4077 100 8 3
acute-inflammation/acute-inflammation.arff acute-inflammation 120 leave-one-out cross-validation 119 100 6 2
acute-nephritis/acute-nephritis.arff acute-nephritis 120 leave-one-out cross-validation 119 100 6 2
adult/adult_train.arff adult 32561 test-set cross-validation 32461 100 14 2
annealing/annealing_train.arff annealing 798 test-set cross-validation 698 100 31 5
arrhythmia/arrhythmia.arff arrhythmia 452 leave-one-out cross-validation 451 100 262 13
audiology-std/con_patrons_repetidos/audiology-std_train.arff audiology-std 194 leave-one-out cross-validation 193 100 59 18
audiology-std/audiology-std_train.arff audiology-std 171 leave-one-out cross-validation 170 100 59 18
balance-scale/balance-scale.arff balance-scale 625 test-set cross-validation 525 100 4 3
balloons/balloons.arff balloons 16 leave-one-out cross-validation 15 16 4 2
bank/bank.arff bank 4521 test-set cross-validation 4421 100 16 2
blood/blood.arff blood 748 test-set cross-validation 648 100 4 2
breast-cancer/breast-cancer.arff breast-cancer 286 leave-one-out cross-validation 285 100 9 2
breast-cancer-wisc/breast-cancer-wisc.arff breast-cancer-wisc 699 test-set cross-validation 599 100 9 2
breast-cancer-wisc-diag/breast-cancer-wisc-diag.arff breast-cancer-wisc-diag 569 test-set cross-validation 469 100 30 2
breast-cancer-wisc-prog/breast-cancer-wisc-prog.arff breast-cancer-wisc-prog 198 leave-one-out cross-validation 197 100 33 2
breast-tissue/breast-tissue.arff breast-tissue 106 leave-one-out cross-validation 105 100 9 6
car/car.arff car 1728 test-set cross-validation 1628 100 6 4
cardiotocography-3clases/cardiotocography-3clases.arff cardiotocography-3clases 2126 test-set cross-validation 2026 100 21 3
cardiotocography-10clases/cardiotocography-10clases.arff cardiotocography-10clases 2126 test-set cross-validation 2026 100 21 10
chess-krvk/chess-krvk.arff chess-krvk 28056 test-set cross-validation 27956 100 6 18
chess-krvkp/chess-krvkp.arff chess-krvkp 3196 test-set cross-validation 3096 100 36 2
congressional-voting/congressional-voting.arff congressional-voting 435 leave-one-out cross-validation 434 100 16 2
conn-bench-sonar-mines-rocks/conn-bench-sonar-mines-rocks.arff conn-bench-sonar-mines-rocks 208 leave-one-out cross-validation 207 100 60 2
conn-bench-vowel-deterding/conn-bench-vowel-deterding_train.arff conn-bench-vowel-deterding 528 test-set cross-validation 428 100 11 11
connect-4/connect-4.arff connect-4 67557 test-set cross-validation 67457 100 42 2
contrac/contrac.arff contrac 1473 test-set cross-validation 1373 100 9 3
credit-approval/credit-approval.arff credit-approval 690 test-set cross-validation 590 100 15 2
cylinder-bands/cylinder-bands.arff cylinder-bands 512 test-set cross-validation 412 100 35 2
dermatology/dermatology.arff dermatology 366 leave-one-out cross-validation 365 100 34 6
echocardiogram/echocardiogram.arff echocardiogram 131 leave-one-out cross-validation 130 100 10 2
ecoli/ecoli.arff ecoli 336 leave-one-out cross-validation 335 100 7 8
energy-y1/energy-y1.arff energy-y1 768 test-set cross-validation 668 100 8 3
energy-y2/energy-y2.arff energy-y2 768 test-set cross-validation 668 100 8 3
fertility/fertility.arff fertility 100 leave-one-out cross-validation 99 100 9 2
flags/flags.arff flags 194 leave-one-out cross-validation 193 100 28 8
glass/glass.arff glass 214 leave-one-out cross-validation 213 100 9 6
haberman-survival/haberman-survival.arff haberman-survival 306 leave-one-out cross-validation 305 100 3 2
hayes-roth/hayes-roth_train.arff hayes-roth 132 leave-one-out cross-validation 131 100 3 3
heart-cleveland/heart-cleveland.arff heart-cleveland 303 leave-one-out cross-validation 302 100 13 5
heart-hungarian/heart-hungarian.arff heart-hungarian 294 leave-one-out cross-validation 293 100 12 2
heart-switzerland/heart-switzerland.arff heart-switzerland 123 leave-one-out cross-validation 122 100 12 5
heart-va/heart-va.arff heart-va 200 leave-one-out cross-validation 199 100 12 5
hepatitis/hepatitis.arff hepatitis 155 leave-one-out cross-validation 154 100 19 2
hill-valley/hill-valley_train.arff hill-valley 606 test-set cross-validation 506 100 100 2
horse-colic/horse-colic_train.arff horse-colic 300 leave-one-out cross-validation 299 100 25 2
ilpd-indian-liver/ilpd-indian-liver.arff ilpd-indian-liver 583 test-set cross-validation 483 100 9 2
image-segmentation/image-segmentation_train.arff image-segmentation 210 leave-one-out cross-validation 209 100 18 7
ionosphere/ionosphere.arff ionosphere 351 leave-one-out cross-validation 350 100 33 2
iris/iris.arff iris 150 leave-one-out cross-validation 149 100 4 3
led-display/led-display.arff led-display 1000 test-set cross-validation 900 100 7 10
lenses/lenses.arff lenses 24 leave-one-out cross-validation 23 24 4 3
letter/letter.arff letter 20000 test-set cross-validation 19900 100 16 26
libras/libras.arff libras 360 leave-one-out cross-validation 359 100 90 15
low-res-spect/low-res-spect.arff low-res-spect 531 test-set cross-validation 431 100 100 9
lung-cancer/lung-cancer.arff lung-cancer 32 leave-one-out cross-validation 31 32 56 3
lymphography/lymphography.arff lymphography 148 leave-one-out cross-validation 147 100 18 4
magic/magic.arff magic 19020 test-set cross-validation 18920 100 10 2
mammographic/mammographic.arff mammographic 961 test-set cross-validation 861 100 5 2
miniboone/miniboone.arff miniboone 130064 test-set cross-validation 129964 100 50 2
molec-biol-promoter/molec-biol-promoter.arff molec-biol-promoter 106 leave-one-out cross-validation 105 100 57 2
molec-biol-splice/molec-biol-splice.arff molec-biol-splice 3190 test-set cross-validation 3090 100 60 3
monks-1/monks-1_train.arff monks-1 124 leave-one-out cross-validation 123 100 6 2
monks-2/monks-2_train.arff monks-2 169 leave-one-out cross-validation 168 100 6 2
monks-3/monks-3_train.arff monks-3 122 leave-one-out cross-validation 121 100 6 2
mushroom/mushroom.arff mushroom 8124 test-set cross-validation 8024 100 21 2
musk-1/musk-1.arff musk-1 476 leave-one-out cross-validation 475 100 166 2
musk-2/musk-2.arff musk-2 6598 test-set cross-validation 6498 100 166 2
nursery/nursery.arff nursery 12960 test-set cross-validation 12860 100 8 5
oocytes_merluccius_nucleus_4d/oocytes_merluccius_nucleus_4d.arff oocytes_merluccius_nucleus_4d 1022 test-set cross-validation 922 100 41 2
oocytes_merluccius_states_2f/oocytes_merluccius_states_2f.arff oocytes_merluccius_states_2f 1022 test-set cross-validation 922 100 25 3
oocytes_trisopterus_nucleus_2f/oocytes_trisopterus_nucleus_2f.arff oocytes_trisopterus_nucleus_2f 912 test-set cross-validation 812 100 25 2
oocytes_trisopterus_states_5b/oocytes_trisopterus_states_5b.arff oocytes_trisopterus_states_5b 912 test-set cross-validation 812 100 32 3
optical/optical_train.arff optical 3823 test-set cross-validation 3723 100 62 10
ozone/ozone.arff ozone 2536 test-set cross-validation 2436 100 72 2
page-blocks/page-blocks.arff page-blocks 5473 test-set cross-validation 5373 100 10 5
parkinsons/parkinsons.arff parkinsons 195 leave-one-out cross-validation 194 100 22 2
pendigits/pendigits_train.arff pendigits 7494 test-set cross-validation 7394 100 16 10
pima/pima.arff pima 768 test-set cross-validation 668 100 8 2
pittsburg-bridges-MATERIAL/pittsburg-bridges-MATERIAL.arff pittsburg-bridges-MATERIAL 106 leave-one-out cross-validation 105 100 7 3
pittsburg-bridges-REL-L/pittsburg-bridges-REL-L.arff pittsburg-bridges-REL-L 103 leave-one-out cross-validation 102 100 7 3
pittsburg-bridges-SPAN/pittsburg-bridges-SPAN.arff pittsburg-bridges-SPAN 92 leave-one-out cross-validation 91 92 7 3
pittsburg-bridges-T-OR-D/pittsburg-bridges-T-OR-D.arff pittsburg-bridges-T-OR-D 102 leave-one-out cross-validation 101 100 7 2
pittsburg-bridges-TYPE/pittsburg-bridges-TYPE.arff pittsburg-bridges-TYPE 105 leave-one-out cross-validation 104 100 7 6
planning/planning.arff planning 182 leave-one-out cross-validation 181 100 12 2
plant-margin/plant-margin.arff plant-margin 1600 test-set cross-validation 1500 100 64 100
plant-shape/plant-shape.arff plant-shape 1600 test-set cross-validation 1500 100 64 100
plant-texture/plant-texture.arff plant-texture 1599 test-set cross-validation 1499 100 64 100
post-operative/post-operative.arff post-operative 90 leave-one-out cross-validation 89 90 8 3
primary-tumor/primary-tumor.arff primary-tumor 330 leave-one-out cross-validation 329 100 17 15
ringnorm/ringnorm.arff ringnorm 7400 test-set cross-validation 7300 100 20 2
seeds/seeds.arff seeds 210 leave-one-out cross-validation 209 100 7 3
semeion/semeion.arff semeion 1593 test-set cross-validation 1493 100 256 10
soybean/soybean_train.arff soybean 307 leave-one-out cross-validation 306 100 35 18
spambase/spambase.arff spambase 4601 test-set cross-validation 4501 100 57 2
spect/spect_train.arff spect 79 leave-one-out cross-validation 78 79 22 2
spectf/spectf_train.arff spectf 80 leave-one-out cross-validation 79 80 44 2
statlog-australian-credit/statlog-australian-credit.arff statlog-australian-credit 690 test-set cross-validation 590 100 14 2
statlog-german-credit/statlog-german-credit.arff statlog-german-credit 1000 test-set cross-validation 900 100 24 2
statlog-heart/statlog-heart.arff statlog-heart 270 leave-one-out cross-validation 269 100 13 2
statlog-image/statlog-image.arff statlog-image 2310 test-set cross-validation 2210 100 18 7
statlog-landsat/statlog-landsat_train.arff statlog-landsat 4435 test-set cross-validation 4335 100 36 6
statlog-shuttle/statlog-shuttle_train.arff statlog-shuttle 43500 test-set cross-validation 43400 100 9 7
statlog-vehicle/statlog-vehicle.arff statlog-vehicle 846 test-set cross-validation 746 100 18 4
steel-plates/steel-plates.arff steel-plates 1941 test-set cross-validation 1841 100 27 7
synthetic-control/synthetic-control.arff synthetic-control 600 test-set cross-validation 500 100 60 6
teaching/teaching.arff teaching 151 leave-one-out cross-validation 150 100 5 3
thyroid/thyroid_train.arff thyroid 3772 test-set cross-validation 3672 100 21 3
tic-tac-toe/tic-tac-toe.arff tic-tac-toe 958 test-set cross-validation 858 100 9 2
titanic/titanic.arff titanic 2201 test-set cross-validation 2101 100 3 2
twonorm/twonorm.arff twonorm 7400 test-set cross-validation 7300 100 20 2
vertebral-column-2clases/datos_orixinais/column_3C_weka.arff column_3C_weka 310 leave-one-out cross-validation 309 100 6 3
vertebral-column-2clases/datos_orixinais/column_2C_weka.arff column_2C_weka 310 leave-one-out cross-validation 309 100 6 2
vertebral-column-2clases/vertebral-column-2clases.arff vertebral-column-2clases 310 leave-one-out cross-validation 309 100 6 2
vertebral-column-3clases/vertebral-column-3clases.arff vertebral-column-3clases 310 leave-one-out cross-validation 309 100 6 3
wall-following/wall-following.arff wall-following 5456 test-set cross-validation 5356 100 24 4
waveform/waveform.arff waveform 5000 test-set cross-validation 4900 100 21 3
waveform-noise/waveform-noise.arff waveform-noise 5000 test-set cross-validation 4900 100 40 3
wine/wine.arff wine 178 leave-one-out cross-validation 177 100 13 3
wine-quality-red/wine-quality-red.arff wine-quality-red 1599 test-set cross-validation 1499 100 11 6
wine-quality-white/wine-quality-white.arff wine-quality-white 4898 test-set cross-validation 4798 100 11 7
yeast/yeast.arff yeast 1484 test-set cross-validation 1384 100 8 10
zoo/zoo.arff zoo 101 leave-one-out cross-validation 100 100 16 7
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Loading CSV Data with D3</title>
<script type="text/javascript" src="http://d3js.org/d3.v3.min.js"></script>
</head>
<body>
<p>Not much to see here; try looking in the console!</p>
<script type="text/javascript">
d3.select("body").append("svg").attr("width",900).attr("height",400)
d3.select("svg").append("circle").attr("r",30).attr("fill","rgba(230, 230, 230, 0.15)").attr("cy",200).attr("cx",200)
d3.select("svg").append("circle").attr("r",35).attr("fill","rgba(230, 230, 230, 0.20)").attr("cy",200).attr("cx",250)
d3.select("svg").append("circle").attr("r",40).attr("fill","rgba(230, 230, 230, 0.25)").attr("cy",200).attr("cx",300)
d3.select("svg").append("circle").attr("r",45).attr("fill","rgba(230, 230, 230, 0.30)").attr("cy",200).attr("cx",350)
d3.select("svg").append("circle").attr("r",50).attr("fill","rgba(230, 230, 230, 0.35)").attr("cy",200).attr("cx",400)
d3.select("svg").append("circle").attr("r",55).attr("fill","rgba(230, 230, 230, 0.40)").attr("cy",200).attr("cx",450)
d3.select("svg").append("circle").attr("r",60).attr("fill","rgba(230, 230, 230, 0.45)").attr("cy",200).attr("cx",500)
d3.select("svg").append("circle").attr("r",65).attr("fill","rgba(230, 230, 230, 0.50)").attr("cy",200).attr("cx",550)
d3.select("svg").append("circle").attr("r",70).attr("fill","rgba(230, 230, 230, 0.55)").attr("cy",200).attr("cx",600)
//Load in contents of CSV file
d3.csv("123 datasets.csv", function(data) {
//Now CSV contents have been transformed into
//an array of JSON objects.
//Log 'data' to the console, for verification.
console.log(data);
});
</script>
</body>
</html>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment