
http://tiny.cc/regress

Example regression datasets: 61 CSV files (abalone, auto93, housing, ...) with self-describing headers. Column names encode type and goal, so no separate schema files are needed. Data only, no code.

# install
git clone http://tiny.cc/konfig ../konfig
git clone http://tiny.cc/regress regress && cd regress
make help


Sections: NAME | DATA | FILES | SEE ALSO | LICENSE | AUTHOR

NAME

regress - regression benchmark CSVs. Headers are the schema;
goal columns end in '+' (maximize) or '-' (minimize).

DATA

CSV with self-describing header; no separate schema file:

  first char UPPER  -> numeric (Num)
  first char lower  -> symbolic (Sym)
  suffix '+'        -> numeric goal, maximize
  suffix '-'        -> numeric goal, minimize
  suffix '!'        -> symbolic goal (klass)
  suffix 'X'        -> ignore
  else              -> predictor
  cell '?'          -> missing value
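
The type and missing-value rules above can be sketched as a cell converter. This is a minimal illustration, not code from this repository: the function name coerce, and the choice of float for numeric cells and None for missing ones, are assumptions.

```python
from typing import Optional, Union

# Hypothetical alias: a parsed cell is a number, a symbol, or missing (None).
Cell = Optional[Union[float, str]]


def coerce(raw: str, column: str) -> Cell:
    """Convert one raw CSV cell using the header name of its column."""
    raw = raw.strip()
    if raw == "?":              # '?' marks a missing value
        return None
    if column[0].isupper():     # first char UPPER -> numeric (Num)
        return float(raw)       # assumption: store numerics as float
    return raw                  # first char lower -> symbolic (Sym)


print(coerce("?", "Mpg+"))      # missing cell
print(coerce("8", "Clndrs"))    # numeric cell
print(coerce("1", "origin"))    # symbolic cell, kept as text
```

Note that a symbolic column keeps digits as text, so "1" under origin stays a string.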

E.g. auto93.csv: Clndrs,Volume,HpX,Lbs-,Acc+,Model,origin,Mpg+
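
As a sketch of how a reader might apply these rules to the header above, the following splits the auto93.csv columns into goals, predictors, and ignored columns. The names ROLES and describe are hypothetical, and treating type (first character) and role (last character) as independent checks is an assumption.

```python
# Suffix -> role, following the DATA rules; anything else is a predictor.
ROLES = {"+": "maximize", "-": "minimize", "!": "klass", "X": "ignore"}


def describe(name):
    """Return (name, type, role) for one header cell."""
    name = name.strip()
    kind = "Num" if name[0].isupper() else "Sym"
    role = ROLES.get(name[-1], "predictor")
    return name, kind, role


HEADER = "Clndrs,Volume,HpX,Lbs-,Acc+,Model,origin,Mpg+"
columns = [describe(cell) for cell in HEADER.split(",")]

goals = [n for n, _, role in columns
         if role in ("maximize", "minimize", "klass")]
predictors = [n for n, _, role in columns if role == "predictor"]
ignored = [n for n, _, role in columns if role == "ignore"]

print("goals:     ", goals)
print("predictors:", predictors)
print("ignored:   ", ignored)
```

Under these assumptions the goals are Lbs-, Acc+ and Mpg+, HpX is ignored, and origin is the only symbolic predictor.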

FILES

2dplanes.csv | abalone.csv | ailerons.csv | auto93.csv | autohorse.csv | autompg.csv | autoprice.csv | bank32nh.csv | bank8FM.csv | baskball.csv | bodyfat.csv | bolts.csv | breasttumor.csv | cal.housing.csv | cholesterol.csv | cleveland.csv | cloud.csv | cpu.act.csv | cpu.csv | cpu.small.csv | delta.ailerons.csv | delta.elevators.csv | detroit.csv | diabetes.numeric.csv | echomonths.csv | elevators.csv | elusage.csv | fishcatch.csv | fried.csv | fruitfly.csv | gascons.csv | house16H.csv | house8L.csv | housing.csv | hungarian.csv | kin8nm.csv | longley.csv | lowbwt.csv | machine.cpu.csv | mbagrade.csv | meta.csv | mv.csv | pbc.csv | pharynx.csv | pol.csv | pollution.csv | puma32H.csv | puma8NH.csv | pwlinear.csv | pyrim.csv | quake.csv | schlvote.csv | sensory.csv | servo.csv | sleep.csv | stock.csv | strike.csv | triazines.csv | veteran.csv | vineyard.csv | wisconsin.csv

SEE ALSO

konfig    http://tiny.cc/konfig   shared Makefile, dotfiles
optimiz   http://tiny.cc/optimiz  optimization datasets
klassif   http://tiny.cc/klassif  classification datasets
luamine   http://tiny.cc/luamine  code that reads these files

LICENSE

MIT. https://choosealicense.com/licenses/mit/

AUTHOR

Tim Menzies <timm@ieee.org>
