source: proiecte/Parallel-DT/R8/Data/soybean.names @ 22

Last change on this file since 22 was 22, checked in by (none), 14 years ago

blabla

File size: 6.9 KB
Line 
1|1. Title: Large Soybean Database
2|
3|2. Sources:
4|     (a) Michalski,R.S. Learning by being told and learning from
5|         examples: an experimental comparison of the two methodes of knowledge
6|         acquisition in the context of developing an expert system for soybean
7|         desease diagnoiss", International Journal of Policy Analysis and
8|         Information Systems, 1980, 4(2), 125-161.
9|     (b) Donor: Ming Tan & Jeff Schlimmer (Jeff.Schlimmer%cs.cmu.edu)
10|     (c) Date: 11 July 1988
11|
12|3. Past Usage:
13|      1. See above.
14|      2. Tan, M., & Eshelman, L. (1988). Using weighted networks to represent
15|         classification knowledge in noisy domains.  Proceedings of the Fifth
16|         International Conference on Machine Learning (pp. 121-134). Ann Arbor,
17|         Michigan: Morgan Kaufmann.
18|         -- IWN recorded a 97.1% classification accuracy
19|            -- 290 training and 340 test instances
20|      3. Fisher,D.H. & Schlimmer,J.C. (1988). Concept Simplification and
21|         Predictive Accuracy. Proceedings of the Fifth
22|         International Conference on Machine Learning (pp. 121-134). Ann Arbor,
23|         Michigan: Morgan Kaufmann.
24|         -- Notes why this database is highly predictable
25|
26|4. Relevant Information Paragraph:
27|    There are 19 classes, only the first 15 of which have been used in prior
28|    work.  The folklore seems to be that the last four classes are
29|    unjustified by the data since they have so few examples.
30|    There are 35 categorical attributes, some nominal and some ordered.  The
31|    value ``dna'' means does not apply.  The values for attributes are
32|    encoded numerically, with the first value encoded as ``0,'' the second as
33|    ``1,'' and so forth.  An unknown values is encoded as ``-1.''
34|
35|5. Number of Instances: 316
36|
37|6. Number of Attributes: 35 (all have been nominalized)
38|
39|7. Attribute Information:
40|    -- 19 Classes
41|     diaporthe-stem-canker, charcoal-rot, rhizoctonia-root-rot,
42|     phytophthora-rot, brown-stem-rot, powdery-mildew,
43|     downy-mildew, brown-spot, bacterial-blight,
44|     bacterial-pustule, purple-seed-stain, anthracnose,
45|     phyllosticta-leaf-spot, alternarialeaf-spot,
46|     frog-eye-leaf-spot, diaporthe-pod-&-stem-blight,
47|     cyst-nematode, 2-4-d-injury, herbicide-injury.   
48|
49|    1. date:           april,may,june,july,august,september,october,?.
50|    2. plant-stand:    normal,lt-normal,?.
51|    3. precip:         lt-norm,norm,gt-norm,?.
52|    4. temp:           lt-norm,norm,gt-norm,?.
53|    5. hail:           yes,no,?.
54|    6. crop-hist:      diff-lst-year,same-lst-yr,same-lst-two-yrs,
55|                        same-lst-sev-yrs,?.
56|    7. area-damaged:   scattered,low-areas,upper-areas,whole-field,?.
57|    8. severity:       minor,pot-severe,severe,?.
58|    9. seed-tmt:       none,fungicide,other,?.
59|   10. germination:    90-100%,80-89%,lt-80%,?.
60|   11. plant-growth:   norm,abnorm,?.
61|   12. leaves:         norm,abnorm.
62|   13. leafspots-halo: absent,yellow-halos,no-yellow-halos,?.
63|   14. leafspots-marg: w-s-marg,no-w-s-marg,dna,?.
64|   15. leafspot-size:  lt-1/8,gt-1/8,dna,?.
65|   16. leaf-shread:    absent,present,?.
66|   17. leaf-malf:      absent,present,?.
67|   18. leaf-mild:      absent,upper-surf,lower-surf,?.
68|   19. stem:           norm,abnorm,?.
69|   20. lodging:        yes,no,?.
70|   21. stem-cankers:   absent,below-soil,above-soil,above-sec-nde,?.
71|   22. canker-lesion:  dna,brown,dk-brown-blk,tan,?.
72|   23. fruiting-bodies:        absent,present,?.
73|   24. external decay: absent,firm-and-dry,watery,?.
74|   25. mycelium:       absent,present,?.
75|   26. int-discolor:   none,brown,black,?.
76|   27. sclerotia:      absent,present,?.
77|   28. fruit-pods:     norm,diseased,few-present,dna,?.
78|   29. fruit spots:    absent,colored,brown-w/blk-specks,distort,dna,?.
79|   30. seed:           norm,abnorm,?.
80|   31. mold-growth:    absent,present,?.
81|   32. seed-discolor:  absent,present,?.
82|   33. seed-size:      norm,lt-norm,?.
83|   34. shriveling:     absent,present,?.
84|   35. roots:          norm,rotted,galls-cysts,?.
85|
86|8. Number of Missing Attribute Values:
87|    1. date: 0
88|    2. plant-stand: 1
89|    3. precip: 8
90|    4. temp: 11
91|    5. hail: 7
92|    6. crop-hist: 41
93|    7. area-damaged: 1
94|    8. severity: 1
95|    9. seed-tmt: 41
96|   10. germination: 41
97|   11. plant-growth: 36
98|   12. leaves: 1
99|   13. leafspots-halo: 0
100|   14. leafspots-marg: 25
101|   15. leafspot-size: 25
102|   16. leaf-shread: 25
103|   17. leaf-malf: 26
104|   18. leaf-mild: 25
105|   19. stem: 30
106|   20. lodging: 1
107|   21. stem-cankers: 41
108|   22. canker-lesion: 11
109|   23. fruiting-bodies: 11
110|   24. external decay: 35
111|   25. mycelium: 11
112|   26. int-discolor: 11
113|   27. sclerotia: 11
114|   28. fruit-pods: 11
115|   29. fruit spots: 25
116|   30. seed: 35
117|   31. mold-growth: 29
118|   32. seed-discolor: 29
119|   33. seed-size: 35
120|   34. shriveling: 29
121|   35. roots: 35
122|
123|9. Class Distribution:
124|   1. diaporthe-stem-canker: 10
125|   2. charcoal-rot: 10
126|   3. rhizoctonia-root-rot: 10
127|   4. phytophthora-rot: 40
128|   5. brown-stem-rot: 20
129|   6. powdery-mildew: 10
130|   7. downy-mildew: 10
131|   8. brown-spot: 40
132|   9. bacterial-blight: 10
133|  10. bacterial-pustule: 10
134|  11. purple-seed-stain: 10
135|  12. anthracnose: 20
136|  13. phyllosticta-leaf-spot: 10
137|  14. alternarialeaf-spot: 40
138|  15. frog-eye-leaf-spot: 40
139|  16. diaporthe-pod-&-stem-blight: 6
140|  17. cyst-nematode: 6
141|  18. 2-4-d-injury: 1
142|  19. herbicide-injury: 4
143|
144|----------------------------------------------------------
145
146|  Classes (19)
147|  -------
148
149     diaporthe-stem-canker, charcoal-rot, rhizoctonia-root-rot,
150     phytophthora-rot, brown-stem-rot, powdery-mildew,
151     downy-mildew, brown-spot, bacterial-blight,
152     bacterial-pustule, purple-seed-stain, anthracnose,
153     phyllosticta-leaf-spot, alternarialeaf-spot,
154     frog-eye-leaf-spot, diaporthe-pod-&-stem-blight,
155     cyst-nematode, 2-4-d-injury, herbicide-injury.     
156
157
158|  Attributes
159|  ----------
160
161date:           april,may,june,july,august,september,october.
162plant-stand:    normal,lt-normal.
163precip:         lt-norm,norm,gt-norm.
164temp:           lt-norm,norm,gt-norm.
165hail:           yes,no.
166crop-hist:      diff-lst-year,same-lst-yr,same-lst-two-yrs,
167                 same-lst-sev-yrs.
168area-damaged:   scattered,low-areas,upper-areas,whole-field.
169severity:       minor,pot-severe,severe.
170seed-tmt:       none,fungicide,other.
171germination:    90-100%,80-89%,lt-80%.
172plant-growth:   norm,abnorm.
173leaves:         norm,abnorm.
174leafspots-halo: absent,yellow-halos,no-yellow-halos.
175leafspots-marg: w-s-marg,no-w-s-marg,dna.
176leafspot-size:  lt-1/8,gt-1/8,dna.
177leaf-shread:    absent,present.
178leaf-malf:      absent,present.
179leaf-mild:      absent,upper-surf,lower-surf.
180stem:           norm,abnorm.
181lodging:        yes,no.
182stem-cankers:   absent,below-soil,above-soil,above-sec-nde.
183canker-lesion:  dna,brown,dk-brown-blk,tan.
184fruiting-bodies:        absent,present.
185external decay: absent,firm-and-dry,watery.
186mycelium:       absent,present.
187int-discolor:   none,brown,black.
188sclerotia:      absent,present.
189fruit-pods:     norm,diseased,few-present,dna.
190fruit spots:    absent,colored,brown-w/blk-specks,distort,dna.
191seed:           norm,abnorm.
192mold-growth:    absent,present.
193seed-discolor:  absent,present.
194seed-size:      norm,lt-norm.
195shriveling:     absent,present.
196roots:          norm,rotted,galls-cysts.
Note: See TracBrowser for help on using the repository browser.