正在加载图片...
8 Support Vector Machines in R promotergene Promoters have a region where a protein (RNA polymerase)must make contact and the helical DNA sequence must have a valid conformation so that the two pieces of the contact region spatially align.The dataset in package kernlab contains DNA sequences of promoters and non-promoters in a data frame with 106 observations and 58 variables.The DNA bases are coded as follows:'a'adenine,'c'cytosine,'g' guanine,and 't'thymine. Vowel Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of LPC derived log area ratios.The vowels are indexed by integers 0 to 10.This dataset in package mlbench (Leisch and Dimitriadou 2001)has 990 observations on 10 independent variables. DNA in package mlbench consists of 3,186 data points (splice junctions).The data points are described by 180 indicator binary variables and the problem is to recognize the 3 classes ('ei','ie',neither),i.e.,the boundaries between exons (the parts of the DNA sequence retained after splicing)and introns(the parts of the DNA sequence that are spliced out). Breast Cancer in package mlbench is a data frame with 699 observations on 11 variables, one being a character variable,9 being ordered or nominal,and 1 target class.The objective is to identify each of a number of benign or malignant classes. BostonHousing Housing data in package mlbench for 506 census tracts of Boston from the 1970 census.There are 506 observations on 14 variables. B3 German Bussiness Cycles from 1955 to 1994 in package klaR.A data frame with 157 observations on the following 14 variables. Dataset #Examples #Attributes Class Distribution(%) b c m cl iris 150 5 2 33.3/33.3/33.3 spam 4601 572 39.40/60.59 musk 476 166 2 42.99/57.00 promotergene 106 57 2 50.00/50.00 Vowel 990 1 910 10.0/10.0/.. DNA 3186 180 2 24.07/24.07/51.91 BreastCancer 699 9 2 34.48/65.52 BostonHousing 506 12 (regression) B3 506 13 4 37.57/15.28/29.93/17.19 Table 1:The data sets used throughout the paper. Legend: b=binary,c=categorical, m=metric.cl number of classes. 4.ksvm in kernlab Package kernlab(Karatzoglou,Smola,Hornik,and Zeileis 2004)aims to provide the R user with basic kernel functionality (e.g.,like computing a kernel matrix using a particular kernel),8 Support Vector Machines in R promotergene Promoters have a region where a protein (RNA polymerase) must make contact and the helical DNA sequence must have a valid conformation so that the two pieces of the contact region spatially align. The dataset in package kernlab contains DNA sequences of promoters and non-promoters in a data frame with 106 observations and 58 variables. The DNA bases are coded as follows: ‘a’ adenine, ‘c’ cytosine, ‘g’ guanine, and ‘t’ thymine. Vowel Speaker independent recognition of the eleven steady state vowels of British English using a specified training set of LPC derived log area ratios. The vowels are indexed by integers 0 to 10. This dataset in package mlbench (Leisch and Dimitriadou 2001) has 990 observations on 10 independent variables. DNA in package mlbench consists of 3,186 data points (splice junctions). The data points are described by 180 indicator binary variables and the problem is to recognize the 3 classes (‘ei’, ‘ie’, neither), i.e., the boundaries between exons (the parts of the DNA sequence retained after splicing) and introns (the parts of the DNA sequence that are spliced out). BreastCancer in package mlbench is a data frame with 699 observations on 11 variables, one being a character variable, 9 being ordered or nominal, and 1 target class. The objective is to identify each of a number of benign or malignant classes. BostonHousing Housing data in package mlbench for 506 census tracts of Boston from the 1970 census. There are 506 observations on 14 variables. B3 German Bussiness Cycles from 1955 to 1994 in package klaR. A data frame with 157 observations on the following 14 variables. #Attributes Dataset #Examples b c m cl Class Distribution (%) iris 150 5 3 33.3/33.3/33.3 spam 4601 57 2 39.40/60.59 musk 476 166 2 42.99 / 57.00 promotergene 106 57 2 50.00 / 50.00 Vowel 990 1 9 10 10.0/10.0/... DNA 3186 180 3 24.07/24.07/51.91 BreastCancer 699 9 2 34.48 / 65.52 BostonHousing 506 1 12 (regression) B3 506 13 4 37.57/15.28/29.93/17.19 Table 1: The data sets used throughout the paper. Legend: b=binary, c=categorical, m=metric, cl = number of classes. 4. ksvm in kernlab Package kernlab (Karatzoglou, Smola, Hornik, and Zeileis 2004) aims to provide the R user with basic kernel functionality (e.g., like computing a kernel matrix using a particular kernel)
<<向上翻页向下翻页>>
©2008-现在 cucdc.com 高等教育资讯网 版权所有