R data splitting

Revision as of 20:21, 5 May 2020 (Tue) by Jmnote (→Overview)

2021-10-13

R
Iris

1 Overview

Splitting a data frame into training and test sets in R.

nrow(iris) # 150
test_idx   <- seq(5, nrow(iris), by = 5) # every 5th row number: 5, 10, ..., 150
train_data <- iris[-test_idx, ] # exclude every 5th row
test_data  <- iris[ test_idx, ] # keep only every 5th row
nrow(train_data) # 120
nrow(test_data)  # 30
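
The split above is systematic, so the result depends on the order of the rows. As a minimal sketch of the random alternative, the following uses only base R to draw an 80/20 split. The seed value and the names `train_idx`, `train_rand`, and `test_rand` are illustrative assumptions, not part of the original example.

```r
set.seed(42)                                    # arbitrary seed, only for reproducibility
n          <- nrow(iris)                        # 150
train_idx  <- sample(n, size = floor(0.8 * n))  # 120 row numbers, drawn without replacement
train_rand <- iris[ train_idx, ]                # rows selected for training
test_rand  <- iris[-train_idx, ]                # all remaining rows
nrow(train_rand) # 120
nrow(test_rand)  # 30
```

Because the rows are drawn at random without regard to `Species`, the class proportions in the two sets may differ slightly from those of the full data.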

df <- read.table(header = TRUE, stringsAsFactors = FALSE, text = "
name  major gpa
Alice Math  3.3
Bob   Math  3.4
Carol Math  3.5
Dave  Math  3.6
Erin  Math  3.7
Frank Math  3.7
Grace Chem  3.6
Heidi Chem  3.6
Ivan  Chem  3.7
Judy  Chem  3.8
Kelly Chem  3.9
Lee   Chem  4.0
")
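library(caTools)
set.seed(1) # sample.split() is random; fix a seed so the split is reproducible
# Split on the label so that about 3/4 of each major goes to the training set
msk   <- sample.split(df$major, SplitRatio = 3/4)
train <- df[ msk, ]
test  <- df[!msk, ]
print(train)
print(test)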
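
For comparison, a per-group split like the one above can be approximated without caTools. The following is a minimal base-R sketch, not the way `sample.split` is implemented: it rebuilds the same 12 rows so that it runs on its own, and the names `rows_by_major`, `train_rows`, `train_base`, and `test_base`, the seed, and the choice of `floor()` for the group size are all assumptions made for illustration.

```r
# Same 12 rows as the example above, rebuilt so this block is self-contained
df <- data.frame(
  name  = c("Alice", "Bob", "Carol", "Dave", "Erin", "Frank",
            "Grace", "Heidi", "Ivan", "Judy", "Kelly", "Lee"),
  major = rep(c("Math", "Chem"), each = 6),
  gpa   = c(3.3, 3.4, 3.5, 3.6, 3.7, 3.7, 3.6, 3.6, 3.7, 3.8, 3.9, 4.0),
  stringsAsFactors = FALSE
)

set.seed(1)  # arbitrary seed, only for reproducibility

# Group the row numbers by major, then draw floor(3/4 * group size) from each group
rows_by_major <- split(seq_len(nrow(df)), df$major)
train_rows <- unlist(lapply(rows_by_major, function(rows) {
  rows[sample.int(length(rows), size = floor(0.75 * length(rows)))]
}))

train_base <- df[ train_rows, ]
test_base  <- df[-train_rows, ]

table(train_base$major)  # 4 rows per major
table(test_base$major)   # 2 rows per major
```

Indexing through `sample.int()` rather than calling `sample()` on the row numbers directly avoids the surprising behavior of `sample()` when a group contains a single row.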

2 See also

Retrieved from "https://zetawiki.com/w/index.php?title=R_데이터_분할&oldid=578665"

Modified 2021-10-13, created 2020-05-04

CC-BY-SA 3.0 · Powered by MediaWiki