03.07.2018 Views

그것이 R고 싶다 - 맛보기

양중기 저 | 한빛미디어 | 2018년 07월 09일 32,000원

양중기 저 | 한빛미디어 | 2018년 07월 09일
32,000원

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

RRR<br />

201879<br />

<br />

262IT<br />

02–325–554402–336–7124<br />

199962410–1779ISBN979116224088593000<br />

<br />

<br />

<br />

<br />

<br />

www.hanbit.co.kr / ask@hanbit.co.kr<br />

PublishedbyHanbitMediaIncPrintedinKorea<br />

Copyright2018HanbitMediaInc<br />

<br />

<br />

<br />

writer@hanbit.co.kr


yjunggi@gmail.com<br />

9R20<br />

<br />

<br />

<br />

<br />

4


61<br />

<br />

<br />

IT<br />

<br />

<br />

<br />

<br />

5


R<br />

R<br />

R<br />

<br />

R<br />

R<br />

<br />

R<br />

<br />

<br />

6


RMROR<br />

R09910GUI<br />

<br />

R<br />

<br />

<br />

<br />

1<br />

<br />

2R<br />

32<br />

<br />

4<br />

<br />

<br />

R<br />

<br />

<br />

<br />

<br />

<br />

7


CONTENTS<br />

4<br />

5<br />

7<br />

PART 1 R<br />

CHAPTER1<br />

1.1 19<br />

111 19<br />

112 20<br />

113 22<br />

114 23<br />

1.2 24<br />

121 24<br />

122 26<br />

123 30<br />

CHAPTER2<br />

2.1 RMROR 45<br />

211R 46<br />

212MRO 54<br />

213R 60<br />

8


2.2 R 64<br />

221 64<br />

222 68<br />

223 71<br />

224RR 73<br />

PART 2 R<br />

CHAPTER3<br />

3.1 77<br />

311 78<br />

312 79<br />

313 80<br />

314 81<br />

315 81<br />

316 82<br />

3.2 84<br />

321 84<br />

322 85<br />

323 86<br />

324 88<br />

3.3 90<br />

331 92<br />

332 93<br />

9


CONTENTS<br />

333 94<br />

334 95<br />

3.4 98<br />

341 99<br />

342 101<br />

343 104<br />

3.5 106<br />

351if 106<br />

352switch 110<br />

353for 111<br />

354while 115<br />

CHAPTER4<br />

4.1 dplyrtidyr 118<br />

4.2 122<br />

421tbldf 123<br />

422glimpse 126<br />

423 127<br />

4.3 128<br />

431gather 128<br />

432spread 129<br />

433separate 130<br />

434unite 132<br />

4.4 133<br />

441filter 133<br />

10


442slice 134<br />

4.5 135<br />

451select 135<br />

4.6 139<br />

461bindcols 140<br />

462leftjoin 140<br />

463rightjoin 141<br />

464innerjoin 142<br />

465fulljoin 142<br />

4.7 143<br />

471bindrows 144<br />

472intersect 145<br />

473setdiff 145<br />

474union 146<br />

4.8 146<br />

481mutate 147<br />

482transmute 148<br />

PART 3 <br />

CHAPTER5<br />

5.1 CSVXLSTXT 151<br />

511CSV 154<br />

512 159<br />

11


CONTENTS<br />

513TXT 166<br />

5.2 XMLJSON 168<br />

521XML 170<br />

522JSON 173<br />

5.3 176<br />

531MSSQL 177<br />

532MySQL 182<br />

533dbplyrpool 183<br />

5.4 R 189<br />

5.5 featherfst 191<br />

CHAPTER6<br />

6.1 198<br />

611 198<br />

612p 200<br />

613 201<br />

614 206<br />

615 210<br />

6.2 213<br />

621 213<br />

622 215<br />

6.3 216<br />

631 216<br />

632t 218<br />

633 219<br />

12


CHAPTER7<br />

7.1 223<br />

7.2 227<br />

721 228<br />

722 229<br />

7.3 232<br />

731 232<br />

732R 233<br />

733 234<br />

7.4 239<br />

CHAPTER8<br />

8.1 ggplot 247<br />

811ggplot2 248<br />

812 248<br />

8.2 ggplot 252<br />

821 254<br />

822 255<br />

823colourgroup 258<br />

824 261<br />

825 263<br />

8.3 ggplot 266<br />

831 268<br />

832 275<br />

13


CONTENTS<br />

8.4 ggThemeAssist 280<br />

841ggThemeAssist 281<br />

842Settings 283<br />

843PanelBackground 284<br />

844Axis 288<br />

845Titleandlabel 291<br />

846Legend 293<br />

847SubtitleandCaption 296<br />

8.5 ggplotggThemeAssist 297<br />

PART 4 <br />

CHAPTER9<br />

9.1 303<br />

911 303<br />

912RR 309<br />

913R 315<br />

914 316<br />

9.2 319<br />

921 319<br />

922RR 325<br />

923R 329<br />

924 334<br />

14


CHAPTER10<br />

10.1 AWS 338<br />

1011 338<br />

1012R 340<br />

1013HTML 344<br />

1014R 347<br />

10.2 R 351<br />

1021R 352<br />

1022R 354<br />

1023 356<br />

1024 361<br />

1025 363<br />

10.3 shiny 365<br />

1031shiny 365<br />

1032 372<br />

1033 374<br />

10.4 shiny 377<br />

385<br />

15


CONTENTS<br />

16


PartI<br />

R<br />

1 <br />

17


Part I<br />

R<br />

1<br />

2<br />

<br />

<br />

18 1


CHAPTER 1<br />

<br />

19<br />

<br />

<br />

<br />

<br />

<br />

1.1 <br />

<br />

<br />

<br />

1.1.1 <br />

IoT<br />

<br />

1 <br />

19


1.1.2 <br />

<br />

<br />

<br />

<br />

20 1


DBA databaseadministration <br />

<br />

<br />

<br />

<br />

<br />

dataanalyst <br />

RSQL<br />

BI<br />

<br />

<br />

SQLR<br />

SQLBI<br />

<br />

R<br />

<br />

<br />

<br />

dataengineer AB<br />

<br />

<br />

<br />

<br />

<br />

<br />

datascientist featureengineering <br />

<br />

<br />

1 <br />

21


exploratory <br />

<br />

<br />

<br />

<br />

1.1.3 <br />

<br />

<br />

<br />

<br />

<br />

<br />

3V volume variety velocity <br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

22 1


RSQL<br />

<br />

PPT<br />

ggplot2<br />

<br />

<br />

1.1.4 <br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

RSQL<br />

<br />

<br />

<br />

<br />

<br />

<br />

1 <br />

23


1 2<br />

3 <br />

<br />

<br />

<br />

<br />

<br />

<br />

1.2 <br />

R<br />

R<br />

<br />

<br />

1.2.1 <br />

RR<br />

<br />

<br />

R<br />

<br />

R<br />

<br />

<br />

R<br />

R<br />

24 1


R<br />

R<br />

RevoScaleR<br />

inmemory R<br />

<br />

<br />

32GB<br />

<br />

<br />

<br />

RRevoScaleR<br />

<br />

SOAR<br />

MySQLMSSQLDB<br />

<br />

IaaS<br />

<br />

<br />

<br />

<br />

RRGUI<br />

<br />

1 <br />

25


1.2.2 <br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

Googleanalytics IT<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

ROI<br />

<br />

<br />

<br />

<br />

XMLPDFCSV<br />

<br />

<br />

26 1


AmazonWebServices AWS <br />

EC2AWS<br />

<br />

<br />

<br />

1 <br />

27


AWS1644<br />

617<br />

AWSGovCloud<br />

190<br />

<br />

AWS<br />

910AWS<br />

<br />

Azure R<br />

R<br />

R<br />

<br />

R100050<br />

28 1


RRRR<br />

InDB<br />

<br />

<br />

9<br />

<br />

MS<br />

DDoS<br />

<br />

LINE190<br />

30<br />

AWS<br />

<br />

1 <br />

29


AWS<br />

<br />

1<br />

<br />

HighMemory256GB2775<br />

AWS<br />

<br />

MicroCentOSRRAWS<br />

<br />

1.2.3 <br />

<br />

<br />

<br />

R<br />

<br />

30 1


Tableau <br />

<br />

<br />

<br />

<br />

<br />

1 <br />

31


APIAWSCubes<br />

SQL<br />

SQL<br />

<br />

32 1


MicroStrategy <br />

<br />

70<br />

<br />

<br />

<br />

<br />

AnnotationChart <br />

<br />

<br />

<br />

1 <br />

33


34 1


BI<br />

MSBI PowerBI <br />

<br />

<br />

<br />

MS<br />

BI<br />

<br />

<br />

<br />

<br />

BIRRRggplot2<br />

dplyrR<br />

<br />

1 <br />

35


Plotly R<br />

<br />

36 1


Rggplot2<br />

PT<br />

Rggplot2<br />

<br />

<br />

ggplot2<br />

104<br />

1 <br />

37


ggplot2<br />

<br />

<br />

<br />

38 1


1 <br />

39


PNG<br />

<br />

<br />

googleVis<br />

googleVisHTML5SVG<br />

googleVis<br />

<br />

<br />

<br />

<br />

<br />

40 1


1 <br />

41


GUI<br />

<br />

GUI<br />

42 1


googleVis<br />

<br />

1 <br />

43


44 1


CHAPTER 2<br />

<br />

12R<br />

<br />

RRRRR<br />

<br />

2.1 RMROR<br />

RR<br />

CentOS<br />

SQLSQL<br />

<br />

RGNUGPL GeneralPublicLicense <br />

MRO<br />

R<br />

<br />

<br />

2 <br />

45


2.1.1 R<br />

64OSRMRO<br />

MRO<br />

R R<br />

DownloadRforWindows<br />

<br />

46 1


PartII<br />

R<br />

3 <br />

75


Part II<br />

R<br />

3<br />

4<br />

<br />

<br />

76 2


CHAPTER 3<br />

<br />

<br />

<br />

<br />

3.1 <br />

categorical continuous <br />

<br />

<br />

15<br />

<br />

<br />

<br />

<br />

<br />

3 <br />

77


R integer floatingpoint<br />

number<br />

complexnumber text logicalvalue <br />

3.1.1 <br />

R<br />

R<br />

<br />

<br />

<br />

notavailable<br />

<br />

<br />

notanumber<br />

<br />

315490807560<br />

5<br />

<br />

<br />

3-1 NA<br />

<br />

<br />

R33<br />

<br />

32549080<br />

75605<br />

3-2 NULL<br />

<br />

<br />

78 2


CHAPTER 4<br />

<br />

<br />

<br />

<br />

<br />

RR<br />

dplyr tidyr <br />

dplyrRDBMSSQL<br />

pipe <br />

SQLsqldf<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

4 <br />

117


4.1 dplyrtidyr<br />

cheatsheet 4<br />

dplyrtidyr8<br />

<br />

dplyrtidyr<br />

<br />

PDFR<br />

<br />

R<br />

RHelpCheatsheets<br />

118 2


PartIII<br />

<br />

4 <br />

149


Part III<br />

<br />

5<br />

6<br />

7<br />

8<br />

<br />

<br />

<br />

<br />

150 3


CHAPTER 5<br />

<br />

<br />

TXT CSVXLS<br />

MySQLMSSQLJSON<br />

XMLR<br />

<br />

5.1 CSVXLSTXT<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

151


152 3


CHAPTER 6<br />

<br />

<br />

<br />

67<br />

p<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

t<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

ANCOVA<br />

6 <br />

197


t<br />

63<br />

<br />

6.1 <br />

<br />

<br />

<br />

6.1.1 <br />

R<br />

<br />

52015<br />

6120022013<br />

100<br />

<br />

6-1 <br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

198 3


CHAPTER 7<br />

<br />

<br />

<br />

<br />

<br />

p<br />

6<br />

<br />

7.1 <br />

correlationanalysis <br />

<br />

<br />

Spearman <br />

<br />

<br />

12345<br />

7 <br />

223


40<br />

<br />

<br />

<br />

<br />

1 23 45 <br />

40<br />

5140<br />

<br />

<br />

<br />

correlationcoefficient r<br />

r1r11<br />

1<br />

<br />

<br />

<br />

<br />

1007<br />

0703<br />

0301<br />

0101<br />

0103<br />

0307<br />

0710<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

<br />

4 Pearson <br />

224 3


CHAPTER 8<br />

<br />

<br />

<br />

<br />

<br />

R<br />

<br />

<br />

8.1 ggplot<br />

ggplot2RRR<br />

R<br />

<br />

ggplot2<br />

<br />

<br />

<br />

8 <br />

247


123ggplot2<br />

104<br />

8.1.1 ggplot<br />

81ggplot2<br />

<br />

8-1 ggplot<br />

<br />

<br />

8.1.2 <br />

ggplot2<br />

ggplot2<br />

<br />

R vignette <br />

<br />

<br />

<br />

HTMLPDFR<br />

<br />

248 3


PartIV<br />

<br />

9 <br />

301


Part II<br />

<br />

9<br />

<br />

10 <br />

302 4


CHAPTER 9<br />

<br />

<br />

<br />

<br />

<br />

9.1 <br />

AWS1IaaS<br />

AWSR<br />

9.1.1 <br />

AWS11<br />

<br />

<br />

9 <br />

303


AWS <br />

1<br />

304 4


CHAPTER 10<br />

<br />

<br />

<br />

<br />

<br />

PPT<br />

R<br />

<br />

<br />

<br />

markuplanguage RR<br />

R RMarkdown <br />

AWS<br />

RHTML<br />

<br />

10 <br />

337


10.1 AWS<br />

AWS73HTML<br />

<br />

10.1.1 <br />

91AMIRR<br />

<br />

URLAWS<br />

<br />

338 4


385<br />

<br />

<br />

dbplyr183<br />

dplyr117<br />

feather191<br />

fst192<br />

ggplot237214247379<br />

ggThemeAssist280<br />

jsonlite173<br />

knitr356<br />

pool183<br />

randomForest240<br />

RMySQL182<br />

shiny365377<br />

tidyr117<br />

XML170<br />

<br />

200<br />

26<br />

200<br />

101201<br />

253<br />

228<br />

229<br />

228<br />

200<br />

21<br />

21<br />

21<br />

88<br />

361<br />

239<br />

252255<br />

228<br />

85<br />

33<br />

104<br />

253<br />

86<br />

210<br />

77<br />

84<br />

99<br />

101201<br />

219<br />

210<br />

102<br />

224<br />

INDEX


386 <br />

INDEX<br />

<br />

aestheticmapping253<br />

alternativehypothesis200<br />

AmazonWebServicesAWS 27303338<br />

analysisofvarianceANOVA219<br />

array86<br />

Azure28319<br />

categorical77<br />

cheatsheet118266352377<br />

chi squaretest216<br />

chunk349356361<br />

confidencelevel200<br />

continuous77<br />

correlationanalysis223<br />

correlationcoefficient224<br />

dataanalyst21<br />

dataengineer21<br />

dataframe88<br />

datascientist21<br />

decisiontree232<br />

descriptivestatistics101201<br />

function98<br />

geometricobject253<br />

globalvariable104<br />

Googleanalytics26<br />

googleVis40<br />

label361<br />

layer252255<br />

list85<br />

localvariable104<br />

logisticregression228<br />

matrix86<br />

223<br />

215<br />

248<br />

200<br />

27303338<br />

28319<br />

77<br />

89201<br />

200<br />

200<br />

232<br />

104<br />

213<br />

104<br />

349356361<br />

118266352377<br />

216<br />

31<br />

261<br />

BI35<br />

89101201<br />

101201<br />

36377<br />

98<br />

86<br />

227<br />

p200<br />

R337<br />

R60<br />

t218


mean89101201<br />

MicrosoftROpenMRO 54<br />

MicroStrategy33<br />

multinomiallogisticregression229<br />

multipleregression228<br />

normaldistribution213<br />

nullhypothesis200<br />

p value200<br />

parameter104<br />

percentile210<br />

Plotly36377<br />

PowerBI35<br />

quantile210<br />

quartile102<br />

RMarkdown337<br />

randomforest239<br />

regressionanalysis227<br />

RStudio60<br />

Shapiro Wilktest215<br />

significancelevel200<br />

significanceprobability200<br />

simpleregression228<br />

standarddeviation101201<br />

statisticalobject261<br />

summarystatistic89201<br />

t test218<br />

Tableau31<br />

testofhypothesis200<br />

variable99<br />

variance101201<br />

vector84<br />

vignette248<br />

<br />

387

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!