R
Isolation Forest
· ☕ 3 min read · ✍️ Hoontaek Lee
knitr::opts_chunk$set(eval = TRUE, echo = TRUE, warning = FALSE)
References:
solitude: https://github.com/talegari/solitude
isofor: https://campus.datacamp.com/courses/anomaly-detection-in-r/isolation-forest
# install.packages("mvoutlier")
# install.packages("remotes")
# remotes::install_github("Zelazny7/isofor")
# import libraries
library(ggplot2)
library(ggpubr)
library(solitude) # hereafter, "sol"
library(isofor) # hereafter, "iso"
library(viridis)
packageVersion("solitude")
# create sample data
data("humus", package = "mvoutlier")
# 2-dimensional
columns_required <- c("Bi", "Cd")
humus2 <- humus[ , columns_required]
# multi-dimensional
columns_required_mul <- setdiff(colnames(humus), c("Cond", "ID", "XCOO", "YCOO", "LOI"))
humus_mul <- humus[ , columns_required_mul]
str(humus2)
str(humus_mul)
# plot data
ggplot(humus2, aes(x = Bi, y = Cd)) + geom_point(shape = 1, alpha = 0.

Parallel processing in R
· ☕ 4 min read · ✍️ Hoontaek Lee
Introduction: The workstation has monstrous performance, yet it just sits idle with nothing to do. Let's work these machines hard with parallel processing. (But then my code, the kind that using multiple cores will not fix…
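
As a minimal sketch of what putting idle cores to work can look like, the snippet below uses base R's `parallel` package. It is not taken from the post itself: the worker count of 2 and the toy function `slow_square` are assumptions chosen purely for illustration.

```r
library(parallel)

# Hypothetical stand-in for an expensive, independent task.
slow_square <- function(i) {
  Sys.sleep(0.1)  # pretend this is real work
  i^2
}

n_workers <- 2  # assumption; detectCores() reports what the machine has
cl <- makeCluster(n_workers)

# Each element of 1:8 is sent to a worker; results come back in input order.
res <- parLapply(cl, 1:8, slow_square)

stopCluster(cl)  # always release the workers when done

squares <- unlist(res)
print(squares)
```

This only helps when the iterations are independent of one another; a loop in which each step needs the previous result will not speed up this way.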

R packages by Hadley Wickham: Summary
· ☕ 32 min read · ✍️ Hoontaek Lee
Getting started. Introduction. Why make a package: to share code with other people; to write a manual so that other people can easily understand and use my code; so that I myself can easily understand and use my code, a manual…

Advanced R (2nd ed.)
· ☕ 5 min read · ✍️ Hoontaek Lee
Log 2020-04-12: 23, 24.6. At the research institute I wrote R code to QC meteorological data, and it gave me quite a hard time. The computer has 8 GB of memory. The csv files used as input were large, up to about 2 GB, but a problem with memory…

Rμ—μ„œ Open API μ‚¬μš©ν•˜κΈ° - 카카였 지도
· ☕ 3 min read · ✍️ Hoontaek Lee
Intro μ—°κ΅¬μ‹€μ—μ„œ μ—¬λŸ¬ μž₯μ†Œμ˜ μ£Όμ†Œ –> κ²½μœ„λ„ μ’Œν‘œ λ³€ν™˜μ„ ν•  일이 μžˆλ‹€. ν•˜λ‚˜ν•˜λ‚˜ μ›ΉνŽ˜μ΄μ§€μ—μ„œ ν•˜κΈ°λŠ” λ§ŽμœΌλ―€λ‘œ 코딩을 ν•˜κΈ°λ‘œ ν–ˆλ‹€. APIλ₯Ό μ‚¬μš©ν•˜λ©΄ λ˜λŠ” λ“―ν•˜λ‹€. API? Application

ISLR Note
· ☕ 34 min read · ✍️ Hoontaek Lee
2. Statistical Learning 2.1. What Is Statistical Learning? Y = f(X) + e. Here the function f is the systematic information about Y that X contains. e is the random error term that f cannot express. With respect to X, e…
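
To make the decomposition Y = f(X) + e concrete, here is a small simulation sketch in R. The linear form of `f`, the noise level, and the sample size are all assumptions chosen for illustration, not values from ISLR.

```r
set.seed(1)

n <- 1000
x <- runif(n, min = 0, max = 10)

# Assumed "true" systematic part f(X); in practice f is unknown.
f <- function(x) 2 + 3 * x

e <- rnorm(n, mean = 0, sd = 1)  # random error term with mean zero
y <- f(x) + e                    # Y = f(X) + e

# Estimate f from the data with a linear model.
fit <- lm(y ~ x)
est <- unname(coef(fit))
print(est)

# The residuals are the part of Y that the fitted f does not capture.
print(mean(resid(fit)))
```

The estimated coefficients should land close to the assumed intercept and slope, while the residuals play the role of e: whatever is left over once the systematic part has been accounted for.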