1
2
3
4
5
#

Name:

Description:

Machine Learning with R by Brett Lantz is a book that provides an introduction to machine learning using R. As far as I can tell, Packt Publishing does not make its datasets available online unless you buy the book and create a user account which can be a problem if you are checking the book out from the library or borrowing the book from a friend. All of these datasets are in the public domain but simply needed some cleaning up and recoding to match the format in the book.

Variables:

Columns

  • age: age of primary beneficiary

  • sex: insurance contractor gender, female, male

  • bmi: Body mass index, providing an understanding of body, weights that are relatively high or low relative to height,
    objective index of body weight (kg / m ^ 2) using the ratio of height to weight, ideally 18.5 to 24.9

  • children: Number of children covered by health insurance / Number of dependents

  • smoker: Smoking

  • region: the beneficiary's residential area in the US, northeast, southeast, southwest, northwest.

  • charges: Individual medical costs billed by health insurance

Link To Google Sheets:

Rows:

Columns:

License Type:

References/Notes/Attributions:

Acknowledgments

The dataset is available on GitHub here.

R Dataset Upload:

Use the following R code to directly access this dataset in R.

d <- read.csv("https://www.key2stats.com/Medical_Cost_Personal_Datasets_1552_91.csv")

R Coding Interface:


Datasets Tag Questions & Instructional Blocks

NumberContentType
#PROBLEM-49325

For the following questions, we'll analyze the "Medical Cost Personal Datasets". ...

Question
#PROBLEM-49329

For this exercise, we will analyze some data from the  "Medical Cost Personal Datasets&quo...

Question
Showing 1-2 of 2 items.