Assignment Instructions

Assignment: Data Science Project
Assignment overview:
In recent years, advances in machine learning have opened the door to intelligent health care data prediction
and decision-making. A variety of machine learning algorithms can learn from complex historical
data to predict future events. Successful applications such as individualized diagnosis and prognosis, hospital
readmission prediction, and personalized medicine can lead to improvements in medical practice and health
care experiences.
Your final assignment works with two health care datasets: the HCV dataset (used for classification) and the
GBD dataset (used for clustering). The goal of this project is to follow the data science analysis pipeline:
pose interesting questions of your own choosing, acquire the data, perform data manipulations, design your
visualizations, build your predictive models, and present the results in a report format.
Classification — HCV dataset
Step 1: Get your dataset: You will use a health care dataset called the HCV dataset (retrieve it from
https://archive.ics.uci.edu/ml/datasets/HCV+data). Your goal is to classify the category (diagnosis) as
Blood Donor (blood donor + suspect blood donor) vs. Non-blood donor (Hepatitis + Fibrosis + Cirrhosis)
using the laboratory and demographic values given in the dataset.
Step 2: Raise two interesting questions about the dataset and prepare to answer them in your subsequent
analysis via data manipulation, visualization, predictive modeling, etc.
Step 3: Data manipulation and cleaning: Inspect your dataset, pre-process the data if necessary, and
justify each pre-processing decision.
Step 4: Exploratory data analysis: Perform initial investigations on the data using summary statistics and
visualizations.
Step 5: You will select two classification methods and apply them to the dataset for predictive modeling. The
performances of different models should be evaluated.
Step 6: Analyze the results
Step 7: Document all your findings
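The classification steps above can be sketched in R roughly as follows. This is only an illustrative outline, not a required solution: the data frame is a small synthetic stand-in (not the real HCV data), the `Category` labels and the column names `Age`, `ALT`, and `AST` are assumptions that should be checked against the downloaded file, and logistic regression and k-nearest neighbours are simply two example methods out of many you could choose.

```r
library(class)  # provides knn(); a recommended package shipped with standard R

set.seed(1)

# Synthetic stand-in for the HCV data (NOT the real dataset).
# The Category labels below are assumed; verify with unique(hcv$Category).
labels <- c("0=Blood Donor", "0s=suspect Blood Donor",
            "1=Hepatitis", "2=Fibrosis", "3=Cirrhosis")
n <- 200
hcv <- data.frame(
  Category = sample(labels, n, replace = TRUE),
  Age = round(runif(n, 20, 75)),
  ALT = rlnorm(n, 3, 0.5),
  AST = rlnorm(n, 3, 0.5)
)

# Step 1: collapse the five categories into the binary target.
is_donor <- hcv$Category %in% labels[1:2]
hcv$Donor <- factor(ifelse(is_donor, "Donor", "NonDonor"),
                    levels = c("Donor", "NonDonor"))

# Inflate AST for non-donors so the toy data contains some signal.
hcv$AST[!is_donor] <- hcv$AST[!is_donor] * 3

# Hold out 30% of the rows for evaluation.
idx <- sample(n, size = round(0.7 * n))
train <- hcv[idx, ]
test <- hcv[-idx, ]

# Method 1: logistic regression (models the probability of "NonDonor").
fit_glm <- glm(Donor ~ Age + ALT + AST, data = train, family = binomial)
p <- predict(fit_glm, newdata = test, type = "response")
pred_glm <- factor(ifelse(p > 0.5, "NonDonor", "Donor"),
                   levels = levels(hcv$Donor))

# Method 2: k-nearest neighbours on features scaled with training statistics.
feats <- c("Age", "ALT", "AST")
mu <- colMeans(train[, feats])
s <- apply(train[, feats], 2, sd)
train_x <- scale(train[, feats], center = mu, scale = s)
test_x <- scale(test[, feats], center = mu, scale = s)
pred_knn <- knn(train_x, test_x, cl = train$Donor, k = 5)

# Step 5: evaluate both models on the held-out rows.
accuracy <- function(pred, truth) mean(pred == truth)
acc <- c(logistic = accuracy(pred_glm, test$Donor),
         knn = accuracy(pred_knn, test$Donor))
print(table(Predicted = pred_glm, Actual = test$Donor))
print(acc)
```

With the real data, the synthetic data frame would be replaced by a `read.csv()` call on the downloaded file, and accuracy alone may be misleading if the two classes are imbalanced, so a confusion matrix or class-wise metrics are worth reporting as well.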
Clustering — GBD dataset
Step 1: Get your dataset: You will use a health care dataset from the Global Burden of Disease (GBD) Study,
available from the LMS.
NOTE: IHME GBD data 2017_F_csv is the GBD data for females in 2017; IHME GBD data 2017_M_csv is
the GBD data for males in 2017. YOU ONLY NEED TO SELECT ONE OF THEM FOR THE
FOLLOWING ANALYSIS.
Background of GBD: http://www.healthdata.org/gbd/about
Data retrieved from:
http://ghdx.healthdata.org/gbd-results-tool
http://ghdx.healthdata.org/record/ihme-data/global-health-spending-1995-2017
http://ghdx.healthdata.org/record/ihme-data/gbd-2017-socio-demographic-index-sdi-1950%E2%80%932017
Step 2: Raise two interesting questions about the dataset and prepare to answer them in your subsequent
analysis via data manipulation, visualization, clustering, etc.
Step 3: Data manipulation and cleaning: Inspect your dataset, pre-process the data if necessary, and
justify each pre-processing decision.
Step 4: Exploratory data analysis: Perform initial investigations on the data using summary statistics and
visualizations.
Step 5: You will select two clustering methods to identify groups of countries in the dataset. The
performances of different models should be evaluated.
Step 6: Analyze the results
Step 7: Document all your findings
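The clustering steps above can be sketched in R along the following lines. This is a hedged illustration only: the data frame is synthetic (not the real GBD file), the column names `death_rate`, `health_spend`, and `sdi` are hypothetical placeholders, and k-means, Ward hierarchical clustering, and the average silhouette width are example choices rather than the required methods or metric.

```r
library(cluster)  # provides silhouette(); a recommended package shipped with R

set.seed(42)

# Synthetic country-level table (NOT the real GBD data; columns are hypothetical).
gbd <- data.frame(
  country = paste0("Country_", 1:60),
  death_rate = c(rnorm(30, 500, 50), rnorm(30, 900, 50)),
  health_spend = c(rnorm(30, 3000, 300), rnorm(30, 300, 50)),
  sdi = c(runif(30, 0.7, 0.9), runif(30, 0.3, 0.5))
)

# Standardize the features so no single indicator dominates the distances.
x <- scale(gbd[, c("death_rate", "health_spend", "sdi")])
d <- dist(x)

# Method 1: k-means with several random restarts.
km <- kmeans(x, centers = 2, nstart = 25)

# Method 2: agglomerative hierarchical clustering (Ward linkage), cut at k = 2.
hc <- hclust(d, method = "ward.D2")
hc_cl <- cutree(hc, k = 2)

# Evaluate both with the average silhouette width (higher is better).
sil_km <- mean(silhouette(km$cluster, d)[, "sil_width"])
sil_hc <- mean(silhouette(hc_cl, d)[, "sil_width"])
print(c(kmeans = sil_km, hclust = sil_hc))

# Cross-tabulate the two partitions to see how far they agree.
print(table(kmeans = km$cluster, hclust = hc_cl))
```

In a real analysis the number of clusters would not be fixed at two in advance; comparing silhouette widths across several values of k is one common way to justify the choice.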
What you need to submit:
R file
An essential part of your project is your R coding. Your R file should record the steps in developing your
solutions and obtaining the final data analysis results. Make sure your code matches the findings you put in
the report. For example, if there are three separate plots in the report, your code should produce exactly the
same three separate plots.
Report
You also need to submit an in-depth report including two parts – classification and clustering. The following
components and discussions might be considered in each part:
Overview of the project: Provide an overview of the project, the goals, and the motivation for it. Keep in mind
that this will be read by people seeing your project for the first time.
Dataset: Describe the background of the dataset and provide summary statistics.
Interesting questions: What questions are you trying to answer? Do any questions evolve throughout the
project? Are there any new questions you consider in the course of your analysis? …
Data manipulation and cleaning: Are there any data pre-processing steps performed, and why? Are there any
questions that can be answered via data manipulation? …
Exploratory data analysis: What visualizations did you use to look at your data in different ways? Are there
any detected outliers? …
Predictive modelling: What are the various machine learning methods you considered? Justify the decisions
you made. What are the main ideas of the selected methods? How do you build the models? Are there any
concerns when designing your model? …
Final analysis: What did you learn about the data? Which method statistically outperformed the rest? Have
you found the answers to the questions you raised? How can you justify your answers? … Present
your results engagingly, using text and visualizations.
Conclusion: Are there any limitations of your study? What is your future work?
