module 4 data mining 1

Attached Files:

Data MIning Assignment_with rubric (181.304 KB)
usoccupations.xlsx (621.785 KB)
uscars.xlsx (31.055 KB)
uspopulation.xlsx (142.184 KB)

This assignment provides you with practice using R for data mining techniques. You will use R to classify and cluster dataset to show how data mining methods can be used to classify and cluster data.
Before beginning this assignment, review the learning resources for this module, especially Introduction to Data Mining with R from R DataMining.com, reviewing the steps taken to classify and cluster the iris data set in R.
The purpose of clustering is to form new classification from numerical variables. Therefore, it is important that you remove original classification from the data set prior to conducting clustering. For example, the species variable needs to be removed from the Iris data set because species is a classification. You may then merge back the original specifies variable and compare the newly formed clusters against the original classification to see how they differ.
Complete the following steps and write a report to record your work, results and analysis.

Install and load the *factoextra and **NbClust packages.
Select an appropriate data set in R or the MASS library and use the sample(), ctree() and predict() functions to build a decision tree and plot it. You may also use one of the data sets (usoccupations, uscars, uspopulation) attached to this module for this assignment (you may import directly or convert to CSV first).
Determine the appropriate number of clusters and produce a k-means cluster. Explain your findings.
Produce a density-based cluster with DBSCAN or use logistic regression to construct a binary classification and explain your findings.

*The factoextra package is used to determine the optimal number clusters for a given clustering methods and for data visualization
**The NcClust package provides 30 indices for determining the relevant number of clusters and the best clustering scheme from the different results obtained by varying all combinations of number of clusters, distance measures, and clustering methods. It can simultaneously compute all the indices and determine the number of clusters in a single function call.
Report
Your assignment/project should have a good cover/title page, introduction of what the goals of the project and the methods you use. It also should follow APA format with at least 1000 words (excluding title page and references page) and references page. In the body of your project you should incorporate the R codes and R outputs with interpretation of your results. Be sure to show all the elements in the official hypothesis, including the null and alternative hypothesis, the critical values, calculation of the test statistics and p-values. Finally, you need to make sense of your results to make good points with proper conclusions, to show your understanding of the course material and its application to the dataset.
Graphs, figures, charts, tables are very useful to increase visual effects to impress your readers. You also should do your best to give insight and understanding to the project with a good conclusion. Please use subtitles to make your assignment more reader friendly as well.
 
Do you need a similar assignment done for you from scratch? We have qualified writers to help you. We assure you an A+ quality paper that is free from plagiarism. Order now for an Amazing Discount! Use Discount Code “Newclient” for a 15% Discount!NB: We do not resell papers. Upon ordering, we do an original paper exclusively for you.

Order a similar paper and get 15% discount on your first order with us

Dr. Padma Myers
Dr. Padma Myers
98% Success Rate
Read More
“Hello, I deliver nursing papers on time following instructions from the client. My primary goal is customer satisfaction. Welcome for plagiarism free papers”
Stern Frea
Stern Frea
98% Success Rate
Read More
Hi! I am an English Language and Literature graduate; I have written many academic essays, including argumentative essays, research papers, and literary analysis.
Dr. Ishid Elsa
Dr. Ishid Elsa
98% Success Rate
Read More
"Hi, count on me to deliver quality papers that meet your expectations. I write well researched papers in the fields of nursing and medicine".
Dr. Paul P. Klug
Dr. Paul P. Klug
99% Success Rate
Read More
"A top writer with proven reliability and experience. I have a 99% success rate, overall rating of 10. Hire me for quality custom written nursing papers. Thank you"

How Our Essay Writing Service Works

Tell Us Your Requirements

Fill out order details and instructions, then upload any files or additional materials if needed. Then, confirm your order by clicking “Place an Order.”

Make your payment

Your payment is processed by a secure system. We accept Mastercard, Visa, Amex, and Discover. We don’t share any informati.on with third parties

The Writing Process

You can communicate with your writer. Clarify or track order with our customer support team. Upload all the necessary files for the writer to use.

Download your paper

Check your paper on your client profile. If it meets your requirements, approve and download. If any changes are needed, request a revision to be done.

Recent Questions

Help please!

Week 10 Discussion       Do you need a similar assignment done for you from scratch? We have qualified writers to help you. We

Read More »

help plz.

To support your work make sure to utilize your course and text readings. When asked also utilize outside sources as well. As in all assignments

Read More »

help to do this

Don’t overthink this. Use the fillable form 201 to document your IAP after you have considered what incident priorities you need to include. Look at

Read More »

Stay In Touch!

Leave your email and get discount promo codes and the best essay samples from our writers!