Data mining for business intelligence: concepts, techniques, and applications in Microsoft Office Excel with XLMiner / Galit Shmueli, Nitin R. The right of Galit Shmueli, Peter C. Bruce, Inbal Yahav, Nitin R. Patel, and Kenneth C. . Data Mining Software: The State of the Market (by Herb Edelstein) . The idea that we can use sensors to connect physical objects such as homes. Shmueli [PDF] [EPUB] Data Mining for Business Intelligence: Concepts, Techniques, and. Applications in Microsoft Office Excel with XLMiner.
|Language:||English, Spanish, German|
|Distribution:||Free* [*Registration Required]|
Data Mining for Business Analytics: Concepts, Techniques, and Applications in XLMiner®, Third Edition presents an applied approach to data mining and. the details of analytical methods and data mining and visualization tools such as XLMiner and Node XL. association. 5. Real world data analytic and business intelligence applications by Galit Shmueli, Nitin R. Patel, Peter C. Bruce. Publisher: Mohsin, Shi Guangyu, Guangyu Shi, HBS GSPDF-ENG. 3 . “Harrah's. Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner by Galit Shmueli, Nitin R.
Prediction and Classification Methods 6. Multiple Linear Regression 6. Introduction 6. Explanatory versus Predictive Modeling 6. Estimating the Regression Equation and Prediction 6. Variable Selection in Linear Regression 6. Reducing the Number of Predictors 6. How to Reduce the Number of Predictors 6.
Determining Neighbors 7. Classification Rule 7. Riding Mowers 7. Choosing k 7. Setting the Cutoff Value 7. Advantages and Shortcomings of k-NN Algorithms 7.
Naive Bayes 8. Introduction 8. Predicting Fraudulent Financial Reporting 8. Applying the Full Exact Bayesian Classifier 8. Example 3: Predicting Delayed Flights 8. Advantages and Shortcomings of the Naive Bayes Classifier 8. Classification and Regression Trees 9.
Introduction 9. Classification Trees 9. Recursive Partitioning 9. Riding Mowers 9. Measures of Impurity 9. Tree Structure 9. Classifying a New Observation 9. Evaluating the Performance of a Classification Tree 9. Acceptance of Personal Loan 9.
Avoiding Overfitting 9. Stopping Tree Growth: CHAID 9.
Pruning the Tree 9. Classification Rules from Trees 9. Classification Trees for More Than two Classes 9. Regression Trees 9. Prediction 9. Measuring Impurity 9. Evaluating Performance 9. Advantages, Weaknesses, and Extensions 9. Logistic Regression Introduction Logistic Regression Model Acceptance of Personal Loan Model with a Single Predictor Estimating the Logistic Model from Data: Computing Parameter Estimates Interpreting Results in Terms of Odds Evaluating Classification Performance Variable Selection Impact of Single Predictors Example of Complete Analysis: Predicting Delayed Flights Data Preprocessing Model Fitting and Estimation Model Interpretation Model Performance Logistic Regression for Profiling Appendix A: Appendix B: Evaluating Goodness of Fit Appendix C: Neural Nets Concept and Structure of a Neural Network Fitting a Network to Data Tiny Dataset Computing Output of Nodes Preprocessing the Data Training the Model Classifying Accident Severity Avoiding Overfitting Using the Output for Prediction and Classification Required User Input Exploring the Relationship Between Predictors and Response Advantages and Weaknesses of Neural Networks Discriminant Analysis Riding Mowers Personal Loan Acceptance Distance of an Observation from a Class Fisher's Linear Classification Functions Classification Performance of Discriminant Analysis Prior Probabilities Unequal Misclassification Costs Classifying More Than Two Classes Medical Dispatch to Accident Scenes Advantages and Weaknesses Mining Relationships Among Records Association Rules Discovering Association Rules in Transaction Databases Synthetic Data on downloads of Phone Faceplates Generating Candidate Rules The Apriori Algorithm Selecting Strong Rules Sign in.
Get my own profile Cited by View all All Since Citations h-index 37 26 iindex 81 Kimberly F. Ron S. Verified email at thomaslotze. Akhmed Umyarov University of Minnesota. Carlson School of Management.
Verified email at umn. Mayukh Dass J. Gerhard Tutz Verified email at stat. View all. Verified email at mx. Articles Cited by Co-authors. Title Cited by Year To explain or to predict? G Shmueli. Like most full time data miners, I would have difficulty living within the constraints of Excel.
XLMiner is a fine piece of software, but it lives inside Excel as an Excel add-on. The most famous limitation is having no more than 1,, rows of data, but that nature of that limitation applied to Data Mining is frequently misunderstood.
I am often on projects with "big data" clients where I only model , or fewer records. XLMiner allows you to read from a database larger than Excel can handle, and let's you write out to a database larger than Excel can handle.
I was surprised and impressed by this. In the end, though, it still isn't enough. I need to be able to merge and manipulate my large data files so that I can carefully select the smaller fraction that I am going to model. In short, I can't live without my more powerful tools. There is an essay offered as a sidebar in the book on the state of the Data Mining Software Tools market by Herb Edelstein which discusses exactly this fact.
XLMiner was originally developed as a piece of teaching software, and it excels at that. It doesn't intend to be a deployment tool for the whole business enterprise like some of the more powerful Data Mining suites. If you don't have access to such tools you might be pleasantly surprised what it can do since the other tools are many times more expensive.
Despite this limitation, this is a strong book. It would be great for a first course in Data Mining provided that it wasn't the first of many. If someone were about to embark on a Data Mining advanced degree, I don't think this book is the best route to go.
I also think it is an outstanding choice for a seminar leader that wants to offer demonstrations for the audience.
I would suggest providing the audience with copies or allowing them to get them. What a great way to learn the material - by doing. I debated using this book for exactly that purpose and ended up going with the Handbook of Statistical Analysis and Data Mining Applications only because I felt my audience, representing larger companies, would end up using one the Data Mining suites in the end, and I wanted them to see them.
I would also suggest this book for self study. It is as easy a read as this kind of material is going to get. Light reading? Not really. However, Data Mining algorithms never make for light reading. What you hope for is clarity, and the right amount of detail.
For the uninitiated, this is perfect. For Data Mining professionals, it would be just a very basic review. Some reviewers seems to have found it a tough slog. It is very much in the style of "here is the rough idea - try a case study".
If you've never studied statistics, there is no careful walk through of the formulas, but that is not the point of the book. Lots of other books do that. If you want to know how Data Mining works "under the hood" you won't really find that here either. For example, Regression is covered in about 15 pages.
Overall, I think it makes good choices in terms of detail. It covers all the material you need in an introduction. It offers a very brief initial chapter defining the subject.
It does a decent job at data visualization. It is a basic introduction the algorithms with supporting case studies.
The is almost no data preparation because XLMiner is not designed to do any heavy lifting here. It can do partitioning and explains why this is critical to data mining.
A surprising number of the famous techniques are here: The case studies are fairly basic, but well described. They are easy to download from the website. Again, perfect for a first course in Data Mining. Everything an instructor would need for a good solid introduction - exactly the audience the book was written for.
I was made to download this book as a part of one of my marketing classes in college. While the book explains the concepts decently well, the processes to answer the questions asked at the end of the chapters were often not explained.
The book works in conjunction with a software called DataMiner XL and the questions asked in the chapters relate to this software. The problem is that the book asks you to perform complex tasks, but never goes ahead to explain how to do them using the software. It is up to you to do external research in order to find out how to complete the tasks.
I believe that an instructional book such as this one should offer step by step guidance on the processes in which they expect you to perform. I would not recommend this book to anybody based on these reasons. The reason it got 2 stars is because the concepts were explained decently well. This book does a great job with overviews. By that, I mean it explains a general topic fairly well.