Free Essay

Intro to Data Mining

In: Computers and Technology

Submitted By mac2jen
Words 481
Pages 2
Assignment 1: Introduction
Miles A. Cabanos
Data Mining - 10303 CAP4770

Q1: Present an example where data mining is crucial to the success of a business. What data mining function does this business need? Can they be performed alternatively by data query processing or simple statistical analysis?
Best example that I could think of would be Amazon.com. I think one of the functions the business uses is the characterization. This way it can keep track of what type of products customers buy, and then the pop-up windows to suggest similar items. No, I do not think they could be performed by data query processing or simple statistical analysis.
Q2: Define each of the following data mining functionalities: characterization, discrimination, association and correlation analysis, classification, prediction and clustering. Give examples of each data mining functionality, using a real-life database with which you are familiar.
 Characterization: Given data that shares similarities of characteristics and/or other requested specific information. (Example: Data from Amazon.com customers buying superhero books, can then be used to determine what age group buys superhero books. Then can suggest other types of similar books)
 Discrimination: Compares or contrasts data information. (Example: Amazon could find out what customers types buy more superhero book compared to biographies)
 Association and correlation analysis: How certain types of data can be associated with each other. Patterns, relationships, or correlations. (Example: Amazon could also find out what types of items sell together like people who buy cooking with the George Forman grill also buy a George Foreman grill.
 Classification: Data that is can be used to create certain labels for certain classes with certain distinctions or that maybe unknown. (Example: Amazon could create classes of customer of age groups, sex, etc. that buy action or science fiction books)
 Prediction: Data gathered to guess the future of certain outcomes. (Example: Amazon collects data from the past two years of Christmas sales of certain item. They can try to predict sales, amount of items to have on hand in stock.)
 Clustering: Data that may not yet have classifications or labels, raw data gathered together that may share similarities or patterns. (Example: Amazons new data that may not yet be classed, but can be found through clusters of similar attributes.)
Q3: What are the major challenges of mining a huge amount of data (such as billions of tuples) in comparison with mining a small amount of data (such as a few hundred tuple data set)?
It seems that if I understand correctly it would be efficiency and scalability algorithms would have to be efficient, while at the same time be quick enough to be usable. It seems like that is one of the issues of huge amounts of data. Sometimes would have to be mined in partial pieces. Dealing with smaller amounts of data, algorithms are more easily configured to run efficiently for quicker performance.…...

Similar Documents

Premium Essay

Data Mining

...Data Mining Jenna Walker Dr. Emmanuel Nyeanchi Information Systems Decision Making May 30, 2012 Abstract Businesses are utilizing techniques such as data mining to create a competitive advantage customer loyalty. Data mining allows business to analyze customer information, such as demographics and purchase history for a better understanding of what the customers need and what they will respond to. Data mining currently takes place in several industries, and will only become even more widespread as the benefits are endless. The purpose of this paper is to gain research and examine data mining, its benefits to businesses, and issues or concerns it will need to overcome. Real world case studies of how data mining is used will also be presented for a deeper understanding. This study will show that despite its disadvantages, data mining is an important step for a business to better understand its customers, and is the future of business marking and operational planning. Tools and Benefits of data mining Before examining the benefits of data mining, it is important to understand what data mining is exactly. Data mining is defined as “a process that uses statistical, mathematical, artificial intelligence, and machine-learning techniques to extract and identify useful information and subsequent knowledge from large databases, including data warehouses” (Turban & Volonino, 2011). The information identified using data mining includes patterns indicating......

Words: 1900 - Pages: 8

Premium Essay

Data Mining

...Data Mining 0. Abstract With the development of different fields, artificial intelligence, machine learning, statistic, database, pattern recognition and neurocomputing they merge to a newly technology, the data mining. The ultimate goal of data mining is to obtain knowledge from the large database. It helps to discover previously unknown patterns, most of the time it is followed by deeper manual evaluation to explain and correlate the results to establish a new knowledge. It is often practically used by government, bank, insurance company and medical researcher. A general basic idea of data mining would be introduced. In this article, they are divided into four types, predictive modeling, database segmentation, link analysis and deviation detection. A brief introduction will explain the variation among them. For the next part, current privacy, ethical as well as technical issue regarding data mining will be discussed. Besides, the future development trends, especially concept of the developing sport data mining is written. Last but not the least different views on data mining including the good side, the drawback and our views are integrated into the paragraph. 1. Introduction This century, is the age of digital world. We are no longer able to live without the computing technology. Due to information explosion, we are having difficulty to obtain knowledge from large amount of unorganized data. One of the solutions, Knowledge Discovery in Database (KDD) is......

Words: 1700 - Pages: 7

Premium Essay

Data Mining

...Data mining is an iterative process of selecting, exploring and modeling large amounts of data to identify meaningful, logical patterns and relationships among key variables.  Data mining is used to uncover trends, predict future events and assess the merits of various courses of action.             When employing, predictive analytics and data mining can make marketing more efficient. There are many techniques and methods, including business intelligence data collection. Predictive analytics is using business intelligence data for forecasting and modeling. It is a way to use predictive analysis data to predict future patterns. It is used widely in the insurance, medical and credit industries. Assessment of credit, and assignment of a credit score is probably the most widely known use of predictive analytics. Using events of the past, managers are able to estimate the likelihood of future events. Data mining aids predictive analysis by providing a record of the past that can be analyzed and used to predict which customers are most likely to renew, purchase, or purchase related products and services. Business intelligence data mining is important to your marketing campaigns. Proper data mining algorithms and predictive modeling can narrow your target audience and allow you to tailor your ads to each online customer as he or she navigates your site. Your marketing team will have the opportunity to develop multiple advertisements based on the past clicks of your visitors.......

Words: 1136 - Pages: 5

Premium Essay

Data Mining

...Data Mining Objectives: Highlight the characteristics of Data mining Operations, Techniques and Tools. A Brief Overview Online Analytical Processing (OLAP): OLAP is the dynamic synthesis, analysis, and consolidation of large volumns of multi-dimensional data. Multi-dimensional OLAP support common analyst operations, such as: ▪ Considation – aggregate of data, e.g. roll-ups from branches to regions. ▪ Drill-down – showing details, just the reverse of considation. ▪ Slicing and dicing – pivoting. Looking at the data from different viewpoints. E.g. X, Y, Z axis as salesman, Nth quarter and products, or region, Nth quarter and products. A Brief Overview Data Mining: Construct an advanced architecture for storing information in a multi-dimension data warehouse is just the first step to evolve from traditional DBMS. To realize the value of a data warehouse, it is necessary to extract the knowledge hidden within the warehouse. Unlike OLAP, which reveal patterns that are known in advance, Data Mining uses the machine learning techniques to find hidden relationships within data. So Data Mining is to ▪ Analyse data, ▪ Use software techniques ▪ Finding hidden and unexpected patterns and relationships in sets of data. Examples of Data Mining Applications: ▪ Identifying potential credit card customer groups ▪ Identifying buying patterns of customers. ▪ Predicting trends of......

Words: 1258 - Pages: 6

Premium Essay

Data Mining

...Data Mining/Data Warehousing Matthew P Bartman Strayer University Ibrahim Elhag CIS 111– Intro to Relational Database Management June 9, 2013 Data Mining/Data Warehousing When it comes to technology especially in terms of storing data there are two ways that it can be done and that is through data mining and data warehousing. With each type of storage there are trends and benefits. In terms of data warehousing there are 5 key benefits one of them being that it enhance business intelligence. What this means is that business processes can be applied directly instead of things having to be done with limited information or on gut instinct. Another benefit of data warehousing is that it can also save time meaning that if a decision has to be made the data can be retrieved quickly instead of having to find data from multiple sources. Not only does data warehousing enhance business intelligence and save time but it can also enchance data quality and consistency.This is accomplished by converting all data into one common format and will make it consistent with all departments which ensures accuracy with the data as well. While these key benefits another one is that it can provide historical intelligence which means that analayze different time periods and trends to make future predictions. One other key benefit is that it provides a great return on investment. The reason being that a data warehouse generates more......

Words: 2018 - Pages: 9

Premium Essay

Data Mining

...Data Mining I found the topic of data mining very interesting in that it uncovers coveted information needed for improving and refining our daily lives. Information regarding traffic patterns, flight arrivals, consumer purchases, education, is collected and analyzed to improve a particular model. The data mining process is designed to gather information from a targeted sample which will enable companies to refine their business model in order to become more profitable. This process is not engineered to accumulate more information for an organization but to extract more meaningful information and correlate patterns of information that already exists in their data base. The importance of this information will allow companies to better analyze information to make quick effective decisions which will spur productivity. Data mining in turn can monitor and analyze these results to effectively manage assets. Organizations will be able to better predict the results of their decision making. How Data Mining Works A sample size is created by targeting large amounts of relevant information that is small enough to process. The information is then studied to find relationships which were anticipated , analyze trends, and recognize irregularities to gain knowledge for a design. “The data is then modified to transform the variables to focus the model selection process. A model is then selected by using analytical tools to search for a combination of data that......

Words: 888 - Pages: 4

Premium Essay

Data Mining

...Data Mining Information Systems for Decision Making 10 December 2013 Abstract Data mining the next big thing in technology, if used properly it can give businesses the advance knowledge of when they are going to lose customers or make them happy. There are many benefits of data mining and it can be accomplished in different ways. The problem with data mining is that it is only as reliable as the data going in and the way it is handled. There are also privacy concerns with data mining. Keywords: data mining, benefits, privacy concerns Data Mining Benefits of Data Mining for a Business Data mining can be explained as the process of a business collecting data on their customers or potential customers to increase customer business. A business will collect data on their customers or potential customers and use that data to give them coupons, promote sells, and analyze buying and selling trends. Data mining can benefit the customer as well as the business. Data mining can be used in the retail industry, the finance industry, and the healthcare industry. Any industry can benefit from data mining but those are the top three (Turban & Volonino, 2011). Data mining is a way for large businesses to get to know their customers. The information gathered from data mining can let a large company learn what their customers want and how they want it. It can also benefit large companies get to know their employees, the company can learn how to satisfy......

Words: 1953 - Pages: 8

Premium Essay

Data Mining

...Data Mining Professor Clifton Howell CIS500-Information Systems Decision Making March 7, 2014 Benefits of data mining to the businesses One of the benefits to data mining is the ability to utilize information that you have stored to predict the possibilities of consumer’s actions and needs to make better business decisions. We implement a business intelligence that will produce a predictive score for those consumers to determine these possibilities. Predictive analytics is the business intelligence technology that produces a predictive score for each customer or other organizational element. Assigning these predictive scores is the job of a predictive model which has, in turn, been trained over your data, learning from the experience of your organization. (Impact, 2014) The usefulness of predictive scoring is obvious. However, with no predictive model and no means to score your consumer, the possibility of gaining a competitive edge and revenue is also predictable. To discover consumer buying patterns from a transaction database, mining association rules are used to make better business decisions. However because users may only be interested in certain information from this database and do not want to invest a lot of time in searching for what they need, association discovery will assist in limiting the data to which only the end user needs. Association discovery will utilize algorithms to lessen the quantity of groupings of item sets or sequences in each......

Words: 1318 - Pages: 6

Premium Essay

Data Mining

...Data Mining By Jamia Yant June 1st, 2012 Predictive Analytics and Customer Behavior “Predictive analysis is the decision science that removes guesswork out of the decision-making process and applies proven scientific guidelines to find right solution in the shortest time possible.” (Kaith, 2011) There are seven steps to Predictive Analytics: spot the business problem, explore various data sources, extract patterns from data, build a sample model using data and problem, Clarify data – find valuable factors – generate new variables, construct a predictive model using sampling and validate and deploy the model. By using this method, businesses can make fast decisions using vast amounts of data. There are three main benefits of predictive analytics: minimizing risk, indentifying fraud, and pursuing new sources of revenue. Being able to predict the risks involved with loan and credit origination, fraudulent insurance claims, and making predictions with regard to promotional offers and coupons are all examples of these benefits. It basically reduces the cost of making mistakes. This type of algorithm allows businesses to test all sorts of situations and scenarios it could take years to test in the real world. Studying customer behavior gives businesses a competitive advantage and allows them to stay ahead of the competition in their market place. Associations Discovery and Customer Purchases Association analysis is useful for discovering interesting......

Words: 1650 - Pages: 7

Free Essay

Data Mining

...DATA MINING Generally, data mining is the process of analyzing data from different perspectives and summarizing it into useful information. Data mining software is one of a number of tools for analyzing data. It allows users to analyze data from many different dimensions or angels, categorize it, and summarize the relationships identified. Technically, data mining is the process of finding patterns among dozens of fields in large databases that are similar to one another. Data is any facts, numbers, or text that can be processed by a computer so in general it makes it easier for a company or business to see what the majority of customers want at a time. It’s almost like a survey that we don’t realize we are taking. I think it really can benefit consumers because we can walk into a place of business and see what we want on the shelves because it is in demand. Even better, the things we don’t want to purchase are not there because there is no demand for it. It gives us the choice to be heard and have a say in making decisions on things that impact us most. Information can be converted into knowledge about historical patterns and future trends. For example, summary information on retail supermarket sales can be analyzed in light of promotional efforts to provide knowledge of consumer buying behavior. Thus, a manufacturer or retailer could determine which items are most susceptible to promotional efforts. I don’t think......

Words: 1315 - Pages: 6

Premium Essay

Data Mining

...ITT TECHNICAL INSTITUTE DATA MINING REPORT ONE AND TWO Shirly Mu, Todd Hughson, James Wall PROBLEM SOLVING THEORY ONSITE COURSE GS1140 INSTUCTOR LJILJANA MORRIS Today we are doing a project report on Costco. For Sol and Robert Price in 1976 they asked friends and family to help out with an opening price of two point five millon to open Price Club on July twelfth, they open their shop in an air hanger on Boulevard in San Diego, California. They were originally going to serve only small business. Mr. Price found out that it will be more beneficial to serve select customers. Costco was founded by James Sinegal and Jeffery H. Brotman. Costco opened its doors in 1983 in Seattle, Washington. Price Club and Costco later merged and renamed the business PriceCostco. And in 1997 due to its success the name was changed again to Costco Wholesale. (About History, 2014) Costco Wholesale stores headquarters are located in Issaquah, WA. The mission statement for Costco is to continually provide our members with quality goods and services at the lowest possible prices. (Farfan, 2014) There are three types of membership cards at Costco; Executive membership, Business membership and Gold membership. The one I have is executive membership this cost about one hundred and twenty dollars sounds like a lot but I’ll be able to get two percent back in my shopping. If I do not get back more than fifty five dollars back for the entire year they will give me that amount or the two percent......

Words: 1306 - Pages: 6

Premium Essay

Intro to Data Mining

...Data Mining: Concepts and Techniques (3rd ed.) Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign & Simon Fraser University ©2011 Han, Kamber & Pei. All rights reserved. Adapted for CSE 347-447, Lecture 1b, Spring 2015 1 1 Introduction n  n  n  n  n  n  n  n  n  n  Why Data Mining? What Is Data Mining? A Multi-Dimensional View of Data Mining What Kind of Data Can Be Mined? What Kinds of Patterns Can Be Mined? What Technologies Are Used? What Kind of Applications Are Targeted? Major Issues in Data Mining A Brief History of Data Mining and Data Mining Society Summary 2 Why Data Mining? n  The Explosive Growth of Data: from terabytes to petabytes n  Data collection and data availability n  Automated data collection tools, database systems, Web, computerized society n  Major sources of abundant data n  n  n  Business: Web, e-commerce, transactions, stocks, … Science: Remote sensing, bioinformatics, scientific simulation, … Society and everyone: news, digital cameras, YouTube n  n  We are drowning in data, but starving for knowledge! “Necessity is the mother of invention”—Data mining—Automated analysis of massive data sets 3 Evolution of Sciences: New Data Science Era n  n  Before 1600: Empirical science 1600-1950s: Theoretical science n  Each discipline has grown a theoretical component. Theoretical models often motivate experiments and generalize our......

Words: 3169 - Pages: 13

Free Essay

Data Mining

...Assignment-4 1) Determine the benefits of data mining to the businesses when employing: a. Predictive analytics to understand the behavior of customers b. Associations discovery in products sold to customers c. Web mining to discover business intelligence from Web customers d. Clustering to find related customer information Ans: a. Services and applications: Some examples regarding products and services made available from 3G wi-fi networks are usually CDMA2000, UMTS, and EDGE along with a list of others while 4G networks present Wimax2 and also LTE-Advance. The applications are where a great deal of end users find hooked on technology. 3G opened a whole new sphere regarding options together with purposes permitting end users to mode video clip and also sound, video clip getting in touch with, video clip conferencing plus a massive variety of media purposes in the cell phone natural environment. 4G purposes include things like video gaming products and services, amended cell phone net admittance, high definition cell phone television set, video clip conferencing, IP telephone systems and in some cases 3D images television set. b. Network Architecture: One of many variations involving both of these specifications regarding cpa networks is based on the particular network architecture; the particular 3G cell network is really a Large Spot Cell phone Dependent network that has a circuit-switched subsystem. It relies on substantial satellite connections......

Words: 2003 - Pages: 9

Premium Essay

Data Mining

...The increasing use of data mining by corporations and advertisers obviously creates apprehension from the perspective of the individual consumer due to privacy, security and the potential use of inaccurate information. The idea that there are data warehouses that contain customers’ personal information can be rather frightening. However, the use of data mining by these organizations can also lead to numerous benefits for consumers they otherwise would not have realized. Besides the obvious benefit of guiding consumers to products or services they’d be more interested in purchasing, the use of data mining by companies has also benefitted individuals’ health and financial safety. Not long after the use of data mining came into prominence the use of data mining consumer information vs. consumer privacy became a major issue in early 1998 after CVS and Giant entered into an agreement with Elensys, a Massachusetts direct marketing company, to send reminders to customers who had not renewed their prescriptions. However, neither CVS nor Giant explained how the program would work or received their customers' permission to transfer their prescription records to a third party (Pyatt Jr.). Giant and CVS’s rationale for entering into this agreement was “to develop a positive program to benefit consumers, many of whom don't take their medication properly,” (Pyatt Jr.). Even though their primary intention was good, Giant and CVS were not transparent about their agreement with......

Words: 949 - Pages: 4

Premium Essay

Data Mining

...Data Mining Teresa M. Tidwell Dr. Sergey Samoilenko Information Systems for Decision Making September 2, 2012 Data Mining The use of data mining by companies assists them with identifying information and knowledge from databases and data warehouses that would be beneficial for the company. The information is often buried in databases, records, and files. With the use of tools such as queries and algorithms, companies can access data, analyze it, and use it to increase their profit. The benefits of using data mining, its reliability, and privacy concerns will be discussed. Benefits of Data Mining 1. Predictive Analytics: This type of analysis uses the customer’s data to make a specific model for the business. Existing information is used such as a customer’s recent purchases and their income, to create a prediction of future purchases and how much or what type of item would be purchased. The more variables used the more likely that the prediction will be correct. Such variables include the customer ranking, based on the number of and most recent purchases and the average profit made per customer purchase. Without data made available through web access and purchases by the customer, predictive analysis would be difficult to perform. The company, therefore, would not be able to plan nor predict how well they are performing. 2. Associations Discovery: This part of data mining helps the company to discover the “relationships hidden in larger data sets”......

Words: 1443 - Pages: 6

Şevkat Yerimdar izle | Collana Donna BREIL ROLLING DIAMOND Acciaio Swarovski Viola Turchese | Full List