Gallery

Gallery

DMuses

Data Mining Uses

Uses of Data Mining

  • Data Base Marketing and Targeting
  • Credit Risk Management and Credit Scoring
  • Fraud Detection and Prevention
  • Healthcare Bioinflrmatics
  • Spam Filtering
  • Recommendation systems
  • Sentiment Analysis
  • Qualitative Data Mining (QDM)

DMhowto

Data Mining: Steps

  1. Business Understanding
  2. Establish the goals you wish to reach by using data mining. What is your plan? arrange a plan with a timeline, actions, and role assignments
  3. Data Understanding
  4. After your data is collected use data visalization tools such as spreadsheets to explore the properties of the data. Ensure that your findings will help you achieve your goal.
  5. Data Preparation
  6. The bulk of the work is concentrated here. Data is cleansed and ready to be 'mined'
  7. Data Modeling
  8. Mathematical models are then used to find patterns in the data using sophisticated data tools.
  9. Evaluation
  10. The findings are evaluated and compared to business objectives to determine if they should be deployed across the organization.
  11. Deployment
  12. In the final stage, the data mining findings are shared across everyday business operations. An enterprise business intelligence platform can be used to provide a single source of the truth for self-service data discovery.

workerschina

Challenges with Data Mining

The challenges of big data are prolific and penetrate every field that collects, stores, and analyzes data


  • Big Data:
  • Big data is characterized by four major challenges: volume, variety, veracity, and velocity
    1. Volume
    2. describes the challenge of storing and processing the enormous quantity of data collected by organizations. This enormous amount of data presents two major challenges: first, it is more difficult to find the correct data, and second, it slows down the processing speed of data mining tools.
    3. Vareity
    4. encompasses the many different types of data collected and stored. Data mining tools must be equipped to simultaneously process a wide array of data formats. Failing to focus an analysis on both structured and unstructured data inhibits the value added by data mining.
    5. Veracity
    6. details the increasing speed at which new data is created, collected, and stored. While volume refers to increasing storage requirement and variety refers to the increasing types of data, velocity is the challenge associated with the rapidly increasing rate of data generation.
    7. Velocity
    8. acknowledges that not all data is equally accurate. Data can be messy, incomplete, improperly collected, and even biased. With anything, the quicker data is collected, the more errors will manifest within the data. The challenge of veracity is to balance the quantity of data with its quality.