- •Contents
- •Data Mining Tutorials (Analysis Services)
- •Basic Data Mining Tutorial
- •Lesson 1: Preparing the Analysis Services Database (Basic Data Mining Tutorial)
- •Creating an Analysis Services Project (Basic Data Mining Tutorial)
- •Creating a Data Source (Basic Data Mining Tutorial)
- •Creating a Data Source View (Basic Data Mining Tutorial)
- •Lesson 2: Building a Targeted Mailing Structure (Basic Data Mining Tutorial)
- •Creating a Targeted Mailing Mining Model Structure (Basic Data Mining Tutorial)
- •Specifying the Data Type and Content Type (Basic Data Mining Tutorial)
- •Specifying a Testing Data Set for the Structure (Basic Data Mining Tutorial)
- •Lesson 3: Adding and Processing Models
- •Adding New Models to the Targeted Mailing Structure (Basic Data Mining Tutorial)
- •Processing Models in the Targeted Mailing Structure (Basic Data Mining Tutorial)
- •Lesson 4: Exploring the Targeted Mailing Models (Basic Data Mining Tutorial)
- •Exploring the Decision Tree Model (Basic Data Mining Tutorial)
- •Exploring the Clustering Model (Basic Data Mining Tutorial)
- •Exploring the Naive Bayes Model (Basic Data Mining Tutorial)
- •Lesson 5: Testing Models (Basic Data Mining Tutorial)
- •Testing Accuracy with Lift Charts (Basic Data Mining Tutorial)
- •Testing a Filtered Model (Basic Data Mining Tutorial)
- •Lesson 6: Creating and Working with Predictions (Basic Data Mining Tutorial)
- •Creating Predictions (Basic Data Mining Tutorial)
- •Using Drillthrough on Structure Data (Basic Data Mining Tutorial)
- •Lesson 1: Creating the Intermediate Data Mining Solution (Intermediate Data Mining Tutorial)
- •Creating a Solution and Data Source (Intermediate Data Mining Tutorial)
- •Lesson 2: Building a Forecasting Scenario (Intermediate Data Mining Tutorial)
- •Adding a Data Source View for Forecasting (Intermediate Data Mining Tutorial)
- •Creating a Forecasting Structure and Model (Intermediate Data Mining Tutorial)
- •Modifying the Forecasting Structure (Intermediate Data Mining Tutorial)
- •Customizing and Processing the Forecasting Model (Intermediate Data Mining Tutorial)
- •Exploring the Forecasting Model (Intermediate Data Mining Tutorial)
- •Creating Time Series Predictions (Intermediate Data Mining Tutorial)
- •Advanced Time Series Predictions (Intermediate Data Mining Tutorial)
- •Lesson 3: Building a Market Basket Scenario (Intermediate Data Mining Tutorial)
- •Adding a Data Source View with Nested Tables (Intermediate Data Mining Tutorial)
- •Creating a Market Basket Structure and Model (Intermediate Data Mining Tutorial)
- •Modifying and Processing the Market Basket Model (Intermediate Data Mining Tutorial)
- •Exploring the Market Basket Models (Intermediate Data Mining Tutorial)
- •Filtering a Nested Table in a Mining Model (Intermediate Data Mining Tutorial)
- •Predicting Associations (Intermediate Data Mining Tutorial)
- •Lesson 4: Building a Sequence Clustering Scenario (Intermediate Data Mining Tutorial)
- •Creating a Sequence Clustering Mining Model Structure (Intermediate Data Mining Tutorial)
- •Processing the Sequence Clustering Model
- •Exploring the Sequence Clustering Model (Intermediate Data Mining Tutorial)
- •Creating a Related Sequence Clustering Model (Intermediate Data Mining Tutorial)
- •Creating Predictions on a Sequence Clustering Model (Intermediate Data Mining Tutorial)
- •Lesson 5: Building Neural Network and Logistic Regression Models (Intermediate Data Mining Tutorial)
- •Adding a Data Source View for Call Center Data (Intermediate Data Mining Tutorial)
- •Creating a Neural Network Structure and Model (Intermediate Data Mining Tutorial)
- •Exploring the Call Center Model (Intermediate Data Mining Tutorial)
- •Adding a Logistic Regression Model to the Call Center Structure (Intermediate Data Mining Tutorial)
- •Creating Predictions for the Call Center Models (Intermediate Data Mining Tutorial)
- •Creating and Querying Data Mining Models with DMX: Tutorials (Analysis Services - Data Mining)
- •Bike Buyer DMX Tutorial
- •Lesson 1: Creating the Bike Buyer Mining Structure
- •Lesson 2: Adding Mining Models to the Bike Buyer Mining Structure
- •Lesson 3: Processing the Bike Buyer Mining Structure
- •Lesson 4: Browsing the Bike Buyer Mining Models
- •Lesson 5: Executing Prediction Queries
- •Market Basket DMX Tutorial
- •Lesson 1: Creating the Market Basket Mining Structure
- •Lesson 2: Adding Mining Models to the Market Basket Mining Structure
- •Lesson 3: Processing the Market Basket Mining Structure
- •Lesson 4: Executing Market Basket Predictions
- •Time Series Prediction DMX Tutorial
- •Lesson 1: Creating a Time Series Mining Model and Mining Structure
- •Lesson 2: Adding Mining Models to the Time Series Mining Structure
- •Lesson 3: Processing the Time Series Structure and Models
- •Lesson 4: Creating Time Series Predictions Using DMX
- •Lesson 5: Extending the Time Series Model
Microsoft Decision Trees Algorithm
Specifying the Data Type and Content Type (Basic Data Mining Tutorial)
Now that you have selected which columns to use for building your structure and training your models, make any necessary changes to the default data and content types that are set by the wizard.
Review and modify content type and data type for each column
1.On the Specify Columns' Content and Data Type page, click Detect to run an algorithm that determines the default data and content types for each column.
2.Review the entries in the Content Type and Data Type columns and change them if necessary, to make sure that the settings are the same as those listed in the following table.
Typically, the wizard will detect numbers and assign an appropriate numeric data type, but there are many scenarios where you might want to handle a number as text instead. For example, the GeographyKey should be handled as text, because it would be inappropriate to perform mathematical operations on this identifier.
Column |
Content Type |
Data Type |
|
|
|
Address Line1 |
Discrete |
Text |
|
|
|
Address Line2 |
Discrete |
Text |
|
|
|
Age |
Continuous |
Long |
|
|
|
Bike Buyer |
Discrete |
Long |
|
|
|
Commute Distance |
Discrete |
Text |
|
|
|
CustomerKey |
Key |
Long |
|
|
|
DateLastPurchase |
Continuous |
Date |
|
|
|
Email Address |
Discrete |
Text |
|
|
|
English Education |
Discrete |
Text |
|
|
|
English Occupation |
Discrete |
Text |
|
|
|
FirstName |
Discrete |
Text |
|
|
|
Gender |
Discrete |
Text |
|
|
|
Geography Key |
Discrete |
Text |
|
|
|
16
|
House Owner Flag |
Discrete |
Text |
|
|
|
|
|
Last Name |
Discrete |
Text |
|
|
|
|
|
Marital Status |
Discrete |
Text |
|
|
|
|
|
Number Cars Owned |
Discrete |
Long |
|
|
|
|
|
Number Children At Home |
Discrete |
Long |
|
|
|
|
|
Region |
Discrete |
Text |
|
|
|
|
|
Total Children |
Discrete |
Long |
|
|
|
|
|
Yearly Income |
Continuous |
Double |
|
|
|
|
3. Click Next. |
|
|
|
Next Task in Lesson |
|
|
Specifying a Testing Data Set for the Structure (Basic Data Mining Tutorial)
Previous Task in Lesson
Creating a Targeted Mailing Mining Model Structure (Basic Data Mining Tutorial)
See Also
Content Types (Data Mining) Data Types (Data Mining)
Specifying a Testing Data Set for the Structure (Basic Data Mining Tutorial)
In the final few screens of the Data Mining Wizard you will split your data into a testing set and a training set. You will then name your structure and enable drillthrough on the model.
Specifying a Testing Set
Separating data into training and testing sets when you create a mining structure makes it possible to easily assess the accuracy of the mining models that you create later. For more information on testing sets, see Partitioning Data into Training and Testing Sets (Analysis Services - Data Mining).
To specify the testing set
1.On the Create Testing Set page, for Percentage of data for testing, leave the default value of 30.
2.For Maximum number of cases in testing data set, type 1000.
3.Click Next.
17