CSV Data Setup Guide

How to prepare your data for DMAIC analysis

CSV Format Requirements
  • File must be in CSV format (.csv)
  • First row should contain column headers
  • Data should be clean (no empty rows in middle)
  • Maximum file size: 16MB
  • Encoding: UTF-8 or Latin-1
Numeric Columns

For measurements, defects, times, costs

Example: 14.5, 23, 8.9
Category Columns

For grouping, defect types, departments

Example: "Defect A", "Team 1"

Common Data Scenarios

Required columns:

  • Defect_Type (Category): "Scratch", "Dent", "Paint", etc.
  • Count (Numeric): Number of defects
  • Department (Category): "Assembly", "Painting", etc.
  • Date (Optional): For time series analysis
Defect_Type Count Department Cost
Scratch 15 Assembly 450
Paint 8 Painting 320

Required columns:

  • Process_Time (Numeric): Time in minutes/hours
  • Process_Type (Category): "Setup", "Run", "Cleanup"
  • Operator (Category): "A", "B", "C" for comparison
  • Shift (Category): "Day", "Night" for analysis
Process_Time Process_Type Operator Shift
24.5 Setup A Day
18.2 Run B Night

Required columns:

  • Satisfaction_Score (Numeric): 1-10 rating
  • Issue_Category (Category): "Delivery", "Quality", "Service"
  • Region (Category): Geographic grouping
  • Response_Time (Numeric): Days to resolution
Satisfaction_Score Issue_Category Region Response_Time
7 Delivery North 3
9 Quality South 1
Data Preparation Best Practices
  • Clean your data first: Remove duplicates, fix spelling errors
  • Use clear column names: "Defect_Count" not "DC" or "count1"
  • Consistent categories: "Team A" not sometimes "TeamA" or "team a"
  • Include context: Add columns for time, location, operator when relevant
  • Target values: Include your target/goal values for analysis
  • Sample size: Minimum 30 data points for statistical significance