AskTable
sidebar.freeTrial

Complete E-commerce Data Analysis Guide: From GMV to User Profiling, How to Drive Growth with AI

AskTable Team
AskTable Team 2026-03-02

E-commerce is the most typical data-driven industry. From GMV and conversion rates to user lifetime value (LTV), every decision requires data support. However, the complexity of e-commerce data also troubles many teams: data scattered across multiple systems, inconsistent indicator definitions, business personnel rely on technical teams for data queries...

This article systematically sorts out the complete system of e-commerce data analysis and explores how to improve analysis efficiency with AI-driven natural language query tools.

Four Dimensions of E-commerce Data Analysis

Dimension 1: Transaction Analysis

Core Objective: Monitor transaction health and discover growth opportunities

1. GMV (Gross Merchandise Volume)

Definition: Total value of merchandise ordered by users (not deducting refunds)

Calculation Formula:

GMV = SUM(Order Amount)
Condition: Order Status = 'Paid' or 'Completed'

Analysis Dimensions:

  • Time Dimension: Daily, weekly, monthly GMV, year-over-year/month-over-month growth
  • Category Dimension: GMV proportion and growth trends by category
  • Channel Dimension: GMV contribution by traffic source
  • Regional Dimension: GMV distribution by region

Key Insights:

  • GMV growth slowing → Check traffic, conversion rate, average order value
  • Abnormal GMV growth in a category → Might be a hit product, increase promotion
  • GMV decline in a channel → Investigate channel quality or promotion strategy

Natural Language Query Examples (using AskTable):

"This month's GMV compared to last month"
"GMV proportion by category"
"Which channel has the fastest GMV growth?"

2. Net Sales

Definition: Actual transaction amount after deducting refunds

Calculation Formula:

Net Sales = GMV - Refund Amount

Why It Matters:

  • GMV may be inflated (fake orders, malicious orders)
  • Net sales more truly reflects business health
  • Financial settlement is based on net sales

Analysis Scenarios:

  • Refund rate analysis: Refund rate = Refund amount / GMV
  • Refund reason analysis: Product quality, logistics issues, description mismatch
  • Abnormal order identification: Large refunds, frequent refund users

3. Order Volume

Definition: Total number of successfully paid orders

Key Metrics:

  • Daily average order volume: Reflects platform scale
  • Order volume growth rate: Reflects growth speed
  • Order distribution: Large order, small order proportion

Correlation Analysis:

GMV = Order Volume × Average Order Value

If GMV grows but order volume remains unchanged, it indicates average order value increased (possibly product upgrade or price increase)

Natural Language Query Examples:

"Order volume trend for the past 7 days"
"Compare this week's order volume with last week"
"Find the day when order volume suddenly dropped"

4. Average Order Value (AOV)

Definition: Average amount per order

Calculation Formula:

AOV = GMV / Order Volume

Strategies to Increase AOV: -满减 activities (spend 99 get 20 off)

  • Bundle recommendations (buy A get B)
  • Membership upgrades (member pricing)
  • High-AOV product promotion

Analysis Dimensions:

  • AOV for different user groups
  • AOV by category
  • AOV changes over time

Natural Language Query Examples:

"What is this month's AOV?"
"Which user group has the highest AOV?"
"Proportion of orders with AOV below 50 yuan"

Dimension 2: User Analysis

Core Objective: Understand user behavior and enhance user value

1. User Acquisition

Key Metrics:

  • New user count: Daily/weekly/monthly new registered users
  • Acquisition channels: Search engines, social media, advertising, organic traffic
  • CAC (Customer Acquisition Cost): Marketing spend / New users

Analysis Scenarios:

Scenario 1: Channel Effectiveness Analysis

Question: "Which channel has the lowest acquisition cost?"
Analysis: Compare CAC and user quality (retention rate, LTV) by channel
Decision: Increase investment in low-CAC and high-LTV channels

Scenario 2: Registration Conversion Funnel

Flow: Visit homepage → Registration page → Fill info → Verify phone → Registration success
Analysis: Find the step with the most serious attrition
Optimization: Simplify registration process, reduce attrition

Natural Language Query Examples:

"New users by channel this week"
"What is the registration conversion rate?"
"Which channel has the best user quality?" (need to define "quality")

2. User Activity

Key Metrics:

  • DAU (Daily Active Users)
  • MAU (Monthly Active Users)
  • Activity rate: Active users / Total users

Activity Definition: Different businesses define "active" differently:

  • Definition 1: Logging in counts as activity
  • Definition 2: Browsing products counts as activity
  • Definition 3: Making a purchase counts as activity

E-commerce Recommended Definition: At least one of the following:

  • Browsed product detail pages
  • Added to cart
  • Made a purchase
  • Participated in activities

Activity Tiering:

  • High activity: 5+ visits per week
  • Medium activity: 2-4 visits per week
  • Low activity: 1 visit per week
  • Dormant: No visit this month

Natural Language Query Examples:

"What is this month's MAU?"
"What is the activity rate trend?"
"How many users are in dormant state?"

3. User Retention

Definition: Proportion of new users who continue using the product after registration

Key Metrics:

  • Day 1 retention: Proportion of users still active on day 2 after registration
  • Day 7 retention: Proportion still active on day 7 after registration
  • Day 30 retention: Proportion still active on day 30 after registration

Retention Curve Analysis:

Excellent retention curve:
Day 1: 70%
Day 7: 50%
Day 30: 40%
Gradually flattens, indicating long-term value

Poor retention curve:
Day 1: 30%
Day 7: 10%
Day 30: 3%
Rapid decay, indicating insufficient product value

Natural Language Query Examples:

"Day 1 retention rate for users registered last week"
"Compare Day 7 retention by channel"
"Retention rate change trend over time"

4. User Conversion

Conversion Funnel:

Visit homepage (100%)
  ↓ 70%
Browse products (70%)
  ↓ 40%
Add to cart (28%)
  ↓ 50%
Submit order (14%)
  ↓ 80%
Payment success (11.2%)

Key Conversion Rates:

  • First purchase conversion rate: Proportion of new users completing first purchase
  • Repurchase conversion rate: Proportion of existing users making another purchase
  • Cart-to-pay conversion rate: Proportion completing payment after adding to cart

Strategies to Improve Conversion:

  • Reduce attrition: Simplify payment process, optimize page load speed
  • Increase trust: User reviews, return and exchange guarantees
  • Promotion incentives: New user discounts, limited-time discounts

Natural Language Query Examples:

"What is this month's first purchase conversion rate?"
"At which step is cart-to-pay conversion losing the most?"
"New user coupon usage rate"

5. User Value

Key Metrics:

ARPU (Average Revenue Per User):

ARPU = Total Revenue / Total Users

ARPPU (Average Revenue Per Paying User):

ARPPU = Total Revenue / Paying Users

LTV (Lifetime Value): Total value created by users throughout their lifecycle

Simplified formula: LTV = AOV × Purchase frequency × Average lifecycle (months)

Example:

  • AOV: 200 yuan
  • Purchase frequency: 2 times/month
  • Average lifecycle: 12 months
  • LTV = 200 × 2 × 12 = 4,800 yuan

LTV vs CAC Ratio:

  • LTV/CAC > 3: Healthy, can expand advertising
  • LTV/CAC = 1-3: Needs optimization
  • LTV/CAC < 1: Losing money, need to adjust strategy

Natural Language Query Examples:

"What is the ARPPU for paying users?"
"How many high-value users (LTV > 5000)?"
"Compare LTV by channel"

Dimension 3: Product Analysis

Core Objective: Optimize product structure, improve sales efficiency

1. Product Sales Performance

Key Metrics:

  • Top N by sales: Top N products by sales amount
  • Top N by volume: Top N products by sales volume
  • Sales velocity: Proportion of SKUs with sales / Total SKUs
  • Inventory turnover: Cost of goods sold / Average inventory

Analysis Scenarios:

Scenario 1: Hit Product Analysis

Identify hits: Products with sudden sales surge
Analyze reasons: Promotional activities, KOL recommendations, seasonal demand
Operations strategy: Increase inventory, boost exposure, similar product recommendations

Scenario 2: Slow-Moving Product Handling

Identify slow-movers: Products with no sales for 30 days
Analyze reasons: Price too high, unclear description, wrong category
Handling strategy: Price reduction, optimize detail page, adjust category

Natural Language Query Examples:

"Top 10 products by sales this month"
"Find products with no sales for 30 days"
"Category with the lowest inventory turnover"

2. Product Conversion Analysis

Conversion Funnel:

Product exposure
  ↓
Click to enter detail page
  ↓
Add to cart
  ↓
Purchase

Key Metrics:

  • CTR (Click-through rate): Clicks / Exposures
  • Add-to-cart rate: Add-to-cart / Detail page views
  • Conversion rate: Orders / Detail page views

Optimization Directions:

  • Improve CTR: Optimize main images, titles, price display
  • Improve add-to-cart rate: Optimize detail pages, increase user reviews
  • Improve conversion rate: Promotional activities, inventory reminders, limited-time discounts

3. Product Association Analysis

Market Basket Analysis:

Discovery: 60% of users who purchase Product A also purchase Product B
Applications:
- Recommend Product B on Product A detail page
- Set bundle discounts (A + B at 10% off)
- Place A and B together in warehouse to improve picking efficiency

Association Rules:

Support = P(A ∩ B): Proportion of orders containing both A and B
Confidence = P(B|A): Proportion purchasing B among those who purchased A
Lift = P(B|A) / P(B): Greater than 1 indicates positive correlation

Natural Language Query Examples:

"Which products are frequently purchased together?"
"What do users who bought phones also buy?"
"Recommend products with the highest association with Product A"

Dimension 4: Operations Analysis

Core Objective: Evaluate operations activity effectiveness, optimize operations strategies

1. Promotional Activity Analysis

Before activity:

  • Estimate targets: GMV, order volume, new users
  • Budget: Coupon costs, advertising spend
  • Expected ROI

During activity:

  • Real-time monitoring: GMV progress, inventory warnings
  • Exception handling: Hit product out of stock, system failures

After activity:

  • Effectiveness evaluation: Actual GMV vs target GMV
  • Cost analysis: Actual cost vs budget
  • ROI calculation: (GMV - Cost) / Cost

Natural Language Query Examples:

"What is Double 11 activity GMV?"
"New users during activity compared to normal days"
"Coupon usage and redemption rates"

2. Traffic Analysis

Traffic Sources:

  • Direct access: Users entering URL directly
  • Search engines: Baidu, Google, etc.
  • Social media: WeChat, Douyin, Xiaohongshu
  • Advertising: Baidu promotions, information flow ads
  • Other channels: Email marketing, SMS notifications

Key Metrics:

  • PV (Page Views)
  • UV (Unique Visitors)
  • Average visit duration
  • Bounce rate: Proportion leaving after viewing one page

Traffic Quality Evaluation:

High-quality traffic characteristics:
- Low bounce rate (< 30%)
- High average visit duration (> 3 minutes)
- High conversion rate (> 2%)

Low-quality traffic characteristics:
- High bounce rate (> 70%)
- Low average visit duration (< 30 seconds)
- Low conversion rate (< 0.5%)

Natural Language Query Examples:

"What are today's PV and UV?"
"Which channel has the highest bounce rate?"
"Which page has the highest traffic?"

3. User Profiling

Demographic Characteristics:

  • Age distribution
  • Gender ratio
  • Regional distribution
  • Device type (iOS/Android/PC)

Behavioral Characteristics:

  • Browsing preferences (preferred categories)
  • Purchase frequency (high/medium/low)
  • Price sensitivity (response to discounts)
  • Active time periods (morning/afternoon/evening)

User Tiering (RFM Model):

R (Recency): Days since last purchase
F (Frequency): Purchase frequency
M (Monetary): Spending amount

Important Value Customers: Low R, High F, High M
Important Keep Customers: Low R, High F, Medium M
Important Development Customers: Low R, Medium F, Low M
Important Retention Customers: High R, High F, High M

Precision Marketing Application:

Goal: Improve activity for important retention customers
Strategy:
1. Issue exclusive coupons
2. Push personalized products
3. VIP customer care

Natural Language Query Examples:

"User age distribution"
"How many important value customers?"
"Characteristics of churn risk users (no orders for 30 days)"

Common E-commerce Data Analysis Scenarios

Scenario 1: GMV Decline Diagnosis

Problem: GMV decreased 15% month-over-month, need to find the cause.

Analysis Path:

Step 1: Decompose GMV

GMV = Traffic × Conversion Rate × AOV

Using natural language query (AskTable):

"Compare this month's and last month's traffic, conversion rate, AOV"

Step 2: Locate Problem

Found: Traffic normal, conversion rate decreased 20%, AOV normal
Conclusion: Problem is with conversion rate

Step 3: Analyze Conversion Funnel

"Changes in add-to-cart and payment conversion rates"

Found: Add-to-cart conversion normal, payment conversion decreased
Conclusion: Problem is in the payment step

Step 4: Deep Investigation

Possible causes:
- Payment system malfunction
- Reduced promotional activities
- Competitor promotions stealing traffic

Query: "Number of payment failed orders"
Query: "Compare coupon usage rates"

Step 5: Verify Hypotheses

Found: Coupon usage rate dropped from 40% to 10%
Cause: Last month's promotion ended, reduced promotional intensity
Suggestion: Restore appropriate promotional activities

Scenario 2: New User Retention Optimization

Problem: New user Day 1 retention only 20%, far below industry average of 40%.

Analysis Path:

Step 1: Compare by Channel

"Day 1 retention by channel for new users"

Found: Information flow ad channel at 10%, organic search at 50%
Conclusion: Information flow ads have poor quality, dragging down overall retention

Step 2: Analyze Churn Reasons

"Post-registration behavior of churned users"

Found: 80% of churned users only browsed the homepage, didn't enter product detail pages
Cause: Homepage content not attractive, or target users not precise

Step 3: Analyze Retained User Characteristics

"Differences between retained and churned users"

Found retained user characteristics:
- Browsed 5+ products within 10 minutes after registration
- 70% used new user coupons
- 50% added to cart

Step 4: Optimization Strategies

Strategy 1: Optimize new user onboarding process
- Push personalized product recommendations after registration
- Strengthen new user coupon reminders

Strategy 2: Optimize advertising investment
- Pause low-retention channel advertising
- Optimize ad copy to attract precise users

Step 5: A/B Test Verification

Control group: Original process
Experimental group: New onboarding process

After 7 days, query: "Compare A/B test retention rates"
Verify effect, decide whether for full rollout

Scenario 3: Inventory Optimization

Problem: Warehouse has大量 slow-moving products accumulating, occupying capital and storage space.

Analysis Path:

Step 1: Identify Slow-Moving Products

"List of products with 0 sales in 30 days"
"Bottom 100 products by inventory turnover"

Step 2: Analyze Slow-Movement Reasons

Reason 1: Seasonal products out of season (e.g., winter down jackets)
Reason 2: Price too high (30% higher than similar products)
Reason 3: Insufficient exposure (low search ranking, few recommendations)
Reason 4: Product quality issues (many negative reviews)

Step 3: Classified Handling

Strategy 1 (Out-of-season products):
- Price reduction to clear
- Bundle sales
- Pre-sell next year's inventory

Strategy 2 (Too high price):
- Adjust pricing
- Bundle with hit products for promotion

Strategy 3 (Insufficient exposure):
- Increase advertising investment
- Optimize SEO
- Participate in event promotions

Strategy 4 (Quality issues):
- Stop restocking
- Clearance sale
- Change suppliers

Step 4: Effect Tracking

"Sales situation of clearance activities"
"Trend of slow-moving product inventory reduction"

How to Use AskTable to Improve E-commerce Data Analysis Efficiency

Traditional vs AI-Driven Approach

Pain Points of Traditional Approach:

Scenario: Operations wants to know "first purchase conversion rate for new users this week"

  1. Submit requirement to data team
  2. Wait for data team scheduling (1-3 days)
  3. Data team writes SQL
  4. Generate report
  5. Find data is wrong, need to re-communicate requirement
  6. Wait another 1-2 days for correct data

Total time: 3-5 days

AskTable Approach:

  1. Ask in natural language: "First purchase conversion rate for new users this week"
  2. AI automatically generates SQL and executes
  3. Return results and charts immediately
  4. If details needed, continue asking: "Group by registration channel"

Total time: 30 seconds

Value of Business Semantic Layer

Configure once, everyone uses

Configure business semantic layer in AskTable:

Indicator Definition:
  - Name: First Purchase Conversion Rate
    Definition: Proportion of new users completing first purchase within 30 days after registration
    Calculation: COUNT(DISTINCT First Purchase Users) / COUNT(DISTINCT New Users)
    Synonyms: [New User Conversion Rate, First Buy Conversion Rate]

  - Name: GMV
    Definition: Sum of amounts for paid orders
    Calculation: SUM(amount) WHERE status IN ('paid', 'completed')
    Synonyms: [Sales, Transaction Volume, Total Sales]

Dimension Definition:
  - Name: Registration Channel
    Field: users.register_channel
    Possible Values: [Search Engine, Social Media, Advertising, Organic Traffic]

  - Name: Time
    Field: created_at
    Supported Granularity: [Hour, Day, Week, Month, Quarter, Year]

Effects:

  • Everyone uses the same indicator definitions, avoiding inconsistent data standards
  • Business personnel ask questions in natural language, AI automatically calls semantic layer definitions
  • Complex SQL logic is encapsulated, reducing error risks

Practical Application Case

Case: A Cross-Border E-commerce Platform

Background:

  • Team of 50 people, 5 in data team
  • Receiving 20+ temporary data requirements daily
  • Data team exhausted responding, business team complains about slow response

After Introducing AskTable:

Week 1:

  • Configure data source connections (MySQL, ClickHouse)
  • Define 30 core indicators (GMV, order volume, conversion rates, etc.)
  • Train business personnel to use natural language queries

Week 2:

  • 70% of temporary query requirements completed by business personnel independently
  • Data team's requirements dropped from 20+/day to 5/day
  • Business team from requirement submission to data acquisition, shortened from 2-3 days to real-time

Month 1:

  • Business personnel asked over 500 questions
  • Discovered 3 growth opportunities originally overlooked
  • By optimizing a channel's investment strategy, ROI improved by 40%

After 3 Months:

  • Data-driven decision-making became daily practice
  • Data team freed from "data fetching" work, focusing on higher-value analysis
  • Decision cycle shortened by 50%, business iteration speed significantly improved

E-commerce Data Analysis Best Practices

1. Establish Indicator System

North Star Metric: Choose 1 most core metric as the business "north star"

Common e-commerce north star metrics:

  • GMV: Suitable for early stage, focusing on scale growth
  • Net Sales: Suitable for mature stage, focusing on profitability
  • LTV: Suitable for long-term, focusing on user value

Key Indicator Tree: Decompose around the north star metric into executable sub-indicators

North Star Metric: GMV
├─ Traffic
│  ├─ New Users
│  │  ├─ New users by channel
│  │  └─ Acquisition cost
│  └─ Active Users
│     ├─ DAU/MAU
│     └─ Activity rate
├─ Conversion Rate
│  ├─ First Purchase Conversion Rate
│  ├─ Repurchase Conversion Rate
│  └─ Conversion Rates by Step
└─ AOV
   ├─ ARPU
   ├─ ARPPU
   └─ Attachment Rate (products per order)

2. Regular Reviews

Daily Report (Operations):

  • Yesterday's GMV, order volume, new users
  • Abnormal metric alerts
  • Key activity effects

Weekly Report (Management):

  • Core metric completion this week
  • Year-over-year/month-over-month analysis
  • Next week's key work

Monthly Report (Whole company):

  • Monthly target achievement rate
  • Growth highlights and risks
  • Next month's strategic adjustments

3. Data-Driven Decision Process

Discover Problem → Propose Hypothesis → Verify with Data → Develop Strategy → A/B Test → Full Rollout

4. Data Security and Compliance

Sensitive Data Protection:

  • Phone number masking (138****5678)
  • ID number masking (110***********123)
  • Address masking (Beijing Chaoyang District***)

Permission Control:

  • Operations: Can only see overall data, cannot see user details
  • Customer service: Can see individual user info, cannot batch export
  • Management: Can see all data

Data Auditing:

  • Record all query logs
  • Regularly audit abnormal query behaviors
  • Prevent data leaks

Summary

E-commerce data analysis is a systematic project requiring:

Complete Indicator System:

  • Transaction indicators (GMV, order volume, AOV)
  • User indicators (acquisition, activity, retention, conversion, value)
  • Product indicators (sales performance, conversion, association)
  • Operations indicators (activity effects, traffic quality, user profiling)

Efficient Analysis Tools:

  • Traditional BI tools suitable for fixed reports
  • AI-driven natural language queries suitable for exploratory analysis
  • Business semantic layer ensures unified data standards

Data-Driven Culture:

  • All decisions based on data
  • Quickly verify hypotheses
  • Continuously iterate and optimize

In the AI era, data analysis is no longer the exclusive skill of data teams but should become a basic capability for every business person. Through AI tools like AskTable, "everyone is a data analyst" is no longer a slogan but reality.


Learn More:

  • Visit AskTable Official Website to apply for free trial
  • Download "E-commerce Data Analysis Indicator System Whitepaper"
  • Contact us for e-commerce industry solution demos

cta.readyToSimplify

sidebar.noProgrammingNeededsidebar.startFreeTrial

cta.noCreditCard
cta.quickStart
cta.dbSupport