Help CenterStatistical Lab

Statistical Lab

A statistical analysis platform that takes you from raw data to a publication-ready result — in six steps, without writing code.

Overview

Statistical Lab is organized into two modes. Explore is for understanding data and testing hypotheses — it holds Exploration, Comparison, Relationship, and Econometrics. Model is for prediction and structure — it holds Predictive, Clustering, Time Series, and Structural. Switch modes from the top of the sidebar to see the relevant analyses.

Not sure which method to use? Open Analysis Recommendation from the dashboard — it reads your dataset and suggests the most suitable Statistical Lab analysis, then sends you straight to it.

Explore & Model modes

Statistical Lab — Explore and Model modes

68+ Statistical Methods

From descriptive statistics to SEM, clustering, and time series — all in one place.

AI-powered results

Every analysis produces a plain-language Summary, a Reasoning tab, and a full APA report.

Reproducible code

Export Python or R code for any analysis — runnable in your own environment.

What you need

Data file: CSV or Excel (.xlsx / .xls)

Sample size: Varies by method — checked automatically

Variable types: Numeric, categorical, ordinal, or date/time

What you get

Summary: Plain-language findings + quality verdict

Reasoning: Statistical logic explained in plain language

Statistics: APA report, tables, plots, and export

Analysis categories

Use the left sidebar to navigate between categories. Upload your data at the top, search for a specific method, or browse by category. Each analysis page includes a format guide and a sample dataset to get started immediately.

Sidebar navigation & analysis guide

Statistical Lab — sidebar navigation and analysis guide

Statistical Lab organizes its 68+ methods into 8 categories by research goal.

Exploration

Explore

Understand your data before testing.

3 methods

Comparison

Explore

Test whether groups differ.

13 methods

Relationship

Explore

Model how variables predict each other.

10 methods

Econometrics

Explore

Estimate causal effects from observational data.

4 methods

Predictive

Model

Classify or forecast new observations.

12 methods

Structural

Model

Uncover latent constructs and path models.

9 methods

Clustering

Model

Find natural groups in your data.

6 methods

Time Series

Model

Analyze and forecast time-ordered data.

11 methods

Running an analysis

Each analysis page opens with a summary card — what it measures, when to use it, requirements, and what you'll learn. Load the sample dataset to see the full workflow before using your own data.

An analysis page with a sample dataset

Statistical Lab — individual analysis page with sample dataset

Uploading your own data

When you're ready to use your own dataset, just drag a file anywhere onto the page. A full-screen drop zone appears in the center — release the file there and it uploads instantly. CSV and Excel (.xlsx / .xls) are supported. You can also use the upload control at the top of the sidebar.

데이터를 먼저 정리해야 한다면 데이터 준비 가이드를 확인하세요. 결측치 처리, 이상치 제거, 변수 변환 등 전처리를 마친 뒤 Send to Statistical Lab 버튼으로 바로 연결됩니다.

Drag & drop to upload

Statistical Lab — drag a file to the center of the screen to upload

Every method follows the same 6-step workflow. Steps 1–3 cover selecting variables, configuring settings, and validating your data.

Variables

Select the columns your analysis will use.

Settings

Configure parameters — sensible defaults, change only what you need.

Validation

Automatic pre-flight checks before the analysis runs.

Summary

Plain-language findings + quality verdict.

Reasoning

The statistical logic behind the result, explained in plain language.

Statistics

Full APA-formatted report — tables, plots, metrics, and export.

Variables

Tell the analysis which columns to use. The variable picker automatically filters your dataset to show only columns compatible with the chosen method.

Step 1 — Variables

Numeric (continuous)

Numbers with meaningful magnitude — height, revenue, temperature, test scores.

e.g. age, price, duration, score

Categorical (nominal)

Labels with no inherent order — group names, product types, treatment conditions.

e.g. country, color, treatment_group

Ordinal

Categories with a meaningful order but unequal spacing.

e.g. S / M / L, 1–5 satisfaction, education level

Date / Time

Temporal data used to order observations or define a time axis.

e.g. order_date, timestamp, month_year

Variable pattern

Methods

Numeric outcome + Categorical group

t-Tests, ANOVA, Non-parametric comparison tests

Numeric outcome + Numeric or mixed predictors

Linear Regression (Simple, Multiple, Polynomial, Ridge, Lasso, Robust)

Categorical outcome + Any predictors

Logistic Regression, GLM, LDA, Decision Tree, Random Forest, XGBoost, SVM, KNN

All selected variables analyzed together

Clustering, PCA, EFA, CFA, MDS, SNA, Mediation, SEM

Date / time + Numeric value variable(s)

All Time Series methods (Trend, Decomposition, ARIMA, Forecasting…)

Two categorical variables

Crosstab & Chi-Squared

Settings

Configure the parameters for the chosen method. Every setting has an inline guide — hover the help icon to see what it does and the recommended default.

Step 2 — Settings

Setting type

What it controls

Applies to

Confidence / significance level

Sets the α threshold and CI width. Default 95% (α = 0.05) across almost all tests.

Most hypothesis tests

Tail direction

One-tailed tests a directional hypothesis; two-tailed tests for any difference.

t-Tests

Post-hoc test

Determines which pair of groups differ after a significant omnibus result.

ANOVA variants

Effect size metric

Quantifies practical magnitude independent of sample size. Options: η², ω², Cohen's d.

Comparison analyses

Scaler

Normalizes variable ranges before distance is computed. Options: Z-score, MinMax, Robust.

Clustering, KNN, SVM, PCA

Model hyperparameters

Method-specific controls: K clusters, tree depth, regularization strength λ, etc.

ML models, Clustering

Order parameters (p, d, q)

Controls autoregressive, differencing, and moving-average components of the model.

ARIMA / SARIMAX

Forecast horizon

How many future time steps to predict beyond the end of the training series.

Time Series forecasting

Validation

Before the analysis runs, Statistical Lab automatically checks your data — three categories, each returning pass, warning, or fail.

Step 3 — Validation

Data requirements

Minimum sample size met for the chosen method
Correct number and types of variables selected
No empty columns or all-missing variables

Statistical assumptions

Normality where required — Shapiro-Wilk test + Q-Q plot
Homogeneity of variance across groups — Levene's test
Independence of observations

Data quality

Missing values detected — choose to drop, impute, or proceed
Outliers identified via IQR and Z-score methods
Constant or near-constant columns flagged

When a check fails: you can still proceed — checks are never hard blockers — but Statistical Lab shows a recommended fix directly under the failing check.

Common issues

Batch Analysis

Batch Analysis runs many analyses on the same dataset in one pass — assign your variables once, select every method that fits, and get a single integrated interpretation instead of opening each analysis one by one.

Upload your data

Drop a CSV or Excel (.xlsx / .xls) file, or load a built-in example dataset.

Assign variables once

Set an optional target / dependent variable and a group variable. They are reused across every analysis that needs them.

Select analyses

Tick individual methods, or use “Select all feasible” to auto-pick every analysis your data supports.

Run & interpret together

Run the batch, then click Generate for an AI interpretation that integrates all results into one narrative.

When to use it: exploratory passes over a new dataset, or producing a full battery of results for a report. For a single focused test, run the analysis directly from its own page.

Understanding results

Once an analysis runs, the result is presented across three tabs of increasing depth — the same finding, framed for different audiences.

Variables

Select the columns your analysis will use.

Settings

Configure parameters — sensible defaults, change only what you need.

Validation

Automatic pre-flight checks before the analysis runs.

Summary

For: Anyone

Plain-language findings + quality verdict.

Reasoning

For: Curious readers

The statistical logic behind the result, explained in plain language.

Statistics

For: Researchers

Full APA-formatted report — tables, plots, metrics, and export.

Summary

Plain-language findings. Answers "what did the analysis find?" without statistical jargon.

Step 4 — Summary

Headline finding

One or two sentences stating what the analysis found in plain English — no jargon.

Key Findings panel

Bulleted highlights: key metrics, effect direction, group sizes, and anything notable about the data pattern.

Performance Assessment

A paragraph contextualizing the result — whether the model or test performed well, what the caveats are, and what the finding means in practice.

Quality Dimensions

A grid showing each quality dimension evaluated for this specific method, each with its own status badge. Dimensions vary by analysis.

Overall verdict chip

A single combined verdict at the bottom — e.g. "Moderate · 96.0%". This combines all dimension scores, not just the main metric.

"Why This Result?" button

Navigates directly to the Reasoning tab for a full dimension-by-dimension explanation.

Quality dimension badges

Badge

What it means

Strong

This dimension is healthy — no action needed.

Good

Performing well, minor considerations may apply.

Stable

Low variance or spread — results are consistent.

Moderate

Acceptable but worth reviewing before reporting.

Acceptable

Within usable range, but not ideal.

Weak

This dimension has concerns that may affect reliability.

Unstable

High variance or failed check — treat results with caution.

Reasoning

Answers "why did the result come out this way?" — translates statistical logic into plain language. Open when the verdict is Moderate or lower.

Step 5 — Reasoning

What the result means

Plain-language interpretation of the main finding — why the numbers came out the way they did given your data.

How reliable is this result?

Explains stability, variance, and consistency. For cross-validation: fold-to-fold spread and CV standard deviation.

What to consider next

Practical guidance: what the result supports, what it doesn't prove, and what follow-up steps would strengthen confidence.

Method-specific caveats

Warnings specific to the method — e.g. data leakage risks for cross-validation, multicollinearity for regression.

Diagnostic Guide

A reference table showing numeric thresholds for each dimension — what score range counts as Strong vs Moderate.

Bottom Line

A one-paragraph summary of the overall verdict with the key numbers — safe to use as a standalone takeaway.

Statistics

The full technical record — APA-formatted, with all numbers, tables, plots, and export options. Use when you need to publish, present, or hand off.

Step 6 — Statistics

Metric summary cards

Key numbers at a glance — the primary metric, spread/error, min/max, and sample size. Varies by method.

APA-style writeup

A publication-ready paragraph with all required statistics. Safe to copy directly into a manuscript.

Key Insights

Bulleted technical highlights — the most important numbers and what they imply, written for a statistically literate reader.

Visualizations

Charts specific to the method — Q-Q plots, residual plots, ROC curves, fold score charts, dendrograms, scree plots, and more.

Detailed tables

Coefficient tables, ANOVA tables, fold-by-fold results, cluster profiles, fit indices — whatever is standard for the chosen method.

Model parameters

A record of every setting used to run the analysis — so the result is fully reproducible.

Continue your analysis

Recommended next steps: Go Deeper, Consider Instead, and Related analyses tailored to your result.

Export menu

Download buttons for CSV, PNG, Word document, and reproducible Python / R code.

How to use all three tabs

Read the Summary.

Note the headline finding and check the overall verdict chip at the bottom.

If the verdict is Strong — you can stop here.

The Summary is self-contained. Report the headline finding with confidence.

If the verdict is Moderate or Weak — open Reasoning.

The Reasoning tab shows exactly which quality dimension is dragging the score down and what to do about it.

If you need to publish or present — open Statistics.

Copy the APA writeup, download the charts, or export the reproducible code.

Use "Continue your analysis" to decide what's next.

Go Deeper, Consider Instead, or Related analyses are suggested based on your result.

Common questions

Exporting & AI Chat

Take your analysis out of Statistical Lab — as data, an image, a Word document, or reproducible code. Or ask the AI Chat to help you interpret and communicate the result.

Export button location

CSV Spreadsheet

Further analysis in Excel, Google Sheets, or R / Python

Your original dataset enriched with the analysis output — cluster labels, predicted classes, residuals, or any other column the method adds.

Contains

All original columns from your dataset
New columns added by the analysis (e.g. predicted class, cluster label)
One row per original observation

Use it when

You want to keep working with the augmented data in another tool.

PNG Image

Slides, dashboards, quick sharing

A high-resolution screenshot of the entire Statistics tab — APA writeup, tables, and diagnostic plots captured as a single image.

Contains

The APA-style writeup paragraph
All result tables
All diagnostic plots
Captured at 2× resolution for sharpness on Retina displays

Use it when

You need a single image to drop into a presentation, email, or chat message.

Word Document (.docx)

Papers, formal reports, manuscripts

A fully editable Word file with the APA writeup, formatted tables, and embedded plots — a complete first draft ready for editing.

Contains

APA-formatted text paragraph — copy-paste ready
Tables formatted with Word styles (editable)
Embedded plot images

Use it when

You are writing a paper, thesis, or formal report and want to start from a complete draft.

Python or R Code

Reproducibility, peer review, custom modification

The exact script used to produce the result — runnable in your own environment with the same parameters you set in the UI.

Contains

Library imports and setup
The analysis call with your exact parameter values
Both Python (.py) and R (.R) versions available

Use it when

You need to reproduce the analysis offline, share it with a collaborator, or extend it beyond what the UI exposes.

AI Chat

AI Chat — ask anything about your result

Once an analysis runs, a circular toggle button appears at the bottom of the Statistics tab. Click it to open the AI Chat panel — the AI already knows which analysis was run and what the result was, so you can ask follow-up questions without re-explaining the context.

Explain results in plain language

Translates statistical output into plain sentences — useful for stakeholders who don't read p-values.

Interpret key statistics

Ask about any number in the result — effect sizes, confidence intervals, fit indices — and get a clear explanation.

Draft an APA summary

Request a publication-style paragraph based on the result, ready to copy into a manuscript.

Suggest next steps

Ask "so what?" — what the result implies in practice and what you should consider doing next.

Which format do I need?

Continue analyzing the data in Excel or Sheets

CSV

Drop a snapshot into a slide deck

PNG

Write a research paper or thesis

Word Document

Reproduce the result on your own machine

Code

Hand the analysis off for peer review

Code + Word

Caveats & tips

R code is independently written

Calculations on the platform run in Python. The R script is a faithful re-implementation, but small numerical differences can occur due to library differences. Verify before submission.

Word documents are starting drafts

The exported .docx is a complete first draft. Match the language to your journal's style guide before submitting.

PNGs are exported at 2× resolution

Images look sharp in slides and on Retina displays. If you need a vector format, use the Word document and extract the embedded plots.

Common issues

Guide & Terminology

Two reference tools you can open from any analysis page without losing your place — the Analysis Guide for the method you're using, and the Statistical Glossary for any term you don't recognize.

Analysis Guide

"Analysis Guide" button — top right of any analysis page

A method-specific reference explaining what the algorithm does, when to use it, what assumptions it makes, and what every parameter controls. Changes based on the analysis you're on.

Statistical Glossary

"?" button — top right of any analysis page

A searchable dictionary of statistical terms — p-values, effect sizes, confidence intervals, and method-specific vocabulary. Same on every page.

Guide & Terminology button location

Example — Cross-Validation Analysis Guide

What's inside every Analysis Guide

What is [Method]?

A plain-language explanation of what the algorithm does conceptually — how it works, what problem it solves, and how it differs from similar methods.

e.g. "Cross-validation is a technique for estimating generalization performance — how well a model will perform on unseen data."

Why use it?

The case for choosing this method over alternatives — key advantages, typical use cases, and what kind of data or research question it suits best.

e.g. "CV uses all data for both training and testing, reports mean AND std, and detects overfitting."

Methods / Variants

Where applicable, a breakdown of the variants available in the UI — what each one does, when to choose it, and practical tradeoffs.

e.g. K-Fold, Stratified K-Fold, Repeated K-Fold — each with when to use and what the tradeoff is.

Key insight

One or two sentences that capture the most important thing to understand about interpreting this method's output.

e.g. "The CV mean summarizes performance; the CV std measures stability — a lower mean but lower std may be more reliable."

Parameter reference

Every setting available in Step 2, with a plain-language description, the recommended default, and when you'd change it.

e.g. n_folds: "5 is a good default — 10 folds give lower bias but are slower and higher variance."

Statistical Glossary

Sample glossary terms

p-value

The probability of observing data at least as extreme as yours, assuming the null hypothesis is true. A small p-value is evidence against H₀ — not proof the effect is large or important.

Effect size

How large the difference or relationship is, independent of sample size. Unlike p-values, effect size tells you whether a result is practically meaningful.

Confidence interval

A range of plausible values for a parameter given the data. A 95% CI means: if you repeated the study 100 times, about 95 of those intervals would contain the true value.

Overfitting

When a model fits the training data too closely and fails to generalize to new data. High training accuracy but low cross-validated accuracy is the typical signal.

Homoscedasticity

Equal variance of residuals across all levels of the predictor. A key assumption of linear regression — violations can make standard errors unreliable.

Common issues