Predictive vs. Concurrent vs. Criterion Validity

All Blogs

Table of content

Example H2

Example H3

Example H4

Example H5

Example H6

predictive concurrent vs criterion validity

Summarise this post with:

ChatGPT Perplexity Gemini Grok Claude

This assessment is validated" sounds reassuring, but it can mean very different things — and not all of them are proof that the test predicts job performance. Validity is a family of concepts, and they're not interchangeable. A test can look perfectly job-relevant and still have no measured link to who succeeds. For hiring decisions, only one branch of the family closes that gap.

This guide maps the validity family in plain language, explains the difference between predictive and concurrent validity, and shows which type you actually need before you trust an assessment to gate hiring.

In short: Criterion validity — proof that assessment scores correlate with a real job outcome — is the type that matters for hiring. It comes in two forms: predictive validity (score candidates now, measure their outcomes later) and concurrent validity (score current employees whose performance you already know). Content and construct validity support an assessment's design, but only criterion validity proves it predicts performance.

The validity family, from weakest to strongest proof

Face validity — does it look relevant?

Face validity is whether an assessment simply appears job-related to candidates and stakeholders. It affects how seriously candidates take the test, so it's worth having — but it's an impression, not evidence. A test can have high face validity and predict nothing.

Content validity — does it cover the job?

Content validity is the degree to which the assessment samples the actual knowledge, skills, and behaviors the role requires. It's established by expert judgment and job analysis. Useful and important for defensibility — but it shows the test covers the domain, not that it predicts success in it.

Construct validity — does it measure the trait it claims?

Construct validity is whether the assessment truly measures the underlying trait it says it measures — conscientiousness, numerical reasoning, influence. It's the psychometric foundation: if a “resilience” score doesn't actually capture resilience, nothing built on it can be trusted. Necessary, but still upstream of the performance question.

Criterion validity — do scores predict a real outcome?

Criterion validity is the one hiring leaders should anchor on: a measured relationship between assessment scores and a real-world performance criterion — sales, retention, conversion, ratings. This is the type that answers the only question that matters at the point of decision: do higher scorers actually perform better?

The two forms of criterion validity

Criterion validity splits into two designs, depending on when you measure the outcome.

Predictive validity scores candidates now and measures their job outcomes later — say, 6 to 12 months after hire. Because the scores are collected before anyone knows the outcome, this is the cleanest possible evidence that the assessment predicts performance. The cost is time: you wait for the outcomes to mature.

Concurrent validity scores a group of current employees whose performance you already know, and checks whether scores track results. It's faster, because there's no waiting — and it's the natural choice when you already have an assessed population with performance history. The trade-off is that current employees are a pre-selected, surviving group, which can understate the full picture.

Both are criterion validity. Both test the same thing: does the score predict the result?

Validity Type	What It Proves	How It's Established	Proof for Hiring?
Face	The test looks relevant	Perception	Weak — impression only
Content	The test covers the job domain	Job analysis, expert review	Supports design
Construct	The test measures the intended trait	Psychometric analysis	Foundational
Criterion — Predictive	Scores predict future performance	Score now, measure outcomes later	Strongest
Criterion — Concurrent	Scores track current performance	Score current performers with known results	Strong, faster

Which one do you need for hiring?

Criterion validity — predictive or concurrent. Content and construct validity tell you the assessment is well-built and job-relevant, which matters for quality and defensibility. But the claim “this assessment predicts who will perform” can only be backed by criterion evidence against a real outcome inside your context.

When PMaps validated a tele-sales assessment for a leading BFSI lender, it was a criterion (predictive) validity study: 1,118 hires, scored before hire, measured a full year later against real disbursement. That design is what let the study say — with evidence, not assertion — that high performers had scored higher.

See the worked study in the predictive validity case study, and the full method in how to validate a hiring assessment.

Don't confuse validity with reliability

A quick but crucial distinction. Reliability is consistency — whether the test gives stable results across attempts and items. Validity is whether it measures and predicts what it should. A test can be highly reliable (consistent) and still invalid (consistently measuring the wrong thing). You need both: reliability is necessary, but it isn't sufficient.

Common misconceptions

Validated" means it predicts performance. Not necessarily — it may only mean content or construct validity. Ask which type.
Face validity is enough. Looking relevant is not the same as being predictive.
A vendor's global validity transfers to your roles. Treat external evidence as a starting point; criterion validity is strongest when established in your own context.
Reliability equals validity. Consistency isn't accuracy.
One validation lasts forever. Criterion validity should be re-checked as roles and pools shift.

How PMaps helps

PMaps is an AI-powered talent assessment platform that helps enterprises improve their hiring odds — scientifically. PMaps builds on sound construct and content design, then proves it where it counts: criterion validity against your real business outcomes, so a “valid” assessment means one that predicts performance — not just one that looks the part. Trusted by 200+ enterprise clients across 7 countries. [confirm current approved figures before publish]

Want criterion-validated assessments for your roles? Book a 30-minute walkthrough and we'll tie scores to your outcomes.

→ Book a demo | Start a 7-day free trial

Oops! Something went wrong while submitting the form.

Frequently Asked Questions

Learn more about this blog through the commonly asked questions:

Q1: What is criterion validity?

Criterion validity is the degree to which assessment scores correlate with a real-world performance outcome, such as sales, retention, or ratings. It's the type of validity that proves an assessment predicts job performance, which is what matters most for hiring.

Q2: What is the difference between predictive and concurrent validity?

Predictive validity scores candidates now and measures their outcomes later, giving the cleanest evidence but taking time. Concurrent validity scores current employees whose performance is already known, so results come faster. Both are forms of criterion validity and test whether scores predict the outcome.

Q3: What is the difference between content, construct, and criterion validity?

Content validity shows the test covers the job's required knowledge and skills. Construct validity shows it measures the trait it claims to. Criterion validity shows the scores actually predict a real performance outcome — the strongest proof for hiring decisions.

Q4: Which type of validity matters most for hiring?

Criterion validity, in its predictive or concurrent form, because it directly proves higher scorers perform better. Content and construct validity support the assessment's design but don't establish prediction.

Q5: Is validity the same as reliability?

No. Reliability is consistency of results; validity is whether the test measures and predicts what it should. A test can be reliable but invalid — consistently measuring the wrong thing.