European Fitness Evaluation Institute · Public standard

EFEI-02001

Comprehensive Evaluation Criterion for Fitness Equipment

This document is the authoritative methodology summary for FitnessTest evaluations. It binds together the operational lifecycle, the five weighted quality dimensions every product receives, the category expert review, and the mathematics used to decide FitnessTest Certified™ eligibility—all in one place for brands, laboratories, and the public registry.

Version 1.0Effective 2026-01Annual review cycle
Sample FitnessTest evaluation badge layout: rating band, score, product and brand lines, test date, evaluation ID, and report URL.
Illustrative sample. Live badges show product-specific results and the published verification link.

Purpose & principles

EFEI-02001 exists so that every product submitted to FitnessTest is measured against the same transparent rules. The standard encodes independence (outcomes follow data, not sponsorship), repeatability (documented steps and evidence), and public accountability (methods and results are published or registered).

  • Applies to all fitness hardware and closely coupled digital accessories evaluated under the program.
  • Works with category annexes (A1–A4) that spell out expert criteria and optional weight adjustments.
  • Feeds the FitnessTest Certified™ decision and the consumer-facing evaluation registry.

Definitions

Base score
Weighted aggregate of five dimension scores (§3), each on the 1.0–5.0 German academic band where lower is better.
Expert score
Single 1.0–5.0 score from the category panel against the annex checklist for the assigned product class (§4).
Comprehensive score
Linear combination of base and expert scores (§5) used for certification gating (§6).
Annex A
Laboratory and desk procedures that must be executed to produce each dimension score.

1. Scope

This standard applies to all fitness equipment evaluated by FitnessTest. It defines the end-to-end evaluation lifecycle, the base score (five dimensions shared by every product), the category-specific expert score, and the comprehensive score used for certification and publication. Operational stages are transparent and repeatable for every product accepted into the programme.

2. Evaluation lifecycle

The stages below mirror the public FitnessTest journey. Typical elapsed time from a lab-ready sample to a published outcome is about six to eight weeks in the standard queue; intake confirms exact dates. Expedited lanes, when offered, compress scheduling only—the technical content of this standard still applies in full.

  1. Request and intake. The brand submits structured product and contact data. FitnessTest confirms the primary category (§4), the governing annex, commercial terms, and logistics. No laboratory work starts before intake closes.
  2. Ship and receive. Hardware moves under an agreed chain of custody. Identity, configuration, and condition at arrival are recorded, culminating in a documented baseline assessment.
  3. Laboratory testing and analysis. Technicians execute Annex A for each dimension, preserving traceable evidence (logs, imagery, calibration references). Parallel category checks from the annex may run at this stage.
  4. Validation. Independent technical review confirms completeness, calibration validity, and adherence to the published protocol before scores are locked.
  5. Scoring. Dimension scores feed the base score (§3). The expert panel finalises the expert score (§4). The comprehensive score is calculated (§5).
  6. Publication and registry. Reports, summaries, and registry entries are released with verifiable identifiers. Products that do not certify still receive a factual publication (§6).
  7. Badge licensing. Qualifying products may license FitnessTest Certified™ assets under programme rules, including active subscription obligations spelled out commercially.

3. Base score

Each dimension i receives a score si ∈ [1.0, 5.0] with one decimal precision after rounding. Weights wi sum to 1.00:

Base score = Σi=1..5 (wi × si)

Default weights: Safety & Materials 0.24 · Performance & Data 0.22 · Durability 0.18 · Usability 0.20 · Ergonomics 0.16. Round the base score to one decimal before combining with the expert score.

  • Safety & Materials — 24%
  • Performance & Data Accuracy — 22%
  • Durability & Longevity — 18%
  • Usability & Experience — 20%
  • Ergonomics & Movement Fit — 16%

Annex A (below) is normative for how each si is produced. It is executed during lifecycle stage 3 and audited in stage 4.

4. Product categories and expert score

Each unit under test is assigned exactly one primary category. A multidisciplinary expert panel awards a single expert score (1.0–5.0) using the checklist and weighting rules in the matching annex. Borderline products adopt the closest category; the report states the rationale.

CategoryExamplesAnnex
Cardio equipmentTreadmills, bikes, rowers, ellipticalsEFEI-02001-A1
Strength equipmentDumbbells, kettlebells, resistance bands, racks, benchesEFEI-02001-A2
Wearables & trackersHeart rate monitors, fitness trackers, smartwatchesEFEI-02001-A3
Accessories & small equipmentMats, foam rollers, jump ropes, small toolsEFEI-02001-A4

Future annex revisions may rebalance the base versus expert weights for specialised equipment; any deviation from §5 defaults must be cited in the evaluation report and in the version history of this standard.

5. Comprehensive score

Comprehensive score = (Base × 0.70) + (Expert × 0.30)

Round once to one decimal. Both component scores are always disclosed in the public evaluation report. If a category annex prescribes different mixing weights, those weights override this paragraph for that product class and must be footnoted.

6. Certification threshold & interpretation bands

Scores follow the German academic convention: 1.0 is best, 5.0 is worst. FitnessTest Certified™ marks may only be used when the comprehensive score is ≤ 4.0. Scores strictly above 4.0 are not certified, yet the evaluation remains visible in the registry so shoppers can review methodology and outcomes.

BandGerman descriptorPublic designation
1.0 – 1.5Sehr gutFitnessTest Certified™ — Exceptional
1.6 – 2.5GutFitnessTest Certified™ — Excellent
2.6 – 3.5BefriedigendFitnessTest Certified™ — Verified
3.6 – 4.0AusreichendFitnessTest Certified™ — Performance Tested
4.1 – 5.0Nicht ausreichendNot certified (result still published)

7. Reporting requirements

Every published evaluation must include at minimum:

  • Product identification, category, and annex references.
  • Dimension scores, base score, expert score, and comprehensive score with rounding notes.
  • Certification decision and effective programme tier (if certified).
  • Summary of key evidence or links to controlled artefacts where confidentiality allows.
  • Statement of independence and version of EFEI-02001 applied.

8. Document control

FitnessTest maintains this standard and its annexes. Each amendment receives a semantic version, effective date, and change log. Annual governance review ensures alignment with laboratory capability, regulatory evolution, and consumer transparency expectations.

Annex A — Base score procedures (normative)

Execute the following steps for every dimension. Deviations require written approval from the technical director and must be disclosed in the evaluation report.

Part 1Safety & Materials

Weight in base score: 24% (w = 0.24)

  1. Identify product type and applicable standardsRecord the product category and applicable safety frameworks (e.g. strength/cardio machinery norms, low-voltage or radio equipment where relevant, and market-specific chemical or material rules).
  2. Physical hazard assessmentInspect pinch and entrapment points, stability and tip-over risk, sharp edges, guards for moving parts, and emergency stops where applicable. Document pass/fail with dated photographs.
  3. Materials and chemical compliance reviewCollect manufacturer declarations (REACH, restricted substances, coatings). Commission or reference third-party screening where the protocol requires it.
  4. Structural and surface integrityExamine welds, fasteners, load-bearing joints, and surface finish for defects that could compromise safety or accelerate failure.
  5. Dimension score (1–5)No critical non-conformities and strong alignment with requirements → 1–2. Isolated minor issues → 3. Significant gaps → 4. Critical failure or prohibited risk → 5.

Required evidence

Signed checklists, annotated photographs, declaration and test reports, incident notes.

Scoring rubric (1–5)

1–2: Meets or exceeds safety and materials expectations. 3: Minor findings. 4: Major concerns. 5: Unacceptable risk or critical non-compliance.

Part 2Performance & Data Accuracy

Weight in base score: 22% (w = 0.22)

  1. Capture all marketed performance claimsExtract rated load, speed, power, resistance curves, accuracy claims for sensors, battery life, and any quantitative marketing figures tied to the tested configuration.
  2. Mechanical and functional performance testsRun instrumented tests appropriate to the category; compare measurements to claims and record tolerance bands defined in the test plan.
  3. Sensor and data-path verification (if applicable)Validate outputs (e.g. heart rate, cadence, power estimates) against calibrated references; assess drift, latency, and consistency across sessions.
  4. Environmental and edge-case samplingRepeat priority measurements under declared operating conditions; note sensitivity to temperature, humidity, or power state where relevant.
  5. Dimension score (1–5)Claims within agreed tolerance → 1–2. Small deviations → 3. Systematic or misleading divergence → 4–5. Non-electronic products: score mechanical performance only.

Required evidence

Claim matrix, raw and summary measurement tables, calibration certificates, test logs.

Scoring rubric (1–5)

1–2: Strong alignment between claims and measurement. 3: Acceptable minor deviation. 4–5: Material inaccuracy or unsupported claims.

Part 3Durability & Longevity

Weight in base score: 18% (w = 0.18)

  1. Define endurance profileSelect cycle count, load level, and duty cycle from the category annex or approved lab protocol; document rationale when a variant is used.
  2. Baseline documentationRecord geometry, torque checks, noise baseline, and imagery before cycling. Define objective pass/fail criteria for the endurance leg.
  3. Execute endurance programmeRun the full cycle plan; log interruptions, adjustments, and intermediate inspections.
  4. Post-endurance inspectionRepeat key measurements; classify wear modes; estimate residual life or recommend re-test triggers if partial failure occurred.
  5. Dimension score (1–5)Comfortable margin beyond target with minimal wear → 1–2. Meets target → 3. Borderline wear or early warning signs → 4. Failure before target → 5.

Required evidence

Approved test plan, cycle counter logs, photographic series, failure analysis if applicable.

Scoring rubric (1–5)

1–2: Robust durability. 3: Adequate. 4: Concerning wear. 5: Premature structural or functional failure.

Part 4Usability & Experience

Weight in base score: 20% (w = 0.2)

  1. Assembly and first-use pathFollow the supplied instructions without manufacturer assistance; log time, tools required, ambiguities, and safety warnings encountered.
  2. Adjustment, storage, and daily operationEvaluate intuitiveness of adjustments, locking mechanisms, folding behaviour, footprint, and maintenance access.
  3. Documentation and labellingReview manuals, quick-start guides, on-device labels, and digital companion apps for clarity, completeness, and legally required warnings.
  4. Acoustic and vibration characterisationMeasure or subjectively rate noise and vibration under representative workloads; flag nuisance levels for residential or studio contexts.
  5. Declared user rangeCheck stated height, weight, and limb-length envelopes against adjustability actually available on the sample.
  6. Dimension score (1–5)Aggregate sub-scores using the rubric in the evaluation workbook; map the composite to the 1–5 band with written justification.

Required evidence

Assembly timeline, checklist outcomes, manual annotations, sound notes, ergonomics worksheet.

Scoring rubric (1–5)

1–2: Straightforward, quiet, well documented. 3: Minor friction. 4–5: Confusing, loud, or incomplete guidance.

Part 5Ergonomics & Movement Fit

Weight in base score: 16% (w = 0.16)

  1. Define primary use caseDocument intended training modality, skill level, and representative anthropometrics for the evaluation.
  2. Range of motion and joint alignmentAssess whether the equipment supports neutral joint tracking or forces compensations that elevate injury risk.
  3. Load distributionEvaluate pressure on hands, feet, seat, or back; flag asymmetric or excessive local loading.
  4. Comfort and strain indicatorsUse structured expert rating or instrumented proxies where available; note red-flag postures.
  5. Dimension score (1–5)Natural, balanced loading with comfortable margins → 1–2. Acceptable compromises → 3. Poor fit or elevated risk → 4–5.

Required evidence

Use-case brief, ROM worksheets, load diagrams, risk commentary.

Scoring rubric (1–5)

1–2: Strong movement fit. 3: Adequate. 4–5: Awkward mechanics or elevated strain.

For programme economics, turnaround tiers, and subscription mechanics, see the public pricing overview. For institutional background, use the methodology hub.