Reliability of a new test battery for fitness assessment of the European Astronaut corps

Petersen, Nora; Thieschäfer, Lutz; Ploutz-Snyder, Lori; Damann, Volker; Mester, Joachim

doi:10.1186/s13728-015-0032-y

Research
Open access
Published: 14 August 2015

Reliability of a new test battery for fitness assessment of the European Astronaut corps

Nora Petersen^1,2,5,
Lutz Thieschäfer³,
Lori Ploutz-Snyder⁴,
Volker Damann² &
…
Joachim Mester⁵

Extreme Physiology & Medicine volume 4, Article number: 12 (2015) Cite this article

5783 Accesses
10 Citations
4 Altmetric
Metrics details

Abstract

Background

To optimise health for space missions, European astronauts follow specific conditioning programs before, during and after their flights. To evaluate the effectiveness of these programs, the European Space Agency conducts an Astronaut Fitness Assessment (AFA), but the test–retest reliability of elements within it remains unexamined. The reliability study described here presents a scientific basis for implementing the AFA, but also highlights challenges faced by operational teams supporting humans in such unique environments, especially with respect to health and fitness monitoring of crew members travelling not only into space, but also across the world. The AFA tests assessed parameters known to be affected by prolonged exposure to microgravity: aerobic capacity (VO_2max), muscular strength (one repetition max, 1 RM) and power (vertical jumps), core stability, flexibility and balance. Intraclass correlation coefficients (ICC_3.1), standard error of measurement and coefficient of variation were used to assess relative and absolute test–retest reliability.

Results

Squat and bench 1 RM (ICC_3.1 = 0.94–0.99), hip flexion (ICC_3.1 = 0.99) and left and right handgrip strength (ICC_3.1 = 0.95 and 0.97), showed the highest test–retest reliability, followed by VO_2max (ICC_3.1 = 0.91), core strength (ICC_3.1 = 0.78–0.89), hip extension (ICC_3.1 = 0.63), the countermeasure (ICC_3.1 = 0.76) and squat (ICC_3.1 = 0.63) jumps, and single right- and left-leg jump height (ICC_3.1 = 0.51 and 0.14). For balance, relative reliability ranged from ICC_3.1 = 0.78 for path length (two legs, head tilted back, eyes open) to ICC_3.1 = 0.04 for average rotation velocity (one leg, eyes closed).

Conclusions

In a small sample (n = 8) of young, healthy individuals, the AFA battery of tests demonstrated acceptable test–retest reliability for most parameters except some balance and single-leg jump tasks. These findings suggest that, for the application with astronauts, most AFA tests appear appropriate to be maintained in the test battery, but that some elements may be unreliable, and require either modification (duration, selection of task) or removal (single-leg jump, balance test on sphere) from the battery. The test battery is mobile and universally applicable for occupational and general fitness assessment by its comprehensive composition of tests covering many systems involved in whole body movement.

Background

Assessments of physical fitness are frequently used in occupational health care settings to determine an individual’s suitability to operate in a specific environment and their capacity to perform required occupational activities without risk to their health and safety, or that of their colleagues [1, 2].

When performed periodically and systematically, these assessments may help identify small changes in an individual’s physical condition that could compromise their performance and/or ability to work safely, which can then be addressed with remedial action. Physical fitness assessments with European Space Agency (ESA) astronauts are performed to objectively quantify physical performance changes after return from space flight. To increase the quality of the data produced and support both operational and research needs, the former simple, gym-based testing protocol was replaced by the ESA’s European Astronaut Centre (EAC) Astronaut Fitness Assessment (AFA), a broader, instrumented test battery. An additional consideration is that the AFA setup must be mobile, as ESA astronauts returning from the International Space Station (ISS) may need to be tested away from ESA facilities, both in the United States and Russia, where post-flight rehabilitation is sometimes implemented. As such, not only must the test elements assess systems affected by space flight and comply with sport scientific evaluation standards, but the test equipment must also be portable and the assessment procedures implementable in various gym environments. This requires a simple test setup, but one that is still capable of producing meaningful data under “field testing” conditions, rather than the standard laboratory conditions available at EAC.

Ten individual tests are included in the AFA. These consider astronauts’ unique occupational performance profile, which is characterised by specific tasks and environmental factors, such as launch and landing, extra-vehicular activities (space ‘walks’) and ISS-specific operations whilst being exposed to microgravity (µG), and ultimately the return into the Earth’s gravity. Microgravity exposure for up to 6 months is known to induce decreases in muscle strength [3, 4], bone mineral density [5, 6], cardiovascular endurance [7, 8] and postural control [9–12], and the AFA tests are included based on this current knowledge: anthropometry (height, body mass, and body composition), hip flexibility, handgrip strength, balance, posture and gait characteristics, core, lower and upper body muscle strength, vertical jump (muscular power) and cardiovascular capacity.

A further difficulty in the development of the AFA test battery is the lack of a precise definition of the physical occupational demands of spaceflight. However, although test validity in relation to space flight occupational performance cannot be assessed at this time, the reliability of the new test battery can and should be assessed. This has not been performed previously, because EAC’s remit is to provide operational support to ESA astronauts and, historically, it has not had the resources to perform research activities. In addition, the test battery was established for organizational reasons (i.e. an increasing number of ESA long-duration space missions and increased independence of ESA from the other ISS Partners) and the battery was developed and used in parallel to operational implementation, and has undergone numerous changes in the course of development.

Towards this end, the purpose of this investigation was to report the reliability (retest correlation, systematic bias and random error) of each test element, to support the decision to keep, modify or remove them from the AFA.

Methods

Participants

Ten male subjects were recruited to participate in the study. The inclusion criteria were based on anthropometric selection standards for ESA Astronauts: healthy and matching the astronaut population in terms of body height (between 149.5 and 190.5 cm) and body mass (≤95.0 kg) [13]. The study was approved by the ethical board of the German Sport University in Cologne and all subjects provided informed written consent before participation.

Study design

The study used a test–retest design in a controlled laboratory environment, with participants making three visits to the German Sport University, with each visit separated by 7 days. Prior to the first experimental visit, participants performed a familiarisation session of the entire test battery. For experimental visits, participants arrived at approximately the same time of day, wearing the same clothes and shoes for each visit. They were instructed to not deviate from their usual training and eating habits during the testing period. To minimise measurement errors, subject position, movement speed, observer instructions, measuring instrument, location and test conditions were standardised between sessions.

The test elements were always implemented in the same order, with the aim of minimising fatigue effects (e.g. elements with a low physical demand were scheduled at the beginning of the battery prior to implementation and those requiring significant/maximal physical effort at the end) with 1–3-min rest breaks between measurements and, as with the AFA performed with ESA astronauts, all elements were completed in a 2-h time period. Consistent with normal AFA procedures, subjects ran on a treadmill for 10 min at 10 km h⁻¹ to warm up and no other specific warm-up exercises were completed. To avoid observer bias, all experimental staff were familiarised with the tests to which they were assigned and they conducted these tests for the entire study.

Anthropometry

Height was measured using a stadiometer (SECA GmbH, Hamburg, Germany). Body mass was measured and percentage body fat estimated using a combined weighing scale and bio-electrical impedance device (BC-418 MA, Tanita, Tokyo, Japan).

Flexibility

Hip flexion was measured with a Sit-and-Reach box (Sport Time, USA). Participants were instructed to reach forward as far as possible in a slow and controlled movement and hold the final position for 2 s. The distance (in cm) achieved was measured and three trials were performed, with the single best effort used for analysis.

Hip extension was measured using a modified Thomas Test [14]. Participants adopted a supine position on a bench with both legs bent over the edge. Allowing the measured leg to hang freely, participants were instructed to pull the other knee to their chest ensuring continuous firm contact of the lumbar spine with the bench surface. Hip angle (°) in relation to the bench surface was measured using an inclinometer (ACU 360, Lafayette Instrument Company, Lafayette, USA) at the mid-thigh, capturing six consecutive values in the same position. The average of those six values was used for analysis. An identical measurement was then made with the other leg.

Handgrip strength

Maximal, one-handed handgrip strength was measured for both hands using a mechanical handgrip dynamometer (Takei Scientific Instruments Co. Ltd., Niigata City, Japan). From a standing position, with their arm down by their side, participants were instructed to apply maximal force for 2 s. Participants made three attempts per hand, alternating hands each time, separated by at least 60 s rest, with the single best effort used for analysis.

Core strength

The ability to maintain a standardised position and movement was measured in three different (ventral, lateral and dorsal—in that order) positions as described in the Swiss Olympic manual of core strength assessment [15]. In each position, participants were requested to maintain both position and speed of movement (1 Hz) in synchronisation with a metronome (Ma-30, KORG metronome, Tokyo, Japan). The test was terminated when the subject was unable to maintain the required position or movement [15] after either a maximum of two warnings by the test observer or until volitional fatigue. The time (in s) to test termination was recorded in all three positions.

Muscle strength

Muscle strength was assessed by estimating the one repetition maximum (1 RM) using the Brzycki Formula [16, 17]. Bench press and squat manoeuvres were conducted in a standardised body position (feet, hands, and bench) and range of motion in relation to the rack (Smith machine, gym80, International GmbH, Gelsenkirchen, Germany). Participants were instructed to perform as many repetitions as possible at a pre-selected load with the aim of achieving volitional fatigue in less than 10 repetitions.

Balance

Ten tests of balance were performed using two different instruments (Table 1). To assess postural sway area of the body’s centre of pressure (COP) and COP displacement path length, six tests (Levels 1–6), each with an increasing level of difficulty, were performed on a pressure distribution platform (FDM-S Pressure Plate, Zebris Medical GmbH, Isny, Germany). Data were processed at 100 Hz using Zebris software, with COP area taken as the area (in mm) within the 95 % confidence interval. The last four tests (Levels 7–10) were performed on a balance board (Fig. 1) with a metal spherical base (Sport Thieme GmbH, Grasleben, Germany) instrumented with an inclinometer (BalensoSenso, Fa. Reinert, Pforzheim, Germany) inserted into the sphere underneath the board to measure angular velocity.

Table 1 Balance test Levels 1–10, implemented on pressure plate and balance board

Full size table

Test conditions with both devices were made increasingly difficult by closing the eyes, standing on one foot, tilting the head back and standing on tiptoes, which was a modification of existing balance tests in rehabilitation practice [18]. Foot and hand positions were standardised (hands on hips, surface markers for feet) and tests on one foot were always performed with the same leg. All tests lasted a maximum of 15 s. Stepping off the device surface, the hands losing contact with the hips (e.g. to grasp safety handles) and opening the eyes (for tests with eyes closed) were termination criteria for any test. In the case of termination, the maximum time achieved by the subject was recorded. All ten tests were completed in the same order, regardless of the subject’s ability to complete the full 15 s for any test.

Muscle power

Lower body muscle power was assessed from a countermovement jump (CMJ), squat jump (SJ), single-leg CMJ jumps [right (SLJ-R) and left (SLJ-L) leg] and a drop jump (DJ) from a 0.28-m platform. In bare feet, participants were instructed to jump as high as possible whilst keeping their hands in contact with their hips at all times. Each jump was attempted three times, with a break of 60 s between jumps and the single best effort for each task was used for analysis. Performance was measured by calculating jump height (m) based on measurement of GRF (N), contact time (s) and rate of force development (RFD) (N/s) were and using a force platform (5691 A, Kistler, Winterthur, Switzerland) and analysis software (TEMPLO© by Contemplas GmbH, Kempten, Germany) with a sampling rate of 300 Hz [19]. Reactive strength index (RSI) was also calculated for the drop jump as a measure of stretch–shortening cycle function.

Aerobic capacity

Aerobic capacity (VO_2max) was measured on a treadmill (PPS 55med-I, WOODWAY GmbH., Weil am Rhein, Germany) using a modified Bruce protocol [20] (Fig. 2). Belt speed was increased by 1.8 km h⁻¹ every 3 min (starting at 6 km·h⁻¹) at a constant 1 % incline until volitional fatigue, with 30-s breaks between intervals for lactate sampling (“Lactate scout”, EKF-diagnostic GmbH, Magdeburg, Germany).

Oxygen uptake was measured continuously using a spirometry system (Zan600, ZAN Austria e.U., Steyr-Dietach, Austria) and VO_2max calculated from a sliding mean over the last 30 s before fatigue. Heart rate was recorded using a chest strap and watch (RS800, POLAR, Kempele, Finland). Earlobe lactate samples were taken 1 and 5 min after the point of fatigue, and all lactate values inserted into the ERGONIZER^® software (ERGONIZER^® version 4.1.10, Kai Röcker, Freiburg, Germany) to provide a secondary estimation of VO_2max. In the AFA performed with astronauts, this estimation technique is used when the spirometry equipment is not available to make a direct measurement.

Statistical analysis

Data are reported as mean ± 1 SD unless otherwise stated. The main objectives of the assessment were to evaluate relative (Intraclass correlation coefficients, ICC_3.1) with fixed raters, and absolute (standard error of measurement, SEM, and coefficient of variation, CV) reliability of each element. The rationale for the fixed raters was that, in the operational implementation of the AFA, an individual astronaut is always tested by the same person for consecutive AFAs, and thus, in this study, the same operators always conducted specific test elements and inter-rater correlations were not assessed.

Data from the three experimental visits were analysed using a repeated measures analysis of variance (ANOVA) (k = 3; α-level = 0.05) to calculate SEM, ICC_3.1, and the F-ratio, to identify systematic bias (critical F value >3.74) potentially caused by implementation and analysis procedures, learning and/or fatigue effects [21–23]. Prior to analysis, data were tested for normal distribution and homoscedasticity, and, where not evident, a transformation was applied. Thus, a log100 transformation was applied to the following data: balance: COP sway area (Level 1, 2, 3, 5 and 6), path length (Level 4 and 5), and average rotation velocity (Level 7 and 8); jump: CMJ (height and RFD), SJ (height and RFD), SLJ (RFD for both legs jump height for SLJ-L) and DJ (RFD); VO _2max: estimation by ERGONIZER^®; core strength: dorsal position. The measures of error (SEM, CV) are reported in absolute form (‘+/−’), or in ratio form (‘×/÷’) for log100 transformed data. Normal distribution or homogeneity, although statistically tested here, may still differ for a larger sample and, therefore, both SEM and CV are always provided. Statistical analysis was performed with commercially available software (PASW Statistics 18, IBM Corporation, Armonk, USA) and “Microsoft Excel 2013” (Microsoft, Redmond, USA).

Results

Of the ten participants who were recruited into the study, only eight [(mean ± 1 SD) age 25 ± 2 years; height 1.78 ± 0.05 m; body mass 76.6 ± 8.6 kg] completed all the required procedures and were thus included in the statistical analysis.