Background
Statistical numeracy is essential for understanding health-related risks and making informed medical decisions. However, this concept has not been investigated with probabilistic national samples or compared cross-culturally. We sought (1) to investigate differences in the level of statistical numeracy between 2 countries with different educational and medical systems—the United States and Germany; (2) to study the relationship between statistical numeracy and demographic characteristics such as age, sex, and education; and (3) to test whether a subjective measure of numeracy is a valid indicator of objective measures.
Methods
In a survey of probabilistic, representative national samples in Germany and the United States, conducted in July and August 2008, we asked questions testing objective and subjective statistical numeracy.
Results
German participants had higher numeracy skills than did US participants. On average, 68.5% (SE, 1.1%) and 64.5% (SE, 1.3%), respectively, of items testing objective numeracy were answered correctly. Subjective estimates of numeracy were a good indicator of the objective measures. There is a large gap in numeracy skills between persons with lower and higher educational levels, particularly in the United States.
Conclusions
Physicians should be aware that many patients may not understand all information relevant to making an informed decision. Fortunately, they can identify such patients and use nonnumerical presentation formats, such as graphical displays and analogies, to communicate important statistical information.
What percentage is 20 of 100? For most readers of this article, the answer is straightforward. Many patients, however, have difficulties grasping this and other basic statistical concepts.1-4 Statistical numeracy is part of a more general concept of quantitative or mathematic literacy5,6 and includes understanding the concept of a random toss and knowing how to perform elementary calculations with percentages.1,2 This knowledge is essential for understanding risks associated with different diseases, medical screenings, and treatments and, consequently, for making informed decisions about health.7-12 The present article describes a cross-cultural study investigating 3 important unanswered questions about statistical numeracy in the health context.
First, are there differences in the level of statistical numeracy between countries with different educational and medical systems, such as the United States and Germany? Several large national and international studies have included items that measure a broader concept of quantitative literacy (eg, the Programme for International Student Assessment,13 Trends in International Mathematics and Science Study,14 National Assessment of Adult Literacy,15 and International Adult Literacy Survey).16 Most of these studies, however, are limited to student populations and/or do not deal specifically withstatistical numeracy, in particular not in the context of health. Given a stronger emphasis on mathematics and science education in the early grades in Germany compared with the United States,17 it is possible that statistical numeracy is higher in Germany. However, the opposite could also be true. Because most health expenditure in the United States is privately based (55%)18 and because patient-targeted advertising of prescription drugs is allowed, US residents may have more experience in dealing with information about medical risks and, consequently, have higher statistical numeracy than the residents of Germany, where only 23% of health expenditure is privately based.
Second, what is the relationship between statistical numeracy and demographic characteristics such as age, sex, and education? To promote the ideal of informed and shared medical decision making,19-21 it is essential to identify low-numeracy groups and to educate them in using quantitative statistical information or communicate information about health using nonquantitative formats such as visual displays and analogies.22-25 However, all of the extant studies of statistical numeracy in health used nonprobabilistic samples of patients and students. Although informative about the numeracy skills of certain narrow groups, these studies do not allow for generalizations to any broader population. Consequently, they do not allow us to draw conclusions about the relationship between numeracy and demographic characteristics such as sex, age, and education.
Third, are objective measures of statistical numeracy equivalent to recently proposed subjective measures of this concept?26 In studies of convenience samples of patients and an Internet population, subjective measures were found to be less burdensome for the participants, at the same time approaching predictive validity of the objective measures of statistical numeracy.27 Subjective measures of numeracy, however, have not yet been administered to probabilistic, representative national samples that would enable researchers to study the relationship between objective and subjective numeracy in different demographic subgroups or to conduct cross-cultural comparisons.
To answer these questions, we conducted 2 studies on probabilistic national samples in the United States and Germany. This enables us to compare—for the first time, to our knowledge—statistical numeracy skills of adult populations in these countries and in different sociodemographic groups within the countries. In the first study, we investigated objective statistical numeracy skills in both countries. In the second study, we tested the applicability of a subjective numeracy scale as a proxy for objective skills.
The Ethics Committee of the Max Planck Institute for Human Development approved the methods used herein, and all participants consented to participation through an online consent form at the beginning of the survey.
Study 1 was conducted from July 10 through 24, 2008, on probabilistic national samples in the United States (n = 1009) and Germany (n = 1001) using panels of households selected through probabilistic telephone (random-digit dial) surveys and afterward supplied with equipment that enabled them to complete computerized questionnaires. Thus, existing Internet access or lack thereof did not affect households' ability to become panel members. The panels—built and maintained by the market research institute Forsa in Germany (http://www.forsa.de; 20 000 households [11% of those in the initial sample]) and the online research panel Knowledge Networks in the United States (http://www.knowledgenetworks.com; 43 000 households [16% of those in the initial sample])—allow for statistical inference to the general population. These panels were already used successfully in a number of studies in the areas of health, medicine, political and social sciences, economics, and public policy.28-32 Methodological studies have shown that data from such panels are comparable to the results obtained through traditional probabilistic surveys.33 The possibility of using computerized questionnaires enabled us to ask relatively complex questions involving numerical and visual information about medical treatments on a nationally representative sample.
Of the panel members who were invited to participate in the study, 52.0% in Germany and 53.8% in the United States completed the questionnaire. This is a very good response rate for this survey mode.34 The sample structure is shown in Table 1. According to official statistics, the percentage of population with less education is much higher in Germany than in the United States, so we oversampled the less-educated population in the United States to ensure equivalent sample sizes of less-educated participants in both countries. This was important because the study was conducted within a project that focused specifically on less-educated people. To adjust for this and for minor discrepancies due to nonresponse, we used design (in the United States) and poststratification (in both countries) survey weights to bring the sample proportions in line with the population proportions. The goal of such weighting adjustments is to correct for known differences between sample and population in the hope of providing unbiased survey estimates.35,36 Standard errors in all analyses were estimated using the Taylor series linearization method for estimating population characteristics from complex sample survey data, by means of commercially available software (SPSS Complex Samples procedures, SPSS version 17.0.1 [SPSS, Inc, Chicago, Illinois] and SUDAAN [RTI International, Research Triangle Park, North Carolina]).37
Statistical numeracy was measured on a scale including 3 items developed by Schwartz et al3 and 6 items developed by Lipkus et al,1 for a maximum score of 9 (Table 2). The questions were translated into German by a native German speaker with excellent knowledge of English, back-translated into English by another person with equivalent language skills, and compared with the original English version. Any inconsistencies were resolved by a native German speaker and an excellent English speaker familiar with the research objectives. Finally, the English and German versions were compared and edited by a bilingual German and English speaker. When programming the questionnaire, special care was taken to ensure that the interface looked the same in the German and English versions. In sum, we believe that the materials in English and German were comparable.
To investigate the equivalence of objective1,3 and subjective26 measures of numeracy skills in general populations of Germany and the United States, we conducted a study with a subset of the sample of participants in study 1.
Study 1 participants were ordered by their objective numeracy scores, and those with the highest and lowest scores were invited to participate in study 2, conducted 3 weeks after study 1 (August 1-15, 2008), resulting in a sample of 498. Basic demographic characteristics of the sample are given in Table 3. This sample enables us to compare low- and high-numeracy groups within each country as well as each of those groups between countries.
In Germany, 83.1% of all participants in study 1 completed study 2, and in the United States, 65.8%. The response rates among high- and low-numeracy participants were similar in both countries (eg, it was not the case that the low-numeracy group had lower response rates). The low- and high-numeracy groups in Germany represent, respectively, approximately the bottom and top thirds of the population sorted by numeracy scores. Because of lower response rates in the United States, the low- and high-numeracy US groups represent, respectively, approximately the bottom and top 40% of the population. Nevertheless, the average numeracy scores in both groups were still somewhat lower in the United States (Table 3).
Subjective numeracy was measured with 7 of the 8 items developed by Fagerlin et al.26,27 The items were answered on a 6-point scale, where higher values indicate higher perceived numeracy. We excluded the item “How good are you at calculating a 15% tip?” because it is culturally specific to the United States. Table 4 lists all of the items used. The questionnaire was developed in the same way as that for study 1. Half of the participants were randomly assigned to complete these items before a set of questions involving relatively demanding numerical calculations of risk reductions, and the remaining half completed the items after answering the questions (for more details on these questions, see Garcia-Retamero and Galesic23).
In the first study, conducted on probabilistic national samples in the United States and Germany, we investigated whether there are differences between the 2 countries in the level of statistical numeracy and sought to determine the relationship between statistical numeracy and demographic characteristics.
The statistical numeracy scale has satisfactory internal consistency; the Cronbach α was 0.73 in Germany and 0.80 in the United States. The percentage of correct answers to each of the items is presented in Table 2. For further analysis, we transformed the original scores ranging from 0 to 9 to a scale of 0% to 100%, indicating the percentage of the 9 items that were answered correctly.
As shown in Table 5, German participants had higher numeracy skills than did US participants. On average, 68.5% vs 64.5% of the items were answered correctly. This difference remained after controlling for differences in sex, age, education, and income between the 2 countries.
On the level of each country, sex, age, and education were all related to the numeracy score. In both countries, men had higher scores than did women. Numeracy skills dropped with age (r = −0.13 [95% confidence interval, −0.20 to −0.06] in Germany and −0.12 [−0.19 to −0.05] in the United States) and increased with education (0.28 [0.21-0.35] in Germany and 0.50 [0.44-0.56] in the United States) and income (0.20 [0.13-0.27] in Germany and 0.32 [0.25-0.39] in the United States). When we entered sex, age, education, and income together in a regression model, all 4 showed independent effects in Germany but, in the United States, only sex, education, and income explained differences in numeracy scores, whereas the effect of age was no longer present.
The inequality in numeracy skills was larger in the United States than in Germany, as reflected in the ratio between the scores in the 90th and 10th percentiles of the participants ordered by their scores. This ratio was 4.5 in the United States compared with 3.0 in Germany. The inequality was visible in particular in the average scores of people with low educational attainment vs highly educated people in the United States: 39.9% vs 83.1% correct compared with 62.3% vs 80.7% in Germany (Table 5).
In the second study, we investigated whether subjective measures of statistical numeracy correspond to objective measures. If a subjective numeracy scale can differentiate between people with objectively low and high numeracy skills across different demographic groups, this would speak to its wide applicability. In addition, we tested whether the subjective perceptions of one's numeracy are dependent on the context in which they are measured, namely, before or after answering several difficult numerical questions. If the scale is sensitive to context, this would limit its applicability because the results in clinical practice would depend on patients' recent experiences with quantitative information.
To compare the scores on the subjective numeracy scale with the objective numeracy data, we recoded each item—originally answered on a scale of 1 to 6—to be 0 when the answer was 3 or less or 1 when the answer was 4 or higher. Means and standard deviations of answers to each of the items are presented in Table 4. For further analyses, we summed the recoded answers to the 7 items and transformed the resulting scores to a scale of 0% to 100%, indicating the percentage of answers to the 7 items that reflected high subjective numeracy.
The subjective numeracy scale has satisfactory internal consistency. The Cronbach α ranged from 0.75 to 0.87 across the 2 countries and groups with high vs low objective numeracy skills. The scores on the scale were not sensitive to context. They were similar when the items were positioned before or after the tasks involving difficult calculations (average before/after difference, 2.8 [95% confidence interval, −5.4 to 11.0]); this was so for high- and low-numeracy groups in both countries.
How well does the subjective numeracy scale differentiate between participants who are very high vs very low in terms of their objective numeracy skills (as determined in study 1)? The mean (SE) subjective numeracy scores for these 2 extreme groups were 45.5 (3.7) and 80.0 (2.7) in Germany, and 38.9 (4.4) and 79.0 (2.5) in the United States. These differences were stable across sex, age, education, and income groups. However, compared with the differences in objective numeracy scores between the 2 extreme groups (mean [SE], 37.2 [2.0] vs 95.5 [0.7] in Germany and 35.6 [2.8] vs 90.9 [1.1] in the United States; Table 3), the differences in subjective numeracy scores were smaller.
An average citizen of Germany and the United States could answer only two-thirds of 9 relatively simple items testing basic statistical numeracy skills (Table 5). Statistical numeracy was somewhat lower for women than for men, and it dropped slightly with age but only in Germany. Across most demographic groups, German participants achieved somewhat higher scores than did US participants. An exception was the group with the highest education, in which US participants fared somewhat better. Differences in educational systems—in particular the stronger focus on mathematics and science education in Germany from an early age17,38—are likely to be the main factor underlying the differences in statistical numeracy between countries.
The inequality between people with more or less education in the United States was much larger than in Germany. Although a college-educated American could answer 83.1% of items correctly, those with less than a high school diploma could do so for only 39.9% of the items. Even for those who had a high school education, the average percentage of correct answers in the United States was only 56.4%, lower than the average for German participants who had not completed a high school education (62.3%; Table 5).
The large differences in numeracy between persons with lower and higher educational levels have varying consequences in different medical systems. For instance—at least before the new health care reform—less-educated US residents are particularly likely to be in a position to have to decide about their medical care. Although 99.7% of Germans have health insurance,39 35% of US residents—in particular those of lower socioeconomic status—had insufficient or no coverage40 and had to decide whether to pay for various treatments and screenings themselves.41 Given their low statistical numeracy, they might have had difficulty making good decisions.
The present article is, to the best of our knowledge, the first to describe a study investigating statistical numeracy skills in probabilistic national samples in the United States and Germany, allowing comparison of different demographic groups within each country as well as comparison between the 2 countries. It is also the first to describe a cross-cultural comparison of objective and subjective measures of statistical numeracy.
At the same time, a limitation of our studies is that levels of numeracy in the general population could be even lower than our results suggest. To become members of the national panels from which our samples were selected, participants had to accept having a computer or special TV set with Internet access installed in their homes. It is possible that people with low numeracy refused this more often than did those with high numeracy skills. On the other hand, our sample represents accurately the overall population in terms of education. Furthermore, there is no particular reason to expect that numeracy but not general educational level would be related to higher rates of refusal.
This study has clear implications for medical practice. Physicians should not assume that all patients can understand simple statistical indicators that are often used to express risks and benefits of medical screenings and treatments. For example, approximately 20% of the German and US participants could not say which of the following numbers represents the biggest risk of getting a disease: 1%, 5%, or 10%. Ratios were even more difficult; almost 30% could not answer whether 1 in 10, 1 in 100, or 1 in 1000 represents the largest risk. Similarly, almost 30% of the study participants in both countries could not state what percentage 20 of 100 is, and most (53.7% of German and 76.5% of US participants) could not transform 1 of 1000 to a percentage. Furthermore, many participants lacked the understanding of the concept of random draw. When asked how many times a fair coin would come up heads in 1000 flips, more than one-fourth of the study participants in both countries gave answers that were obviously incorrect (<400 or >600 times).
Given the low levels of statistical numeracy of many patients, physicians could use items from the subjective numeracy scale to identify patients who may have problems understanding numerical information. If they have such a patient, physicians could communicate risks and benefits of treatments by means of formats that do not require high levels of numeracy, such as visual displays22,23,42,43 and analogies,24,25 rather than numerical expressions. In this way, patients with low numeracy skills could understand statistical information and make better decisions about their health.
Correspondence: Mirta Galesic, PhD, Center for Adaptive Behavior and Cognition, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany (galesic@mpib-berlin.mpg.de).
Accepted for Publication: September 14, 2009.
Author Contributions: Both authors had full access to all the data (including statistical reports and tables) in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. Study concept and design: Galesic and Garcia-Retamero. Acquisition of data: Galesic and Garcia-Retamero. Analysis and interpretation of data: Galesic and Garcia-Retamero. Drafting of the manuscript: Galesic and Garcia-Retamero. Critical revision of the manuscript for important intellectual content: Galesic and Garcia-Retamero. Statistical analysis: Galesic and Garcia-Retamero. Obtained funding: Galesic and Garcia-Retamero. Administrative, technical, and material support: Galesic. Study supervision: Galesic.
Financial Disclosure: None reported.
Funding/Support: This study was supported by the Foundation for Informed Medical Decision Making (United States) and the Max Planck Society (project “Helping People With Low Numeracy to Understand Medical Information”) and grant PSI2008-02019 from the Ministerio de Educación y Ciencia (project “How to Improve Understanding of Risks About Health”).
Role of the Sponsors: The funding sources did not affect the study design, data collection, analysis and interpretation of the data, writing of the report, or the decision to submit the report for publication.
1.Lipkus
IMSamsa
GRimer
BK General performance on a numeracy scale among highly educated samples.
Med Decis Making 2001;21
(1)
37- 44
PubMedGoogle ScholarCrossref 2.Davids
SLSchapira
MMMcAuliffe
TLNattinger
AB Predictors of pessimistic breast cancer risk perceptions in a primary care population.
J Gen Intern Med 2004;19
(4)
310- 315
PubMedGoogle ScholarCrossref 3.Schwartz
LMWoloshin
SBlack
WCWelch
HG The role of numeracy in understanding the benefit of screening mammography.
Ann Intern Med 1997;127
(11)
966- 972
PubMedGoogle ScholarCrossref 4.Woloshin
SSchwartz
LMMoncur
MGabriel
STosteson
ANA Assessing values for health: numeracy matters.
Med Decis Making 2001;21
(5)
382- 390
PubMedGoogle ScholarCrossref 5.Golbeck
ALAhlers-Schmidt
CRPaschal
AMDismuke
SE A definition and operational framework for health numeracy.
Am J Prev Med 2005;29
(4)
375- 376
PubMedGoogle ScholarCrossref 7.Estrada
CAMartin-Hryniewicz
MPeek
BTCollins
CByrd
JC Literacy and numeracy skills and anticoagulation control.
Am J Med Sci 2004;328
(2)
88- 93
PubMedGoogle ScholarCrossref 8.Reyna
VFBrainerd
CJ The importance of mathematics in health and human judgment: numeracy, risk communication, and medical decision making.
Learn Individ Differ 2007;17
(2)
147- 15910.1016/j.lindif.2007.03.010
Google ScholarCrossref 9.Nelson
WReyna
VFFagerlin
ALipkus
IPeters
E Clinical implications of numeracy: theory and practice.
Ann Behav Med 2008;35
(3)
261- 274
PubMedGoogle ScholarCrossref 11.Rothman
RLHousam
RWeiss
H
et al. Patient understanding of food labels: the role of literacy and numeracy.
Am J Prev Med 2006;31
(5)
391- 398
PubMedGoogle ScholarCrossref 12.Cokely
ETKelley
CM Cognitive abilities and superior decision making under risk: a protocol analysis and process model evaluation.
Judgm Decis Mak 2009;4
(1)
20- 33
Google Scholar 13.Programme for International Student Assessment (PISA), The PISA 2003 Assessment Framework: Mathematics, Reading, Science, and Problem Solving Knowledge and Skills. Paris, France Organisation for Economic Co-operation and Development2003;
14.Gonzales
PGuzmán
JCPartelow
L
et al. Highlights From the Trends in International Mathematics and Science Study (TIMSS) 2003. Washington, DC US Dept of Education, National Center for Education Statistics2004;
15.Kutner
MGreenberg
EJin
YPaulsen
C The Health Literacy of America's Adults: Results From the 2003 National Assessment of Adult Literacy (NALS). Washington, DC US Dept of Education, National Center for Education Statistics2006;
16.Tuijnman
A Benchmarking Adult Literacy in America: An International Comparative Study: International Adult Literacy Survey (IALS). Washington, DC Dept of Education, Office of Vocational and Adult Education2000;
17.Rinderman
H The g-factor of international cognitive ability comparisons: the homogeneity of results in PISA, TIMSS, PIRLS and IQ-tests across nations.
Eur J Pers 2007;21
(5)
667- 70610.1002/per.634
Google ScholarCrossref 20.Frosch
DLKaplan
RM Shared decision making in clinical medicine: past research and future directions.
Am J Prev Med 1999;17
(4)
285- 294
PubMedGoogle ScholarCrossref 22.Galesic
MGarcia-Retamero
RGigerenzer
G Using icon arrays to communicate medical risks: overcoming low numeracy.
Health Psychol 2009;28
(2)
210- 216
PubMedGoogle ScholarCrossref 23.Garcia-Retamero
RGalesic
M Communicating treatment risk reduction to people with low numeracy skills: a cross-cultural comparison.
Am J Public Health 2009;99
(12)
2196- 2202
PubMed10.2105/AJPH.2009.160234
Google ScholarCrossref 25.Galesic
MBarton
AWegwarth
OGaissmaier
WGigerenzer
G Do analogies help elderly people understand medical information [abstract]?
Int J Psychol 2008;43
(3-4)
32
Google ScholarCrossref 26.Fagerlin
AZikmund-Fisher
BJUbel
PAJankovic
ADerry
HASmith
DM Measuring numeracy without a math test: development of the Subjective Numeracy Scale.
Med Decis Making 2007;27
(5)
672- 680
PubMedGoogle ScholarCrossref 27.Zikmund-Fisher
BJSmith
DMUbel
PAFagerlin
A Validation of the Subjective Numeracy Scale: effects of low numeracy on comprehension of risk communications and utility elicitations.
Med Decis Making 2007;27
(5)
663- 671
PubMedGoogle ScholarCrossref 28.Baker
LWagner
THSinger
SBundorf
MK Use of the Internet and e-mail for health care information: results from a national survey [published correction appears in
JAMA. 2003;290(3):334].
JAMA 2003;289
(18)
2400- 2406
PubMedGoogle ScholarCrossref 29.Jacoby
WG Value choices and American public opinion.
Am J Pol Sci 2006;50
(3)
706- 72310.1111/j.1540-5907.2006.00211.x
Google ScholarCrossref 30.Lerner
JSGonzalez
RMSmall
DAFischhoff
B Effects of fear and anger on perceived risks of terrorism: a national field experiment.
Psychol Sci 2003;14
(2)
144- 150
PubMedGoogle ScholarCrossref 31.Miller
JDScott
ECOkamoto
S Science communication: public acceptance of evolution [published correction appears in
Science. 2006;313(5794):1739].
Science 2006;313
(5788)
765- 766
PubMedGoogle ScholarCrossref 32.Schlenger
WECaddell
JMEbert
L
et al. Psychological reactions to terrorist attacks: findings from the National Study of Americans' Reactions to September 11.
JAMA 2002;288
(5)
581- 588
PubMedGoogle ScholarCrossref 33.Krosnick
JNie
NRivers
D Web survey methodologies: a comparison of survey accuracy. Paper presented at: Annual Conference of the American Association for Public Opinion Research; May 14, 2005; Miami Beach, FL.Accessed October 3, 2008
34.Vehovar
VBatagelj
ZLozar Manfreda
KZaletel
M Nonresponse in Web surveys. Groves
RMDillman
DAEltinge
JLLittle
RJA
Survey Nonresponse. New York, NY John Wiley & Sons Inc2002;229- 242
Google Scholar 35.Bethlehem
JG Weighting nonresponse adjustments based on auxiliary information. Groves
RMDillman
DAEltinge
JLLittle
RJA
Survey Nonresponse. New York, NY John Wiley & Sons Inc2002;275- 288
Google Scholar 36.Gelman
ACarlin
JB Poststratification and weighting adjustments. Groves
RMDillman
DAEltinge
JLLittle
RJA
Survey Nonresponse. New York, NY John Wiley & Sons Inc2002;289- 302
Google Scholar 37.Siller
ABTompkins
L The big four: analyzing complex sample survey data using SAS, SPSS, STATA, and SUDAAN. Paper presented at: Thirty-First SAS Users Group International conference (SUGI); March 27, 2006; San Francisco, CA.
http://www2.sas.com/proceedings/sugi31/172-31.pdf. Accessed October 30, 2008
38.Stigler
JWGonzales
PKawanaka
TKnoll
SSerrano
A The TIMSS Videotape Classroom Study: Methods and Findings From an Exploratory Research Project on Eighth-Grade Mathematics Instruction in Germany, Japan, and the United States. Washington, DC US Dept of Education, National Center for Education Statistics1999;
40.Schoen
CDoty
MMCollins
SRHolmgren
AL Insured but not protected: how many adults are underinsured?
Health Aff (Millwood) January-June2005;
((suppl Web exclusives))
W5289- W5- 302
PubMed10.1377/hlthaff.W5.289
Google Scholar 41.Schoen
COsborn
RDoty
MMBishop
MPeugh
JMurukutla
N Toward higher-performance health systems: adults' health care experiences in seven countries, 2007.
Health Aff (Millwood) 2007;26
(6)
w717- w734
PubMed10.1377/hlthaff.26.6.w717
Google ScholarCrossref 42.Lipkus
IM Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations.
Med Decis Making 2007;27
(5)
696- 713
PubMedGoogle ScholarCrossref