A glossary for big data in population and public health: discussion and commentary on terminology and research methods.

Abstract:

:The volume and velocity of data are growing rapidly and big data analytics are being applied to these data in many fields. Population and public health researchers may be unfamiliar with the terminology and statistical methods used in big data. This creates a barrier to the application of big data analytics. The purpose of this glossary is to define terms used in big data and big data analytics and to contextualise these terms. We define the five Vs of big data and provide definitions and distinctions for data mining, machine learning and deep learning, among other terms. We provide key distinctions between big data and statistical analysis methods applied to big data. We contextualise the glossary by providing examples where big data analysis methods have been applied to population and public health research problems and provide brief guidance on how to learn big data analysis methods.

authors

Fuller D,Buote R,Stanley K

doi

10.1136/jech-2017-209608

subject

Has Abstract

pub_date

2017-11-01 00:00:00

pages

1113-1117

issue

11

eissn

0143-005X

issn

1470-2738

pii

jech-2017-209608

journal_volume

71

pub_type

杂志文章
  • What makes an ad a cigarette ad? Commercial tobacco imagery in the lesbian, gay, and bisexual press.

    abstract:OBJECTIVES:To determine the extent of commercial tobacco imagery in the lesbian, gay, and bisexual (LGB) press. METHODS:Content analysis of all advertising containing tobacco related text or imagery in 20 LGB community periodicals, published between January 1990 and December 2000. RESULTS:3428 ads were found: 689 tob...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2005.038760

    authors: Smith EA,Offen N,Malone RE

    更新日期:2005-12-01 00:00:00

  • Association of age and social class with suicide among men in Great Britain.

    abstract:STUDY OBJECTIVE:The aim was to investigate suicide and "undetermined" deaths by age, economic activity status, and social class in Great Britain among males of working age. DESIGN:The study was a cross sectional analysis of Registrar General's data for England and Wales around 1981, repeated for around 1971, and for S...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.45.3.195

    authors: Kreitman N,Carstairs V,Duffy J

    更新日期:1991-09-01 00:00:00

  • How well can poor child health and development be predicted by data collected in early childhood?

    abstract:BACKGROUND:Identifying children at risk of poor developmental outcomes remains a challenge, but is important for better targeting children who may benefit from additional support. We explored whether data routinely collected in early life predict which children will have language disability, overweight/obesity or behav...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2018-211028

    authors: Straatmann VS,Pearce A,Hope S,Barr B,Whitehead M,Law C,Taylor-Robinson D

    更新日期:2018-12-01 00:00:00

  • Economic determinants of diet in older adults: systematic review.

    abstract:BACKGROUND AND AIMS:Many economic factors are associated with diet, yet the evidence is generally cross-sectional. Older people are considered especially vulnerable to poor diets from negative changes to varied economic factors. This review extends current knowledge on known correlates to decipher actual economic deter...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章,评审

    doi:10.1136/jech-2013-202513

    authors: Conklin AI,Maguire ER,Monsivais P

    更新日期:2013-09-01 00:00:00

  • The changing contribution of smoking to educational differences in life expectancy: indirect estimates for Finnish men and women from 1971 to 2010.

    abstract:BACKGROUND:We estimated the contribution of smoking to educational differences in mortality and life expectancy between 1971 and 2010 in Finland. METHODS:Eight prospective datasets with baseline in 1970, 1975, 1980, 1985, 1990, 1995, 2000 and 2005 and each linked to a 5-year mortality follow-up were used. We calculate...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2012-201266

    authors: Martikainen P,Ho JY,Preston S,Elo IT

    更新日期:2013-03-01 00:00:00

  • Coding the Everyday Discrimination Scale: implications for exposure assessment and associations with hypertension and depression among a cross section of mid-life African American women.

    abstract:BACKGROUND:Studies suggest that racial discrimination impacts health via biological dysregulation due to continual adaptation to chronic psychosocial stress. Therefore, quantifying chronicity is critical for operationalising the relevant aetiological exposure and hence maximising internal validity. Using one of the mos...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2018-211230

    authors: Michaels E,Thomas M,Reeves A,Price M,Hasson R,Chae D,Allen A

    更新日期:2019-06-01 00:00:00

  • Child sexual abuse and links to HIV and orphanhood in urban Zimbabwe.

    abstract:BACKGROUND:Evidence of a link between sexual violence and HIV is growing; however, studies among children are scarce. The authors sought to characterise child sexual abuse in Harare, Zimbabwe, and explore its links with HIV and orphanhood. METHODS:Records for new clients attending a child sexual abuse clinic from July...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2009.094359

    authors: Birdthistle IJ,Floyd S,Mwanasa S,Nyagadza A,Gwiza E,Glynn JR

    更新日期:2011-12-01 00:00:00

  • Impact of chronic pain on health care seeking, self care, and medication. Results from a population-based Swedish study.

    abstract:STUDY OBJECTIVE:To explore individual and social factors that could predict health care utilisation and medication among people with chronic pain in an unselected population. DESIGN:A mailed survey with questions about pain and mental symptoms, disability, self care action, visits to health care providers, and medicat...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.53.8.503

    authors: Andersson HI,Ejlertsson G,Leden I,Scherstén B

    更新日期:1999-08-01 00:00:00

  • Serum 25-hydroxyvitamin D3 and the risk of pneumonia in an ageing general population.

    abstract:BACKGROUND:Vitamin D has been suggested to have a role in infection defence and on the immune system. We therefore investigated the effect of serum 25-hydroxyvitamin D₃ (25(OH)D₃) on the risk of incident hospitalised pneumonia in an ageing general population in eastern Finland. METHODS:The study population included 72...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2012-202027

    authors: Aregbesola A,Voutilainen S,Nurmi T,Virtanen JK,Ronkainen K,Tuomainen TP

    更新日期:2013-06-01 00:00:00

  • Are fetal growth impairment and preterm birth causally related to child attention problems and ADHD? Evidence from a comparison between high-income and middle-income cohorts.

    abstract:BACKGROUND:Cross-cohort comparison is an established method for improving causal inference. This study compared 2 cohorts, 1 from a high-income country and another from a middle-income country, to (1) establish whether birth exposures may play a causal role in the development of childhood attention problems; and (2) id...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2015-206222

    authors: Murray E,Pearson R,Fernandes M,Santos IS,Barros FC,Victora CG,Stein A,Matijasevich A

    更新日期:2016-07-01 00:00:00

  • Long-term cardiovascular consequences of Rose angina at age 20-54 years: 29-years' follow-up of the Tromsø Study.

    abstract:BACKGROUND:The Rose Angina Questionnaire (RAQ) was constructed in the 1960s for assessing the population burden of angina. Studies have found that screening positivity by RAQ conferred an elevated risk of coronary heart disease (CHD). It is, however, not clear to what extent Rose angina represents early CHD in relative...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2013-203642

    authors: Graff-Iversen S,Wilsgaard T,Mathiesen EB,Njølstad I,Løchen ML

    更新日期:2014-08-01 00:00:00

  • How complete and accurate are cancer registrations notified by the National Health Service Central Register for England and Wales?

    abstract:STUDY OBJECTIVE:To assess the completeness and accuracy of notification of cancers by the National Health Service Central Register (NHSCR) for England and Wales. DESIGN:Comparison of 720 cancer registrations ascertained from NHSCR up to May 1999 with those ascertained for the same cohort from six other sources and a p...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.55.6.414

    authors: Dickinson HO,Salotti JA,Birch PJ,Reid MM,Malcolm A,Parker L

    更新日期:2001-06-01 00:00:00

  • The real ecological fallacy: epidemiology and global climate change.

    abstract::Prompted by my participation in the People's Climate March held in New York City on 21 September 2014, as part of the 'Harvard Divest' contingent, in this brief essay I reflect on the late 20th century development of--and debates over--the necessity of ecological thinking in epidemiology, and also the still limited en...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2014-205027

    authors: Krieger N

    更新日期:2015-08-01 00:00:00

  • Home ownership and mortality: a register-based follow-up study of 300,000 Finns.

    abstract:BACKGROUND:This study examined whether living in rented housing is associated with increased all-cause and cause-specific mortality, and whether the association between home ownership and mortality can be explained by household income, occupational class, and educational level. METHODS:A random sample including every ...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2007.061309

    authors: Laaksonen M,Martikainen P,Nihtilä E,Rahkonen O,Lahelma E

    更新日期:2008-04-01 00:00:00

  • Randomised controlled trial of anti-smoking advice in pregnancy. 1977.

    abstract::In a randomised controlled trial intensive individual anti-smoking advice given in parallel with hospital antenatal care did not influence the outcome of pregnancy. The belief that retardation of fetal growth caused by maternal smoking occurs in late pregnancy is not well based, and the advice may not have been given ...

    journal_title:Journal of epidemiology and community health

    pub_type: 传,古典文章,历史文章,杂志文章

    doi:10.1136/jech.50.3.232

    authors: Donovan JW

    更新日期:1996-06-01 00:00:00

  • A mathematical model of a heroin epidemic: implications for control policies.

    abstract::An exponential model based on the infectious disease model of Kermack and McKendrick has been simplified to illustrate how the use of heroin spreads in epidemic fashion. A numerical simulation is arranged to show how the dynamics of spread are influenced by the original number of users, rates of conversion, and time o...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.33.4.299

    authors: Mackintosh DR,Stewart GT

    更新日期:1979-12-01 00:00:00

  • Impact of the demerit point system on road traffic accident mortality in Spain.

    abstract:BACKGROUND:To assess the effect of the Demerit Point System (DPS), introduced in Spain on 1 July 2006, on the number of fatalities due to road traffic accidents, using a methodology that controls for the seasonal variation and trend in the data series. METHODS:Time-series analysis by ARIMA models of 29 113 fatalities ...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2008.082461

    authors: Pulido J,Lardelli P,de la Fuente L,Flores VM,Vallejo F,Regidor E

    更新日期:2010-03-01 00:00:00

  • Impairments, disabilities and needs assessment among non-fatal war injuries in south Lebanon, Grapes of Wrath, 1996.

    abstract:STUDY OBJECTIVE:To examine the impact of non-fatal war related injuries on physical disability in a group of war wounded civilians and to assess their needs. DESIGN:Cross sectional study. Home interviews were conducted using a structured interview schedule around one month after the injury, to assess impairments, disa...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.54.1.35

    authors: Mehio Sibai A,Sameer Shaar N,el Yassir S

    更新日期:2000-01-01 00:00:00

  • Twinning rates in Tamilnadu.

    abstract::A prospective study of human reproduction was conducted in Tamilnadu State, South India, from 1969 to 1975. This paper reports twinning rates and relates these to maternal age, parity, and consanguinity. Birth weights and other dimensions at birth and infant mortality are also studied. The overall twinning rate was 1 ...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.37.2.117

    authors: Rao PS,Inbaraj SG,Muthurathnam S

    更新日期:1983-06-01 00:00:00

  • Alcohol misuse in older people: heavy consumption and protean presentations.

    abstract:BACKGROUND:Alcohol misuse, especially binge drinking in young people, and alcoholic liver disease are major public health concerns. However, alcohol misuse in older people is underestimated and often goes undetected. OBJECTIVE:To document alcohol consumption and clinical presentation of alcohol misuse in hospital inpa...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2005.043653

    authors: Mehta MM,Moriarty KJ,Proctor D,Bird M,Darling W

    更新日期:2006-12-01 00:00:00

  • An investigation of risk factors for symptomatic osteoarthritis of the knee in women using a life course approach.

    abstract:STUDY OBJECTIVE:To explore risk factors for symptomatic knee osteoarthritis (OAK) in women, which included wearing high heeled shoes. DESIGN:Matched case-control study. Exposure information obtained by interview, included details about past footwear. Self reported weight and height data obtained representing when wome...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.57.10.823

    authors: Dawson J,Juszczak E,Thorogood M,Marks SA,Dodd C,Fitzpatrick R

    更新日期:2003-10-01 00:00:00

  • Climate change effects on human health: projections of temperature-related mortality for the UK during the 2020s, 2050s and 2080s.

    abstract:BACKGROUND:The most direct way in which climate change is expected to affect public health relates to changes in mortality rates associated with exposure to ambient temperature. Many countries worldwide experience annual heat-related and cold-related deaths associated with current weather patterns. Future changes in cl...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech-2013-202449

    authors: Hajat S,Vardoulakis S,Heaviside C,Eggen B

    更新日期:2014-07-01 00:00:00

  • Green space, urbanity, and health: how strong is the relation?

    abstract:STUDY OBJECTIVES:To investigate the strength of the relation between the amount of green space in people's living environment and their perceived general health. This relation is analysed for different age and socioeconomic groups. Furthermore, it is analysed separately for urban and more rural areas, because the stren...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2005.043125

    authors: Maas J,Verheij RA,Groenewegen PP,de Vries S,Spreeuwenberg P

    更新日期:2006-07-01 00:00:00

  • A two-county comparison of the HOUSES index on predicting self-rated health.

    abstract:BACKGROUND:Mortality, incidence of most diseases, and prevalence of adverse health behaviours follow an inverse gradient with social class. Many proxies for socioeconomic status (SES) exist; however, each bears a different relation to health outcomes, probably following a different aetiological pathway. Additionally, d...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2008.084723

    authors: Butterfield MC,Williams AR,Beebe T,Finnie D,Liu H,Liesinger J,Sloan J,Wheeler PH,Yawn B,Juhn YJ

    更新日期:2011-03-01 00:00:00

  • Ethnic differences in perinatal mortality--a challenge.

    abstract::The perinatal mortality rates of mothers who delivered at St. Thomas's Hospital from 1969 to 1976 have been examined. The rate in the West Indian population was significant higher than in the United Kingdom white population. The increased West Indian mortality was confined to infants with a birth weight of more than 2...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.36.1.22

    authors: Robinson MJ,Palmer SR,Avery A,James CE,Beynon JL,Taylor RW

    更新日期:1982-03-01 00:00:00

  • Unfairness and health: evidence from the Whitehall II Study.

    abstract:OBJECTIVE:To examine the effects of unfairness on incident coronary events and health functioning. DESIGN:Prospective cohort study. Unfairness, sociodemographics, established coronary risk factors (high serum cholesterol, hypertension, obesity, exercise, smoking and alcohol consumption) and other psychosocial work cha...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2006.052563

    authors: De Vogli R,Ferrie JE,Chandola T,Kivimäki M,Marmot MG

    更新日期:2007-06-01 00:00:00

  • A simpler tool for estimation of HIV incidence from cross-sectional, age-specific prevalence data.

    abstract:BACKGROUND:HIV incidence estimates are crucial in understanding and predicting the HIV/AIDS epidemic and identifying sub-populations and regions most at risk for the epidemic. However, incidence estimation is a challenge due to the nature of the disease and type of data available. This paper aims to present a simple an...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2009.091959

    authors: Rajan SS,Sokal D

    更新日期:2011-02-01 00:00:00

  • Child supervision practices for drowning prevention in rural Bangladesh: a pilot study of supervision tools.

    abstract:BACKGROUND:Injuries are an increasing child health concern and have become a leading cause of child mortality in the 1-4 years age group in many developing countries, including Bangladesh. METHODS:Household observations during 9 months of a community-based pilot of two supervision tools-a door barrier and a playpen-de...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2008.080903

    authors: Callaghan JA,Hyder AA,Khan R,Blum LS,Arifeen S,Baqui AH

    更新日期:2010-07-01 00:00:00

  • Comparison of food constituents in the diet of female agricultural workers in Japan with high and low concentrations of high density lipoprotein in their sera.

    abstract::Over 300 female farmers from 18 regions in various parts of Japan were examined for high density lipoprotein cholesterol (HDL) in the serum. Based on the HDL levels, three examinees with the highest HDL and another three with the lowest HDL were selected from each region to form the high HDL group (high group, 54 subj...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.39.3.259

    authors: Chiba K,Miyasaka M,Koizumi A,Kumai M,Watanabe T,Ikeda M

    更新日期:1985-09-01 00:00:00

  • A comparison of coronary heart disease event rates among urban Australian Aboriginal people and a matched non-Aboriginal population.

    abstract:BACKGROUND:Age-specific death from cardiovascular disease among Australian Aboriginals is estimated to be four to seven times that of general population, and the major cause of premature death. There is little reliable information on the incidence of coronary heart disease (CHD). This study compares CHD event rates in ...

    journal_title:Journal of epidemiology and community health

    pub_type: 杂志文章

    doi:10.1136/jech.2009.098343

    authors: Bradshaw PJ,Alfonso HS,Finn J,Owen J,Thompson PL

    更新日期:2011-04-01 00:00:00