Creating a surrogate commuter network from Australian Bureau of Statistics census data.

Abstract:

:Between the 2011 and 2016 national censuses, the Australian Bureau of Statistics changed its anonymity policy compliance system for the distribution of census data. The new method has resulted in dramatic inconsistencies when comparing low-resolution data to aggregated high-resolution data. Hence, aggregated totals do not match true totals, and the mismatch gets worse as the data resolution gets finer. Here, we address several aspects of this inconsistency with respect to the 2016 usual-residence to place-of-work travel data. We introduce a re-sampling system that rectifies many of the artifacts introduced by the new ABS protocol, ensuring a higher level of consistency across partition sizes. We offer a surrogate high-resolution 2016 commuter dataset that reduces the difference between the aggregated and true commuter totals from ~34% to only ~7%, which is on the order of the discrepancy across partition resolutions in data from earlier years.

journal_name

Sci Data

journal_title

Scientific data

authors

Fair KM,Zachreson C,Prokopenko M

doi

10.1038/s41597-019-0137-z

subject

Has Abstract

pub_date

2019-08-16 00:00:00

pages

150

issue

1

issn

2052-4463

pii

10.1038/s41597-019-0137-z

journal_volume

6

pub_type

杂志文章
  • Draft genome of the big-headed turtle Platysternon megacephalum.

    abstract::The big-headed turtle, Platysternon megacephalum, as the sole member of the monotypic family Platysternidae, has a number of distinct characteristics including an extra-large head, long tail, flat carapace, and a preference for low water temperature environments. We performed whole genome sequencing, assembly, and gen...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0067-9

    authors: Cao D,Wang M,Ge Y,Gong S

    更新日期:2019-05-16 00:00:00

  • Small-wedge synchrotron and serial XFEL datasets for Cysteinyl leukotriene GPCRs.

    abstract::Structural studies of challenging targets such as G protein-coupled receptors (GPCRs) have accelerated during the last several years due to the development of new approaches, including small-wedge and serial crystallography. Here, we describe the deposition of seven datasets consisting of X-ray diffraction images acqu...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00729-2

    authors: Marin E,Luginina A,Gusach A,Kovalev K,Bukhdruker S,Khorn P,Polovinkin V,Lyapina E,Rogachev A,Gordeliy V,Mishin A,Cherezov V,Borshchevskiy V

    更新日期:2020-11-12 00:00:00

  • Tracking vegetation phenology across diverse biomes using Version 2.0 of the PhenoCam Dataset.

    abstract::Monitoring vegetation phenology is critical for quantifying climate change impacts on ecosystems. We present an extensive dataset of 1783 site-years of phenological data derived from PhenoCam network imagery from 393 digital cameras, situated from tropics to tundra across a wide range of plant functional types, biomes...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0229-9

    authors: Seyednasrollah B,Young AM,Hufkens K,Milliman T,Friedl MA,Frolking S,Richardson AD

    更新日期:2019-10-22 00:00:00

  • The Coral Trait Database, a curated database of trait information for coral species from the global oceans.

    abstract::Trait-based approaches advance ecological and evolutionary research because traits provide a strong link to an organism's function and fitness. Trait-based research might lead to a deeper understanding of the functions of, and services provided by, ecosystems, thereby improving management, which is vital in the curren...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.17

    authors: Madin JS,Anderson KD,Andreasen MH,Bridge TC,Cairns SD,Connolly SR,Darling ES,Diaz M,Falster DS,Franklin EC,Gates RD,Harmer A,Hoogenboom MO,Huang D,Keith SA,Kosnik MA,Kuo CY,Lough JM,Lovelock CE,Luiz O,Martinelli J

    更新日期:2016-03-29 00:00:00

  • The systematic identification of cytoskeletal genes required for Drosophila melanogaster muscle maintenance.

    abstract::Animal muscles must maintain their function and structure while bearing substantial mechanical loads. How muscles withstand persistent mechanical strain is presently not well understood. Understanding the mechanisms by which tissues maintain their complex architecture is a key goal of cell biology. This dataset repres...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2014.2

    authors: Perkins AD,Lee MJ,Tanentzapf G

    更新日期:2014-03-11 00:00:00

  • dbPSP 2.0, an updated database of protein phosphorylation sites in prokaryotes.

    abstract::In prokaryotes, protein phosphorylation plays a critical role in regulating a broad spectrum of biological processes and occurs mainly on various amino acids, including serine (S), threonine (T), tyrosine (Y), arginine (R), aspartic acid (D), histidine (H) and cysteine (C) residues of protein substrates. Through liter...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0506-7

    authors: Shi Y,Zhang Y,Lin S,Wang C,Zhou J,Peng D,Xue Y

    更新日期:2020-05-29 00:00:00

  • Long-term observation of amphibian populations inhabiting urban and forested areas in Yekaterinburg, Russia.

    abstract::This article presents data derived from a 36 year-long uninterrupted observational study of amphibian populations living in the city and vicinity of Yekaterinburg, Russia. This area is inhabited by six amphibian species. Based on a degree of anthropogenic transformation, the urban territory is divided into five highly...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.18

    authors: Vershinin VL,Vershinina SD,Berzin DL,Zmeeva DV,Kinev AV

    更新日期:2015-05-12 00:00:00

  • The Centennial Trends Greater Horn of Africa precipitation dataset.

    abstract::East Africa is a drought prone, food and water insecure region with a highly variable climate. This complexity makes rainfall estimation challenging, and this challenge is compounded by low rain gauge densities and inhomogeneous monitoring networks. The dearth of observations is particularly problematic over the past ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.50

    authors: Funk C,Nicholson SE,Landsfeld M,Klotter D,Peterson P,Harrison L

    更新日期:2015-09-29 00:00:00

  • Two-colour serial femtosecond crystallography dataset from gadoteridol-derivatized lysozyme for MAD phasing.

    abstract::We provide a detailed description of a gadoteridol-derivatized lysozyme (gadolinium lysozyme) two-colour serial femtosecond crystallography (SFX) dataset for multiple wavelength anomalous dispersion (MAD) structure determination. The data was collected at the Spring-8 Angstrom Compact free-electron LAser (SACLA) facil...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.188

    authors: Gorel A,Motomura K,Fukuzawa H,Doak RB,Grünbein ML,Hilpert M,Inoue I,Kloos M,Nass Kovács G,Nango E,Nass K,Roome CM,Shoeman RL,Tanaka R,Tono K,Foucar L,Joti Y,Yabashi M,Iwata S,Ueda K,Barends TRM,Schlichting I

    更新日期:2017-12-12 00:00:00

  • MatchingLand, geospatial data testbed for the assessment of matching methods.

    abstract::This article presents datasets prepared with the aim of helping the evaluation of geospatial matching methods for vector data. These datasets were built up from mapping data produced by official Spanish mapping agencies. The testbed supplied encompasses the three geometry types: point, line and area. Initial datasets ...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.180

    authors: Xavier EMA,Ariza-López FJ,Ureña-Cámara MA

    更新日期:2017-12-05 00:00:00

  • One millennium of historical freshwater fish occurrence data for Portuguese rivers and streams.

    abstract::The insights that historical evidence of human presence and man-made documents provide are unique. For example, using historical data may be critical to adequately understand the ecological requirements of species. However, historical information about freshwater species distribution remains largely a knowledge gap. I...

    journal_title:Scientific data

    pub_type: 历史文章,杂志文章

    doi:10.1038/sdata.2018.163

    authors: Duarte G,Moreira M,Branco P,da Costa L,Ferreira MT,Segurado P

    更新日期:2018-08-14 00:00:00

  • High resolution annual average air pollution concentration maps for the Netherlands.

    abstract::Long-term exposure to air pollution is considered a major public health concern and has been related to overall mortality and various diseases such as respiratory and cardiovascular disease. Due to the spatial variability of air pollution concentrations, assessment of individual exposure to air pollution requires spat...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2019.35

    authors: Schmitz O,Beelen R,Strak M,Hoek G,Soenario I,Brunekreef B,Vaartjes I,Dijst MJ,Grobbee DE,Karssenberg D

    更新日期:2019-03-12 00:00:00

  • I-BLEND, a campus-scale commercial and residential buildings electrical energy dataset.

    abstract::Efficient energy consumption at the building level is vital for sustainability. Providing energy efficient systems and solutions requires an understanding of how energy gets consumed. However, there is a general lack of large-scale open datasets about the energy consumption of buildings, which hinders the research. Th...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2019.15

    authors: Rashid H,Singh P,Singh A

    更新日期:2019-02-19 00:00:00

  • De novo transcriptome assembly databases for the butterfly orchid Phalaenopsis equestris.

    abstract::Orchids are renowned for their spectacular flowers and ecological adaptations. After the sequencing of the genome of the tropical epiphytic orchid Phalaenopsis equestris, we combined Illumina HiSeq2000 for RNA-Seq and Trinity for de novo assembly to characterize the transcriptomes for 11 diverse P. equestris tissues r...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.83

    authors: Niu SC,Xu Q,Zhang GQ,Zhang YQ,Tsai WC,Hsu JL,Liang CK,Luo YB,Liu ZJ

    更新日期:2016-09-27 00:00:00

  • RE-Europe, a large-scale dataset for modeling a highly renewable European electricity system.

    abstract::Future highly renewable energy systems will couple to complex weather and climate dynamics. This coupling is generally not captured in detail by the open models developed in the power and energy system communities, where such open models exist. To enable modeling such a future energy system, we describe a dedicated la...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.175

    authors: Jensen TV,Pinson P

    更新日期:2017-11-28 00:00:00

  • High resolution multi-facies realizations of sedimentary reservoir and aquifer analogs.

    abstract::Geological structures are by nature inaccessible to direct observation. This can cause difficulties in applications where a spatially explicit representation of such structures is required, in particular when modelling fluid migration in geological formations. An increasing trend in recent years has been to use analog...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2015.33

    authors: Bayer P,Comunian A,Höyng D,Mariethoz G

    更新日期:2015-07-07 00:00:00

  • A microarray whole-genome gene expression dataset in a rat model of inflammatory corneal angiogenesis.

    abstract::In angiogenesis with concurrent inflammation, many pathways are activated, some linked to VEGF and others largely VEGF-independent. Pathways involving inflammatory mediators, chemokines, and micro-RNAs may play important roles in maintaining a pro-angiogenic environment or mediating angiogenic regression. Here, we des...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.103

    authors: Mukwaya A,Lindvall JM,Xeroudaki M,Peebo B,Ali Z,Lennikov A,Jensen LD,Lagali N

    更新日期:2016-11-22 00:00:00

  • Survey-based socio-economic data from slums in Bangalore, India.

    abstract::In 2010, an estimated 860 million people were living in slums worldwide, with around 60 million added to the slum population between 2000 and 2010. In 2011, 200 million people in urban Indian households were considered to live in slums. In order to address and create slum development programmes and poverty alleviation...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2017.200

    authors: Roy D,Palavalli B,Menon N,King R,Pfeffer K,Lees M,Sloot PMA

    更新日期:2018-01-09 00:00:00

  • Time series of heat demand and heat pump efficiency for energy system modeling.

    abstract::With electric heat pumps substituting for fossil-fueled alternatives, the temporal variability of their power consumption becomes increasingly important to the electricity system. To easily include this variability in energy system analyses, this paper introduces the "When2Heat" dataset comprising synthetic national t...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0199-y

    authors: Ruhnau O,Hirth L,Praktiknjo A

    更新日期:2019-10-01 00:00:00

  • A functional trait database for Mediterranean Basin plants.

    abstract::Functional trait databases are emerging as crucial tools for a wide range of ecological studies across the world. Here, we provide a database of functional traits for vascular plant species of the Mediterranean Basin. The database includes 25,764 individual records of 44 traits from 2,457 plant taxa distributed in 119...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.135

    authors: Tavşanoğlu Ç,Pausas JG

    更新日期:2018-07-10 00:00:00

  • Draft genome of Bugula neritina, a colonial animal packing powerful symbionts and potential medicines.

    abstract::Many animal phyla have no representatives within the catalog of whole metazoan genome sequences. This dataset fills in one gap in the genome knowledge of animal phyla with a draft genome of Bugula neritina (phylum Bryozoa). Interest in this species spans ecology and biomedical sciences because B. neritina is the natur...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00684-y

    authors: Rayko M,Komissarov A,Kwan JC,Lim-Fong G,Rhodes AC,Kliver S,Kuchur P,O'Brien SJ,Lopez JV

    更新日期:2020-10-20 00:00:00

  • A data citation roadmap for scholarly data repositories.

    abstract::This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major science policy bodies. The roadmap was developed by the Repositories Expert Group, as p...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0031-8

    authors: Fenner M,Crosas M,Grethe JS,Kennedy D,Hermjakob H,Rocca-Serra P,Durand G,Berjon R,Karcher S,Martone M,Clark T

    更新日期:2019-04-10 00:00:00

  • A synthesis of bacterial and archaeal phenotypic trait data.

    abstract::A synthesis of phenotypic and quantitative genomic traits is provided for bacteria and archaea, in the form of a scripted, reproducible workflow that standardizes and merges 26 sources. The resulting unified dataset covers 14 phenotypic traits, 5 quantitative genomic traits, and 4 environmental characteristics for app...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-0497-4

    authors: Madin JS,Nielsen DA,Brbic M,Corkrey R,Danko D,Edwards K,Engqvist MKM,Fierer N,Geoghegan JL,Gillings M,Kyrpides NC,Litchman E,Mason CE,Moore L,Nielsen SL,Paulsen IT,Price ND,Reddy TBK,Richards MA,Rocha EPC,Schmidt

    更新日期:2020-06-05 00:00:00

  • A test-retest dataset for assessing long-term reliability of brain morphology and resting-state brain activity.

    abstract::We present a test-retest dataset for evaluation of long-term reliability of measures from structural and resting-state functional magnetic resonance imaging (sMRI and rfMRI) scans. The repeated scan dataset was collected from 61 healthy adults in two sessions using highly similar imaging parameters at an interval of 1...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2016.16

    authors: Huang L,Huang T,Zhen Z,Liu J

    更新日期:2016-03-15 00:00:00

  • Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes.

    abstract::The zooplankter Calanus finmarchicus is a member of the so-called "Calanus Complex", a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher lati...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00751-4

    authors: Payton L,Noirot C,Hoede C,Hüppe L,Last K,Wilcockson D,Ershova EA,Valière S,Meyer B

    更新日期:2020-11-24 00:00:00

  • A dataset of distribution and diversity of ticks in China.

    abstract::While tick-borne zoonoses, such as Lyme disease and tick-borne encephalitis, present an increasing global concern, knowledge of their vectors' distribution remains limited, especially for China. In this paper, we present the first comprehensive dataset of known tick species and their distributions in China, derived fr...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-019-0115-5

    authors: Zhang G,Zheng D,Tian Y,Li S

    更新日期:2019-07-01 00:00:00

  • Complementary proteomics strategies capture an ataxin-1 interactome in Neuro-2a cells.

    abstract::Ataxin-1 mutation, arising from a polyglutamine (polyQ) tract expansion, is the underlying genetic cause of the late-onset neurodegenerative disease Spinocerebellar ataxia type 1 (SCA1). To identify protein partners of polyQ-ataxin-1 in neuronal cells under control or stress conditions, here we report our complementar...

    journal_title:Scientific data

    pub_type:

    doi:10.1038/sdata.2018.262

    authors: Zhang S,Williamson NA,Bogoyevitch MA

    更新日期:2018-11-20 00:00:00

  • A database of marine larval fish assemblages in Australian temperate and subtropical waters.

    abstract::Larval fishes are a useful metric of marine ecosystem state and change, as well as species-specific patterns in phenology. The high level of taxonomic expertise required to identify larval fishes to species level, and the considerable effort required to collect samples, make these data very valuable. Here we collate 3...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.207

    authors: Smith JA,Miskiewicz AG,Beckley LE,Everett JD,Garcia V,Gray CA,Holliday D,Jordan AR,Keane J,Lara-Lopez A,Leis JM,Matis PA,Muhling BA,Neira FJ,Richardson AJ,Smith KA,Swadling KM,Syahailatua A,Taylor MD,van Ruth PD,W

    更新日期:2018-10-16 00:00:00

  • Long-term surveys of age structure in 13 ungulate and one ostrich species in the Serengeti, 1926-2018.

    abstract::The Serengeti ecosystem spans an extensive network of protected areas in Tanzania, eastern Africa, and a UNESCO Wold Heritage Site. It is home to some of the largest animal migrations on the planet. Here, we describe a dataset consisting of the sample counts of three age classes (infant, juvenile and adult) of 13 ungu...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/s41597-020-00701-0

    authors: Rogy P,Sinclair ARE

    更新日期:2020-10-21 00:00:00

  • The effects of sequencing platforms on phylogenetic resolution in 16 S rRNA gene profiling of human feces.

    abstract::High-quality and high-throughput sequencing technologies are required for therapeutic and diagnostic analyses of human gut microbiota. Here, we evaluated the advantages and disadvantages of the various commercial sequencing platforms for studying human gut microbiota. We generated fecal bacterial sequences from 170 Ko...

    journal_title:Scientific data

    pub_type: 杂志文章

    doi:10.1038/sdata.2018.68

    authors: Whon TW,Chung WH,Lim MY,Song EJ,Kim PS,Hyun DW,Shin NR,Bae JW,Nam YD

    更新日期:2018-04-24 00:00:00