Data Management Plan
GUID: gov.noaa.nmfs.inport:17957 | Published / External
Data Management Plan
DMP Template v2.0.1 (2015-01-01)
Please provide the following information, and submit to the NOAA DM Plan Repository.Reference to Master DM Plan (if applicable)
As stated in Section IV, Requirement 1.3, DM Plans may be hierarchical. If this DM Plan inherits provisions from a higher-level DM Plan already submitted to the Repository, then this more-specific Plan only needs to provide information that differs from what was provided in the Master DM Plan.
1. General Description of Data to be Managed
Vibrio parahaemolyticus (Vp) is a marine bacterium capable of causing severe gastroenteritis in humans, usually through the consumption of raw shellfish. Before 1995, Vp-vibriosis was sporadic world-wide and caused by a relatively heterogeneous population of the bacterium. Since then, outbreaks have become more epidemic, with foci of infections traced to seafood harvested from single or geographically-linked sites. While initial outbreaks in Asia (and later in South America and the U.S. Gulf Coast region) have been attributed to a single serotypically-related pandemic clonal complex, other serotypes have been implicated in distinct geographical areas, including the Pacific Northwest and Alaska in the U.S.
Current risk assessment models are based on the presence of the virulence-associated genes tdh and trh, yet illnesses have been attributed to tdh- and/or trh- isolates. Previous phylogenetic studies have shown that Vp, like most Vibrio spp., is a genetically diverse species, and as yet there has been no definitive conclusion as to what genes are essential for virulence. Using phenotypic, genetic, and genomic comparison methods such as Multi-Locus Sequence Typing (MLST), we are examining the hypothesis that a set of highly-virulent clones of Vp with increased pathogenic potential have recently emerged in the PNW, and determining whether the emergence can be correlated with specific environmental parameters. MLST and other genotyping analyses of clinical and environmental Vp isolates from PNW sources demonstrate the extensive patterns of diversity as seen elsewhere. However, the majority of PNW strains obtained from human infections form a distinct clonal complex separate from most environmental isolates. Interestingly, many environmental isolates obtained from PNW sources are phylogenetically related to the pandemic clonal complex, but this group has not been associated with clinical infections in the region.
Genome sequences.
Notes: Only a maximum of 4000 characters will be included.
Notes: Data collection is considered ongoing if a time frame of type "Continuous" exists.
Notes: All time frames from all extent groups are included.
NWFSC Montlake: NWFSC Montlake lab, Seattle
Notes: All geographic areas from all extent groups are included.
(e.g., digital numeric data, imagery, photographs, video, audio, database, tabular data, etc.)
(e.g., satellite, airplane, unmanned aerial system, radar, weather station, moored buoy, research vessel, autonomous underwater vehicle, animal tagging, manual surveys, enforcement activities, numerical model, etc.)
2. Point of Contact for this Data Management Plan (author or maintainer)
Notes: The name of the Person of the most recent Support Role of type "Metadata Contact" is used. The support role must be in effect.
Notes: The name of the Organization of the most recent Support Role of type "Metadata Contact" is used. This field is required if applicable.
3. Responsible Party for Data Management
Program Managers, or their designee, shall be responsible for assuring the proper management of the data produced by their Program. Please indicate the responsible party below.
Notes: The name of the Person of the most recent Support Role of type "Data Steward" is used. The support role must be in effect.
4. Resources
Programs must identify resources within their own budget for managing the data they produce.
5. Data Lineage and Quality
NOAA has issued Information Quality Guidelines for ensuring and maximizing the quality, objectivity, utility, and integrity of information which it disseminates.
(describe or provide URL of description):
Lineage Statement:
variety of bioinformatics-based analyses
(describe or provide URL of description):
Sequence reads were filtered through SOLiD Accuracy Enhancement Tool (SAET) and PCR duplicated reads were removed using the fastq_nodup tool from the SEAStAR pipeline . Genomes were assembled in color-space using the CLC Assembly Cell version 4.1.
6. Data Documentation
The EDMC Data Documentation Procedural Directive requires that NOAA data be well documented, specifies the use of ISO 19115 and related standards for documentation of new data, and provides links to resources and tools for metadata creation and validation.
Missing/invalid information:
- 1.7. Data collection method(s)
(describe or provide URL of description):
7. Data Access
NAO 212-15 states that access to environmental data may only be restricted when distribution is explicitly limited by law, regulation, policy (such as those applicable to personally identifiable information or protected critical infrastructure information or proprietary trade information) or by security requirements. The EDMC Data Access Procedural Directive contains specific guidance, recommends the use of open-standard, interoperable, non-proprietary web services, provides information about resources and tools to enable data access, and includes a Waiver to be submitted to justify any approach other than full, unrestricted public access.
NA
Notes: The name of the Organization of the most recent Support Role of type "Distributor" is used. The support role must be in effect. This information is not required if an approved access waiver exists for this data.
Notes: This field is required if a Distributor has not been specified.
Notes: All URLs listed in the Distribution Info section will be included. This field is required if applicable.
Genbank, http://www.ncbi.nlm.nih.gov/genbank/, Accession numbers: AONA00000000
AOOV00000000
AOOW00000000
AOOX00000000
AOOY00000000
AOOZ00000000
AOPA00000000
AOPB00000000
AOPC00000000
AOPD00000000
AOPE00000000
AOPF00000000
AOPG00000000
AOOU00000000
AOPH00000000
AOPI00000000
AOPJ00000000
AOPK00000000
AOPL00000000
No Delay
Notes: This field is required if applicable.
8. Data Preservation and Protection
The NOAA Procedure for Scientific Records Appraisal and Archive Approval describes how to identify, appraise and decide what scientific records are to be preserved in a NOAA archive.
(Specify NCEI-MD, NCEI-CO, NCEI-NC, NCEI-MS, World Data Center (WDC) facility, Other, To Be Determined, Unable to Archive, or No Archiving Intended)
Notes: This field is required if archive location is World Data Center or Other.
Notes: This field is required if archive location is To Be Determined, Unable to Archive, or No Archiving Intended.
Notes: Physical Location Organization, City and State are required, or a Location Description is required.
Discuss data back-up, disaster recovery/contingency planning, and off-site data storage relevant to the data collection
Data is maintained in a US Government data repository for genetic sequence information (Genbank)
9. Additional Line Office or Staff Office Questions
Line and Staff Offices may extend this template by inserting additional questions in this section.