Managing Data for Scholarly Communications, Part Two: Technical Management

Webinar

About the Webinar

The explosion of data creation across all scholarly disciplines necessitates corresponding efforts to create new solutions for its management and use. Ever-growing repositories and datasets within require organization, identification, description, publication, discovery, citation, preservation, and curation to allow these materials to realize their potential in support of data-driven, often interdisciplinary research. What infrastructures and technical environments are required for this work? Can new approaches, specifications, standards and best practices be created? Are there partnerships and collaborations that exist or can be pursued? This webinar, Part 2 of a two-part NISO series on data, will explore these and other questions.

To view the agenda for Part 1 of this webinar, click here.

Event Sessions

Introduction

Speaker

Dataset Identification & Citation: DataCite and EZID

Speaker

Joan Starr

Chair of the Metadata Working Group at DataCite, and Strategic and Project Planning Manager for the California Digital Library
DataCite / California Digital Library

One of the challenges researchers face is consistently finding, citing, and getting credit for their research datasets. Persistent identifiers, and the establishment of best practices for citing well-preserved datasets, offer a solution to this problem. Joan will introduce the work of DataCite, an international organization that works to improve the scholarly infrastructure around datasets, and EZID, a service built for simple identifier management, including DataCite DOIs, other long-term identifiers.

Digital Library Services in the Cloud

Speaker

Mark McFarland

Texas Digital Library and Associate Director for Digital Initiatives, University of Texas Libraries
University of Texas Libraries

The Texas Digital Library (TDL) is a consortium of higher education institutions in Texas that provides shared services in support of research and teaching. For his presentation, Mark will describe how TDL recently moved a variety of services to the Amazon cloud. The primary reason for TDL's initial work using cloud services was as a DRP (disaster recovery plan) for a planned data center move. The centerpiece of this DRP was the duplication of services in the Amazon EC2 cloud (Amazon 2011b) -- an extension of the idea of having an off-site backup for data -- as in the cloud duplication case, TDL also has an off-site backup of the services themselves. Other motivations were that the elastic nature of cloud services (both compute and storage) would allow TDL to pay only for what it actually uses, as opposed to using its own hardware, which incurs cost regardless of utilization. Finally, TDL also envisioned a potential personnel savings: owning hardware requires an organization to staff competencies (e.g., network architecture expertise) that it could effectively outsource by using cloud services. Even if staffing these competencies were still necessary, the resources committed to those skills could potentially be reduced.

Data Governance and Legal Interoperability

Speaker

Sharing and integrating data are common requirements of research projects but are often difficult for a variety of technical, cultural, policy and legal reasons. Data governance is the system of decision rights and accountabilities that describe who can take what actions with what data, and when, under what circumstances, using what methods. It includes laws and policies associated with data, as well as strategies for data quality control and management, in the context of organizations. It includes the processes that insure important data are formally managed throughout an organization, including business processes and risk management. Data governance ensures that data can be trusted and that people can be made accountable for actions affecting the data. While the community is making progress on the technical and cultural issues, the data governance issues around data are only now being recognized and addressed.

Additional Information

  • Registration for the individual webinar closes at 12:00 pm Eastern on October 19, 2011. Cancellations made by October 12, 2011 will receive a refund, less a $20 cancellation fee. After that date, there are no refunds.
  • Registrants will receive detailed instructions about accessing the webinar via e-mail the Monday prior to the event. (Anyone registering between Monday and the close of registration will receive the message shortly after the registration is received, within normal business hours.) Due to the widespread use of spam blockers, filters, out of office messages, etc., it is your responsibility to contact the NISO office if you do not receive login instructions before the start of the webinar.
  • Registration is per site (access for one computer) and includes access to the online recorded archive of the webinar. If you are registering someone else from your organization, either use that person's e-mail address when registering or contact the NISO office to provide alternate contact information.
  • Webinar presentation slides and Q&A will be posted to the site following the live webinar.
  • Registrants will receive access information to the archived webinar following the event. An e-mail message containing archive access instructions will be sent approximately 48 hours after the event.