Loading…

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Wednesday, June 19
 

9:00am EDT

Coffee and Breakfast
Wednesday June 19, 2019 9:00am - 10:00am EDT
CGIS South Concourse Hallway

10:00am EDT

Welcome and Kickoff of Dataverse 2019 Developers and Users Workshops Day
Speakers
avatar for Danny Brooke

Danny Brooke

Dataverse Development Project Manager, Harvard University, IQSS
Danny joined IQSS in May of 2016 as a member of the Dataverse team.Before joining IQSS, he worked in Harvard University Information Technology, edtech startup BetterLesson, and the Florida Center for Reading Research. Danny holds a M.S. in Library and Information Studies from Florida... Read More →


Wednesday June 19, 2019 10:00am - 10:30am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

10:30am EDT

Dataverse and Curation Training for Librarians, and Curation Services
Come and learn about the the latest Dataverse release and features. Training will include hands-on activities using the demo.dataverse.org website. You will create your personal repository, customize your space with logos and links, and prepare your space for supporting your research data by enabling proper metadata and facets to increase discoverability of and management of your data. We will discuss not only data sharing on Dataverse but making sure your data are user friendly.

Speakers
avatar for Sonia Barbosa

Sonia Barbosa

Manager of Data Curation, Dataverse, IQSS, Harvard University


Wednesday June 19, 2019 10:30am - 12:00pm EDT
Belfer Case Study Room (Room S020)

10:30am EDT

Outreach Materials, Engaging Researchers, Installation Types
In this breakout session, we’ll discuss as a community the different types of Dataverse installations, ways to keep with new features, and how to engage with researchers about Dataverse in your community.

Speakers
avatar for Danny Brooke

Danny Brooke

Dataverse Development Project Manager, Harvard University, IQSS
Danny joined IQSS in May of 2016 as a member of the Dataverse team.Before joining IQSS, he worked in Harvard University Information Technology, edtech startup BetterLesson, and the Florida Center for Reading Research. Danny holds a M.S. in Library and Information Studies from Florida... Read More →


Wednesday June 19, 2019 10:30am - 12:00pm EDT
(Room S030)

12:00pm EDT

Lunch
Wednesday June 19, 2019 12:00pm - 1:00pm EDT
CGIS South Concourse Hallway

1:00pm EDT

Ideathon and Hackathon
In this breakout session, developers and non-developers alike can look forward to meeting other members of the community and discussing common problems and solutions. Newcomers welcome!

Speakers
GD

Gustavo Durand

Harvard Dataverse, IQSS


Wednesday June 19, 2019 1:00pm - 2:30pm EDT
(Room S030)

1:00pm EDT

Preservation Integrations
Over the last year, multiple preservation integrations have been introduced by the Dataverse Community. In this breakout session, we will discuss what’s already available for Dataverse installations and will expand on possibilities for further expansion in this area.

Speakers
avatar for Jonathan Crabtree

Jonathan Crabtree

Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University


Wednesday June 19, 2019 1:00pm - 2:30pm EDT
Belfer Case Study Room (Room S020)

2:30pm EDT

Coffee Break
Wednesday June 19, 2019 2:30pm - 3:00pm EDT
CGIS South Concourse Hallway

3:00pm EDT

Global Dataverse Community Consortium
In this breakout session, we will discuss the first year of the Global Dataverse Community Consortium (http://dataversecommunity.global), talk about its governance structure and about potential services and experts groups that could be coordinated by the Consortium. Jon and Mercè will lead a discussion where participants can provide feedback on the proposed ideas and bring new ideas.

Speakers
avatar for Jonathan Crabtree

Jonathan Crabtree

Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University


Wednesday June 19, 2019 3:00pm - 5:00pm EDT
Belfer Case Study Room (Room S020)

3:00pm EDT

Ideathon and Hackathon
In this breakout session, developers and non-developers alike can look forward to meeting other members of the community and discussing common problems and solutions. Newcomers welcome!

Speakers
GD

Gustavo Durand

Harvard Dataverse, IQSS


Wednesday June 19, 2019 3:00pm - 5:00pm EDT
(Room S030)

5:30pm EDT

Dataverse Cup (Soccer Social Event)
The 3rd Annual Dataverse Cup is June 19th at 5:30 at Harvard's Jordan Field (https://goo.gl/maps/Wtj7CwP4RX1FExSu6)! If you're interesting in playing for the Cup, please put your name on the sign up sheet:
https://docs.google.com/spreadsheets/d/1DK2Cpw_Og2TVt3g1XkhGOSp2BjAwzQkwnRY3IgsKLBo/edit?usp=sharing


Wednesday June 19, 2019 5:30pm - 7:00pm EDT
 
Thursday, June 20
 

9:00am EDT

Coffee and Breakfast
Thursday June 20, 2019 9:00am - 9:30am EDT
CGIS South Concourse Hallway

9:30am EDT

Welcome to Dataverse 2019 Plenary
Speakers
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University


Thursday June 20, 2019 9:30am - 9:40am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

9:40am EDT

Keynote: "Data in the Knowledge Ecosystem"
Bringing perspectives from both institutional and regional digital planning initiatives, Martha will present a view of data, in all its forms, at the core of today’s knowledge ecosystem. She will discuss surrounding layers of policies, infrastructure and services needed to facilitate data creation, use and preservation. As an example of a particular slice of this ecosystem, she will describe the evolution of the Portage research data management network in Canada, in the context of digital research infrastructure.

Speakers
avatar for Martha Whitehead

Martha Whitehead

University Librarian, Vice President for the Harvard Library, and Larsen Librarian for the Faculty of Arts and Sciences, Harvard University
Martha Whitehead is the new Vice President for the Harvard Library and University Librarian and Roy E. Larsen Librarian for the Faculty of Arts and Sciences. Prior to joining Harvard in June 2019, Martha was vice-provost (digital planning) and university librarian at Queen’s University... Read More →


Thursday June 20, 2019 9:40am - 10:30am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

10:30am EDT

Coffee Break
Thursday June 20, 2019 10:30am - 11:00am EDT
CGIS South Concourse Hallway

11:00am EDT

Dataverse Research and Product Updates
Speakers
avatar for Danny Brooke

Danny Brooke

Dataverse Development Project Manager, Harvard University, IQSS
Danny joined IQSS in May of 2016 as a member of the Dataverse team.Before joining IQSS, he worked in Harvard University Information Technology, edtech startup BetterLesson, and the Florida Center for Reading Research. Danny holds a M.S. in Library and Information Studies from Florida... Read More →
avatar for Tania Schlatter

Tania Schlatter

UX & UI Lead, IQSS, Harvard University
I lead UX & UI work at the Institute for Quantitative Social Science at Harvard University, and teach interaction and information design at Northeastern University. Previously, I co-owned Nimble Partners, a UX strategy and design consulting firm. We designed sites and conducted research... Read More →


Thursday June 20, 2019 11:00am - 12:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

12:00pm EDT

Consortium Updates
Speakers
avatar for Jonathan Crabtree

Jonathan Crabtree

Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University


Thursday June 20, 2019 12:00pm - 12:15pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

12:15pm EDT

Lunch
Thursday June 20, 2019 12:15pm - 1:30pm EDT
CGIS South Concourse Hallway

1:30pm EDT

Designing a Trusted Remote Storage Agent for Dataverse: NSF Impact Progress Report and Demo
ImPACT (Infrastructure for Privacy-Assured CompuTations)

Scientific progress today requires multi-institutional and cross-disciplinary sharing and analysis of data. Many disciplines, such as the social sciences, face a web of policies and technological constraints on data due to privacy concerns. Issues of privacy, safety, integrity, and ownership have led to regulations controlling data location, availability, movement, and access. Compliance poses obstacles to traditional data-processing practices and slows research; yet, increasingly, pressing scientific problems of great concern to society demand collaborative efforts involving data from multiple stakeholders.

We have designed a Trusted Remote Storage Agent for Dataverse that helps groups share metadata for sensitive data for access by researchers. Dataverse provides the discovery tools and the ImPACT project provides the cyber-infrastructure. This same work flow could also be leveraged for larger datasets that are too big to be migrated into Dataverse as well.

This presentation will give an update on the status as well as a demonstration of current progress.

Speakers
avatar for Jonathan Crabtree

Jonathan Crabtree

Assistant Director of Cyberinfrastructure, Odum Institute UNC Chapel Hill


Thursday June 20, 2019 1:30pm - 1:50pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

1:50pm EDT

Dataverse Internationalization Project: Future Goals & Sustainability
In 2019, Dataverse officially became a data repository platform that can be configured for use in multiple languages. The Dataverse Internationalization Project which is a multi-year collaboration primarily led by Scholars Portal, with support from les Bibliothèques de L'Université de Montréal, the Dataverse team at Harvard University, and community members, has led to the release of a new internationalized Dataverse framework that supports translation of the Dataverse interface and database fields for display. The project contributions are openly available as part of the Dataverse core releases and can be reused by any Dataverse installation. New language translations are being added by the global Dataverse community to support users across the world. This presentation will discuss project outcomes, workflows for community translators, and future goals to address sustainability for the Dataverse community.

Speakers
avatar for Amber Leahey

Amber Leahey

Librarian, Scholars Portal, Ontario Council of University Libraries


Thursday June 20, 2019 1:50pm - 2:10pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:10pm EDT

Dataverse in the European Open Science Cloud
One of the tasks of within the SSHOC project (Social Sciences and Humanities Open Cloud Project, https://www.sshopencloud.eu) is to offer a Dataverse service in the EOSC cloud, which will provide SSH institutions without a data repository service, such a facility for their designated communities.
Additional functionality will be developed to meet the needs of the European SSH community. This extra functionality may encompass: connection to localised PID providers, AAI solutions, API for domain specific Controlled Vocabularies and Thesauri, adaption of the UI to facilitate language localisation, improved support for domain specific metadata.

In this project developers from various institutes representing different European Research Infrastructures (CESSDA, CLARIN, DARIAH, E-RIHS) will work together to realise a SSHOC Dataverse network.

The project started on January 1 and will continue for 40 months.
In our presentation we will present our aims, challenges and approaches.

Speakers
avatar for Vyacheslav Tykhonov

Vyacheslav Tykhonov

Senior Information Scientist, Data Archiving and Networked Services (DANS)
Vyacheslav Tykhonov (Slava) is Senior Information Scientist at the Data Archiving and Networked Services (DANS), an institute of the Royal Netherlands Academy of Arts and Sciences (KNAW) and Dutch funding organization NWO. He is serving as lead developer of DataverseEU project coordinating... Read More →
avatar for Philipp Conzett

Philipp Conzett

Senior Research Librarian, UiT The Arctic University of Norway
Philipp is a Senior Research Librarian at UiT The Arctic University of Norway. In addition to be the subject librarian for Nordic and Finnish/Kven Languages and Literatures he works mostly with Open Science and Digital Humanities support at UiT. He is one of the developers and service... Read More →
avatar for Marion Wittenberg

Marion Wittenberg

Service Manager / Project Leader, DANS


Thursday June 20, 2019 2:10pm - 2:30pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:30pm EDT

Data Sharing Makes Teaching and Research Better
In this presentation, we will introduce Shanghai Huiyuan's data sharing platform, which is a platform sponsored by Shanghai Municipal Education Commission in 2011. Ten universities in Shanghai contribute their data and literature to the platform, and 1,500,000 faculty and students from thirty-nine universities in Shanghai share the data and literature resources in it.

We have a better mechanism to encourage the universities to share their distinguished data resources to the platform. Some data resources are rare and unique, such as Chines Ancient Rare Books, and Books and periodicals in Republic of China, local chronicles. The platform follows the FAIR principle, which makes data findable, accessible, integrable and reuseable.

Shanghai Municipal Education Commission makes all the universities in Shanghai can share search and download resources from the platform by using the InterUnvi-account.

Via the platform, Faculties and students in Shanghai academic universities can using other's universities' data resources very conveniently. We will tell some stories about data sharing makes teaching and research better.

Speakers
avatar for Yin Shenqin

Yin Shenqin

Director of Science Data Center, Fudan University


Thursday June 20, 2019 2:30pm - 2:45pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:45pm EDT

Using the Dataverse for Homework Assignments: Successes and Failures
In Gov 1006: Models, students were required to use the Dataverse in 4 different homework assignments and in their final projects. This worked well in some dimensions and poorly on others. The failures were severe enough that, in the absence of significant changes, I will not be able to use the Dataverse nearly as much in 2020. I will discuss what changes in the Dataverse would help better enable its use for undergraduate and graduate instruction.

Speakers
DK

David Kane

Preceptor of Statistical Methods and Mathematics, Department of Government, Harvard University


Thursday June 20, 2019 2:45pm - 3:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

3:00pm EDT

Coffee Break
Thursday June 20, 2019 3:00pm - 3:30pm EDT
CGIS South Concourse Hallway

3:30pm EDT

Keynote: "Data Management Initiatives at NIST and Other US Government Agencies"
The National Institute of Standards and Technology has a diverse intramural research program with equally diverse data acquisition methods, data file formats, and data sharing and dissemination methods.  With the creation of the Office of Data and Informatics five years ago, however, we have made substantial progress in improving practices in data management that embrace the entire data lifecycle and support the global FAIR* initiative.  We have implemented a new NIST-wide research data repository that hosts the majority of NIST’s public research data using Amazon Web Services as a platform.  We have also piloted the use of Laboratory Information Management Systems (LIMS) for better metadata tagging of instrumental data and we have several hundred researchers using electronic laboratory notebooks.  NIST recently invested in a dedicated computer system to support machine learning and AI research, however, effective use of this system relies upon conscientious practices in data management.  NIST also participates in an interagency Subcommittee on Open Science, convened by the Office of Science and Technology Policy, that is discussing ways to improve access to US Government data assets.
 
*Making data Findable, Accessible, Interoperable, and Reusable.

Speakers
avatar for Robert J. Hanisch

Robert J. Hanisch

Director of the Office of Data and Informatics, Material Measurement Laboratory, National Institute of Standards and Technology
Dr. Robert J. Hanisch is the Director of the Office of Data and Informatics, Material Measurement Laboratory, at the National Institute of Standards and Technology in Gaithersburg, Maryland. In this role he is responsible for improving data management and analysis practices and helping... Read More →


Thursday June 20, 2019 3:30pm - 4:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

4:00pm EDT

Panel: "Institutional, Regional, National, and International Research Data Initiatives"
What have we learned from national and regional research data sharing initiatives? What type of actions should academic institutions undertake to support research data management? Which responsibilities on improving research data management should fall into governments and which ones into research communities? These are some of the topics that will be discussed in this one hour panel with world leaders involved in research data initiatives.

Moderators
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University

Speakers
avatar for Martha Whitehead

Martha Whitehead

University Librarian, Vice President for the Harvard Library, and Larsen Librarian for the Faculty of Arts and Sciences, Harvard University
Martha Whitehead is the new Vice President for the Harvard Library and University Librarian and Roy E. Larsen Librarian for the Faculty of Arts and Sciences. Prior to joining Harvard in June 2019, Martha was vice-provost (digital planning) and university librarian at Queen’s University... Read More →
avatar for Robert J. Hanisch

Robert J. Hanisch

Director of the Office of Data and Informatics, Material Measurement Laboratory, National Institute of Standards and Technology
Dr. Robert J. Hanisch is the Director of the Office of Data and Informatics, Material Measurement Laboratory, at the National Institute of Standards and Technology in Gaithersburg, Maryland. In this role he is responsible for improving data management and analysis practices and helping... Read More →
avatar for Francine Berman

Francine Berman

Edward P. Hamilton Distinguished Professor of Computer Science, Rensselaer Polytechnic Institute
Dr. Francine Berman is the Edward P. Hamilton Distinguished Professor of Computer Science at Rensselaer Polytechnic Institute (RPI).   She is a Fellow of the Association of Computing Machinery (ACM), a Fellow of the Institute of Electrical and Electronics Engineers (IEEE), and a... Read More →
avatar for Wolfram Horstmann

Wolfram Horstmann

University Librarian, Georg-August-Universität Göttingen
Wolfram Horstmann is the director of the Göttingen State and University Library at Georg-August-University of Göttingen since 2014. Prior to his current position he was Associate Director at the Bodleian Libraries of the University of Oxford, UK. He is currently leading several... Read More →


Thursday June 20, 2019 4:00pm - 5:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

5:00pm EDT

Poster Session
“Arguments (Not Facts) As the Cause for Hyper-Partisan Polarization” by Sandeep Purao, David Yates, and David Murungi. Bentley University

“Building a Harvard Data Commons with Harvard Dataverse” by Mercè Crosas, Scott Yockel Bill Barnett, and Piotr Sliz, IQSS

“Case Study: Code Ocean and Political Analysis” by Xu Fei, Code Ocean

“Dataverse North Metadata Working Group” by Alexandra Cooper, Martine Gagnon, Mark Goodwin, John Huck, Amber Leahey, Michael Steeleworthy, and Sally Taylor, Dataverse North

“Developing an Intelligent Dataset Discovery System for Dataverse” by Xinyue Ye, Mercè Crosas, Xi Niu, Jing Yang, and Ye Zhao, NJIT

“Exploratory Data Visualization with Glue: A New Link to the Harvard Dataverse” by Catherine Zucker, Alyssa Goodman, Tom Robitaille, and Michelle Borkin, Center for Astrophysics

“Future Dataset Page Possibilities: Feedback Requested” by Tania Schlatter, IQSS

“Integrating DataTags with Harvard Dataverse” by Tania Schlatter, Mercè Crosas, IQSS

“Internet Connectivity StatisticsUsing the Dataverse to Share Sensitive Data” by Suso Baleato, James Honaker and Mercè Crosas, IQSS

“Folder feature in Dataverse opens the door to a new era: Repositories 2.0” by Rory Macneil, ResearchSpace

“Simple, Standards-Based Archiving in Dataverse” by Sebastian Karcher, Jim Myers, Sebastian Ostrowski, and Nic Weber, QDR

“UCLA Dataverse: from the Mainframe to the Cloud” by Jamie Jamison, UCLA

“Integrating Open Data Hub with Dataverse” by Michael Daitzman, Massachusetts Open Cloud and Anish Asthana, Red Hat

Thursday June 20, 2019 5:00pm - 6:00pm EDT
CGIS South Concourse Hallway

6:30pm EDT

Conference Dinner (Separate Ticket required)
We’d like for you to join us for a dinner at Orinoco in Harvard Square on Thursday, June 20th at 6:30.

If you’re interested, please purchase your ticket here:
https://secure.touchnet.net/C20832_ustores/web/product_detail.jsp?PRODUCTID=298&SINGLESTORE=true

Thursday June 20, 2019 6:30pm - 8:30pm EDT
Orinoco 56 John F. Kennedy St, Cambridge, MA 02138
 
Friday, June 21
 

9:00am EDT

Coffee and Breakfast
Friday June 21, 2019 9:00am - 9:30am EDT
CGIS South Concourse Hallway

9:30am EDT

Keynote: "Data in the Global Research Commons: Is There a Blueprint?"
Speakers
avatar for Wolfram Horstmann

Wolfram Horstmann

University Librarian, Georg-August-Universität Göttingen
Wolfram Horstmann is the director of the Göttingen State and University Library at Georg-August-University of Göttingen since 2014. Prior to his current position he was Associate Director at the Bodleian Libraries of the University of Oxford, UK. He is currently leading several... Read More →


Friday June 21, 2019 9:30am - 10:00am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

10:00am EDT

Dataverse North: Working Together to Achieve National Success!
Since 2017, the Portage Network's Dataverse North (DV North) provides a community of practice for library practitioners, end users, and hosts of Dataverse in Canada. Together through DV North, libraries and institutions coordinate activities and development across three main areas: Training, Metadata, and Business Models. Between 2018-2019, new training materials, best practices for metadata, and a formal proposal for a National Dataverse Service were released. This presentation will update the broader community about Dataverse North activities and collaborations, including its members, areas of expertise,  recent outputs and achievements, and future steps.

Speakers
avatar for James Doiron

James Doiron

Research Data Management Services Coordinator, University of Alberta


Friday June 21, 2019 10:00am - 10:15am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

10:15am EDT

Community-Driven Collaboration: TDR Updates and Activities
Since last year's Dataverse Community Meeting, the Texas Data Repository (TDR) has increased our contributions to the Dataverse community by hiring a contractor to contribute software improvements, joining the Global Dataverse Community Consortium, completing a reports assessment, beginning a Core Trust Seal audit, an open webinar series called Topics in Research Data Management featuring several research data experts and Dataverse implementers, improving our TDR user documentation, and sharing resources and training materials. Laura Waugh, Chair of the TDR Steering Committee, and Courtney Mumma, Deputy Director of the Texas Digital Library, will share about all of these efforts and some plans moving into the next year.

Speakers
avatar for Courtney Mumma

Courtney Mumma

Deputy Director, Texas Digital Library


Friday June 21, 2019 10:15am - 10:30am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

10:30am EDT

Coffee Break
Friday June 21, 2019 10:30am - 11:00am EDT
CGIS South Concourse Hallway

11:00am EDT

The Holy Grail Has Arrived! The New Folder Feature in Dataverse Could Pave the Way to a New ‘Repository 2.0’ Era.
The new file path feature in Dataverse 4.12  enabling organization of your files in folders paves the way for Dataverse to become a more complete hub for research data.  The 'Repositories 1.0' model on which Dataverse and other repositories were designed assumed deposits of individual files, and that deposits would be made directly, rather than from other data-organizing applications.  Integrations with active data management tools like RSpace add Hivebench eventually were implemented, but their utility was limited because file structures created in these tools could not be carried over through deposits into Dataverse.  

The new file path feature opens the door to a new Repositories 2.0 era, in which data structure and organization put in place during the active research phase can be retained for the archival phase. It will make it far more attractive for a wider range of active data creation and management applications to integrate with Dataverse, and hence for Dataverse to evolve into a powerful data hub that draws in data from a broader range of sources.

With this as background, the presentation will go on to use the RSpace - Dataverse integration to illustrate how the new file path feature can be leveraged to implement transfer of richer datasets in a streamlined fashion.

Speakers
avatar for Rory Macneil

Rory Macneil

CEO, Research Space


Friday June 21, 2019 11:00am - 11:15am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

11:15am EDT

Dataverse - Archivematica Integration Update & Next Steps
For the past several years, the Scholars Portal team has been working with Artefactual Systems Inc. to create an integration between the open source preservation processing tool Archivematica and Dataverse. This technical integration enables users of Archivematica to select and copy datasets from Dataverse instances and process them for long-term access and digital preservation. The Archival Information Packages (AIPs) generated include Dataverse metadata and files, and are packaged for long-term management using Archivematica's standards-compliant workflow. Integration between Archivematica and Dataverse expands capacity to preserve research data, opening up new avenues for librarians and archivists to make valuable research outputs accessible into the future. This presentation will discuss the technical integration project and potential future workflows for preserving research data at Scholars Portal.

Speakers
avatar for Meghan Goodchild

Meghan Goodchild

RDM Librarian, Queen's University/ Scholars Portal


Friday June 21, 2019 11:15am - 11:30am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

11:30am EDT

Nextcloud as an Open Source On-Prem File Sync and Share Solution for Research
The internet and the world wide web were originally designed as distributed and federated networks. In the last few years we've seen a trend to more centralized services from Google, Dropbox, Microsoft, Amazon and others.

The Nextcloud community is building an open source and fully federated and distributed network for data sharing and communication. Everyone can run an Nextcloud server at home or somewhere on the internet and collaborate and share with everyone else. With the help of Federation different Nextcloud servers can be connected to provide on big service.

This talk will cover the current problems with centralisation and strategies on how to fix this problem. It will also discuss the current and upcoming federation features of Nextcloud and how to become part of the community and how to scale to multi million users with the help of Nextcloud Global Scale. 

Speakers
avatar for Frank Karlitschek

Frank Karlitschek

Founder, Nextcloud


Friday June 21, 2019 11:30am - 11:45am EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

11:45am EDT

Freya, Dataverse and the PID graph
NARCIS, the national gateway to scholarly information in the Netherlands (www.narcis.nl) contains information about publications, datasets, research projects, researchers and organizations. NARCIS is an aggregated from various different sources; publication repositories, CRIS's (Current Research Information System) and data repositories like Dataverse.

One of the aims of NARCIS is to show Research Information in its context. For a publication, context can mean one or more relations to different information types: the underlying data, the project, its funding or related publications. Relationships between the different information types provide essential context to research. Although this can seem to be a straightforward task, in practice there are many challenges.

Especially in information exchange and aggregating information from different sources, PIDs are the only way to relate objects sustainably. In this session we will discuss the possibilities of PID-Graphs by comparing NARCIS PID Graphs with ORCID PID Graphs as part of the EU project FREYA.

Speakers
avatar for Chris Baars

Chris Baars

Supervisor Digital Services, Data Archiving and Networked Services (DANS)


Friday June 21, 2019 11:45am - 12:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

12:00pm EDT

pyDataverse - A Python API Wrapper
pyDataverse is a Python module to access Dataverse by it's native API,
developed by AUSSDA - The Austrian Social Science Data Archive. It is
open source and it's main purpose is to make (mass) imports, exports and
updates of Dataverses, Datasets and Datafiles easy. It works with data
and metadata coming from different sources and in different formats
(Nesstar, CSV, DDI XML, GESIS DSpace, another Dataverse instance).

The presentation will give insight into the basic idea of pyDataverse,
give an outlook on planned activities and show in a short live demo,
what it already can be used for.

https://github.com/AUSSDA/pyDataverse

Speakers
avatar for Stefan Kasberger

Stefan Kasberger

DevOp, The Austrian Social Science Data Archive (AUSSDA)
Interested in Open Source and Open Science.


Friday June 21, 2019 12:00pm - 12:15pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

12:15pm EDT

Dataverse as a Key Component of the Institutional Data Infrastructure of Inra
In this presentation, we will explain why Dataverse happened to be the best compromise for us given our requirements and resources to develop an institutional data repository: Data Inra (https://data.inra.fr), how we deployed it, which difficulties we encountered and how we overcame (or are expecting to overcome) them. 

Data Inra is now one of our key data management and sharing service, contributing to make our data more compatible with the FAIR principles. We will describe the services we offer on top of it to support the users and foster good data management and sharing practices. Data Inra is an example for other research organizations in France and several of them are planning to adopt the Dataverse tool. We are eager to share our perspectives and discuss how we could better contribute to the roadmap of Dataverse.

Speakers
avatar for Esther Dzalé Yeumo

Esther Dzalé Yeumo

Unit of the Department of Scientific Information, INRA


Friday June 21, 2019 12:15pm - 12:30pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

12:30pm EDT

Lunch
Friday June 21, 2019 12:30pm - 1:30pm EDT
CGIS South Concourse Hallway

1:30pm EDT

Time Machine for the Web
Professor Frédéric Kaplan holds the Digital Humanities Chair at the École Polytechnique Fédérale de Lausanne (EPFL) and directs the EPFL Digital Humanities Laboratory. He conducts research projects combining archive digitisation, information modelling and museographic design. He has published over one hundred scientific papers and eight books.  He is currently directing the “Venice Time Machine”, an international project aiming to model the evolution and history of Venice over a 1000-year period and the Time Machine Project, funded by the European Commission, a spatial and temporal extension of this ambition at a global scale.

Speakers
avatar for Vyacheslav Tykhonov

Vyacheslav Tykhonov

Senior Information Scientist, Data Archiving and Networked Services (DANS)
Vyacheslav Tykhonov (Slava) is Senior Information Scientist at the Data Archiving and Networked Services (DANS), an institute of the Royal Netherlands Academy of Arts and Sciences (KNAW) and Dutch funding organization NWO. He is serving as lead developer of DataverseEU project coordinating... Read More →
avatar for Frédéric Kaplan

Frédéric Kaplan

Digital Humanities Chair, École Polytechnique Fédérale de Lausanne (EPFL)


Friday June 21, 2019 1:30pm - 1:45pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

1:45pm EDT

Data Curation Tool: A Metadata Driven Tool for Curators
One of the great strengths of Dataverse is its ability to ingest tabular data and extract file and variable-level metadata from certain file types. Dataverse has long supported description of data using the Data Documentation Initiative (DDI) open metadata standard, and efforts to build on metadata editing and curation support at the variable-level are needed.  Scholars Portal has been working with the Dataverse team to add curation functionality to enhance DDI metadata at the variable-level using an integrated and modular Data Curation Tool (DCT). DCT will allow curators to create and edit variable-level metadata including the ability to group and order variables, assign weighting, enhance labeling, and, add contextual documentation such as question text, interviewer instructions, universe statements, and notes to variables.  A prototype of the DCT has been developed by Scholars Portal with much of the back-end database core work completed in the winter 2019. This presentation will provide an overview of the DCT including information about how to get started with using it locally or through the Scholars Portal hosted service. DCT is available as open source software in GitHub: https://github.com/scholarsportal/Dataverse-Data-Curation-Tool

Speakers
VL

Victoria Lubitch

Developer, Scholars Portal, Ontario Council of University Libraries


Friday June 21, 2019 1:45pm - 2:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:00pm EDT

Dataverse for Qualitative Data
The Qualitative Data Repository (QDR) launched its Dataverse instance in early 2018. Since then, we have continually adapted the software to our needs and processes, with a special emphasis on qualitative data, which is at the heart of QDR's mission. In this talk, we will describe some the innovations QDR has introduced to Dataverse to optimize it for qualitative research data:
  1. Given the predominance of textual data in qualitative research, the ability to search through the text of data files in common formats is crucial. QDR added this ability, ensuring also that restricted files remain protected from unauthorized access.
  2. The ability to quickly view a data file enables users to judge whether a data project is of interest to them. With Scholarportal's "Dataverse Data Explorer," such easy viewing is already available for typically tabular/quantitative files. Using a suite of lightweight javascript previewers, QDR offers easy online viewing for common qualitative formats, including images, sound, video, and textual files.
  3. To complement its "Annotation for Transparent Inquiry" project, QDR introduced a tool that uploads the content of Hypothesis open web annotations into Dataverse and adds a viewer for those annotations.

We will conclude the talk by describing some of QDR's ongoing and planned work, focused centrally on the curation of qualitative data.

(co-presented with Jim Myers, Sebastian Ostrowski, Nic Weber)

Speakers
avatar for Sebastian Karcher

Sebastian Karcher

Associate Director, Qualitative Data Repository
avatar for Jim Myers

Jim Myers

Qualitative Data Repository
SO

Sebastian Ostrowski

Qualitative Data Repository
avatar for Nic Weber

Nic Weber

Co-Technical Director, Qualitative Data Repository


Friday June 21, 2019 2:00pm - 2:15pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:15pm EDT

When Data Repository Workflows Meet Reality
I will present three case studies of recent data curation efforts working with researchers in the humanities, sciences, and social sciences using Dataverse data repository services at Emory University. In each case, we consulted with the researchers about their data, including considerations for related publications or other public dissemination of their research. With the researchers and  librarian collaborators, we developed a custom workflow for each case, tailored to the unique circumstances of their data deposit needs and existing research practices. 

Speakers
avatar for Jennifer Doty

Jennifer Doty

Research Data Librarian, Emory University


Friday June 21, 2019 2:15pm - 2:30pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:30pm EDT

YARD for Data Curation and Code Review
YARD is a tool for reviewing and enhancing research outputs before publication. The tool is designed to manage the data curation and code review workflow and push high quality and FAIR data packages seamlessly into any data repository for the purpose of advancing research reproducibility, transparency, and long-term use. YARD offers a single pipeline to connect researchers, curators, and publishers. It helps view and track curation tasks for easy administration via dashboard and integrates metadata production with data curation and code review. The tool leverages the Data Documentation Initiative (DDI Lifecycle 3.2) metadata schema and is API-ready for integration with applications that perform automated curation tasks. An open source base, it is licensed under the GNU Affero General Public License (GPL) v3.0. The tool was developed by the Institution for Social and Policy Studies (ISPS) at Yale University and Colectica, with funding from Innovations for Poverty Action (IPA). It is currently piloted at ISPS, and several groups have expressed an interest in adopting it for similar workflows. We are interested in exploring integration with Dataverse. https://yard.yale.edu/

Speakers
avatar for Limor Peer

Limor Peer

Associate Director for Research, Yale University, Institution for Social and Policy Studies


Friday June 21, 2019 2:30pm - 2:45pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

2:45pm EDT

Curation Panel
This panel will engage the Dataverse Community in discussing data curation using Dataverse. We will discuss the benefits and challenges encountered in the self-curation and deposit model, as well as the need for collaborative curation support models, and the intersection between domain experts and curation.

Moderators
avatar for Ceilyn Boyd

Ceilyn Boyd

Research Data Program Manager, Harvard Library

Speakers
avatar for Sebastian Karcher

Sebastian Karcher

Associate Director, Qualitative Data Repository
avatar for Sonia Barbosa

Sonia Barbosa

Manager of Data Curation, Dataverse, IQSS, Harvard University
avatar for Thu-Mai Christian

Thu-Mai Christian

Assistant Director for Archives, Odum Institute UNC Chapel Hill
avatar for Amber Leahey

Amber Leahey

Librarian, Scholars Portal, Ontario Council of University Libraries


Friday June 21, 2019 2:45pm - 3:15pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

3:15pm EDT

Coffee Break
Friday June 21, 2019 3:15pm - 3:45pm EDT
CGIS South Concourse Hallway

3:45pm EDT

Dataverse Community News and Other Crazy Ideas
Community is critical to the success of software. Listening to and engaging with the community is an investment of time that reaps great rewards. Dataverse is not a top down community with a dictator at the helm. Rather, we celebrate our ground up nature and encourage each other to try new ideas aimed at improving communication, breaking down barriers between us, and helping each other achieve our goals. This talk will focus on crazy ideas from Dataverse developer Philip Durbin, both present and planned for the future, but time will be reserved to hear from the crowd.

Speakers
avatar for Philip Durbin

Philip Durbin

Developer, Harvard/Dataverse
Philip Durbin has been hacking on Dataverse full time since 2012. When he isn't writing code. he's herding cats in the Dataverse community, answering questions via email, chat, and community calls. He has given talks about Dataverse in Jakarta, Berlin, Brussels, Lisbon, San Francisco... Read More →


Friday June 21, 2019 3:45pm - 4:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138

4:00pm EDT

Community Discussion and Closing Remarks
Friday June 21, 2019 4:00pm - 5:00pm EDT
Tsai Auditorium (Room S010) CGIS South, Tsai Auditorium 1730 Cambridge Street Cambridge, MA 02138