Agenda and Presentations
This tutorial will be split roughly equally between presentation, demonstration, and hands-on exercises so that participants can understand how and when to use the services and tools presented. Each exercise will build on prior material. We will also provide time for group discussion on critical issues related to security and integration with campus identity systems such as InCommon, as well as techniques for eliminating common network bottlenecks.
Introductory Session: Overview of Globus Data Management Capabilities
In the introductory session, you will access the Globus service, configure a data management endpoint on your laptop, and use it to move data between your laptop and an XSEDE endpoint (a demonstration endpoint will be provided for those who do not have an XSEDE account). You will also experiment with Globus file sharing and group management features.
- Overview of scientific research data management challenges (10 min)
- Introduction to Globus software-as-a-service (10 min)
- Demonstration: File transfer and sharing (15 min)
- Exercise 1: Account signup and configuration (10 min)
- Exercise 2: File transfer and sharing (20 min)
- Demonstration: Globus data publication and discovery (15 min)
- General Q&A and agenda overview for advanced session (10 min)
Advanced Session: Campus Deployment and Configuration
In the advanced session, you will learn how to create and configure a server endpoint (using presenter-provided cloud-hosted servers), how to manage access for multiple users, and how to troubleshoot common configuration errors.
- Enabling a storage resource with Globus Connect Server (15 min)
- Exercise 3: Set up a Globus endpoint using Globus Connect Server (20 min)
- Exercise 4: Globus endpoint configuration (15 min)
- Advanced Globus endpoint configuration (10 min)
- Globus security, best practices, and deployment scenarios (20 min)
Target Audience
In the introductory session we will review the issues faced by investigators in moving and sharing data between XSEDE resources and across institutional boundaries with their collaborators. We will provide you with the tools necessary to address many of these issues. This session is also appropriate for campus/research computing managers who are exploring research data management options for their institution.
The advanced session is aimed primarily at HPC resource owners, network and system administrators, and campus computing center managers. We will discuss tools for delivering an efficient research data management service on your campus. The material in this session is appropriate for both management and technical staff, but will be more technical than the introductory session. Attendees with a basic knowledge of Linux should be able to follow the discussion and participate in the exercises.
Prerequisites
For the introductory session, you will need only a laptop with a modern web browser. For the advanced session, you should also have an SSH client installed and be familiar with the Linux command line. An XSEDE account is desirable but not required.
Test Endpoints
ESnet endpoints: esnet#anl-diskpt1, esnet#bnl-diskpt1, esnet#lbl-diskpt1; test files in /data1 directory. These are read-only endpoints with datasets varying in size and number of files; they are useful for testing network performance.
Globus test endpoints: go#ep1, go#ep2; these have a 10 MB quota and are used mostly to validate that a new Globus account is working.
Globus sharing endpoints: you may create shared endpoints on go#ep1 and go#ep2. Note that these shared endpoints may not persist through service upgrades, and data stored on them may be deleted without warning; please use them only for test purposes.
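When using the ESnet endpoints to test network performance, it can help to convert a transfer's size and duration into an average rate. The following is a minimal sketch of that arithmetic only; the function name and the sample figures are hypothetical and are not taken from any Globus tool or measurement.

```python
def throughput_mbps(bytes_transferred, elapsed_seconds):
    """Return the average rate in megabits per second (1 Mb = 10**6 bits)."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return bytes_transferred * 8 / elapsed_seconds / 1e6


# Hypothetical measurement: 10 GB (10 * 10**9 bytes) moved in 100 seconds.
rate = throughput_mbps(10 * 10**9, 100.0)
print(f"{rate:.0f} Mb/s")  # prints "800 Mb/s"
```

Comparing this figure across datasets of different sizes and file counts can show whether a transfer is limited by the network path or by per-file overhead.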
Globus Connect Server Installation
Globus Connect Server Configuration
Edit the configuration file: /etc/globus-connect-server.conf
Enable the configuration changes: globus-connect-server-setup
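The edit step above can also be scripted. The sketch below assumes the configuration file uses an INI-style layout and works on an in-memory sample rather than the real file. The section and option names (Endpoint, GridFTP, Name, Public, Sharing) are assumptions chosen for illustration, and update_conf is a hypothetical helper, not part of Globus Connect Server.

```python
import configparser
import io

# Hypothetical sample text; section and option names are assumptions,
# not a verified copy of /etc/globus-connect-server.conf.
SAMPLE_CONF = """\
[Endpoint]
Name = my-test-endpoint
Public = False

[GridFTP]
Sharing = False
"""


def update_conf(text, changes):
    """Return INI text with {(section, option): value} changes applied."""
    # Disable interpolation so any '%' in values is kept literally.
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # preserve the case of option names
    parser.read_string(text)
    for (section, option), value in changes.items():
        if not parser.has_section(section):
            parser.add_section(section)
        parser.set(section, option, value)
    out = io.StringIO()
    parser.write(out)
    return out.getvalue()


updated = update_conf(
    SAMPLE_CONF,
    {("Endpoint", "Public"): "True", ("GridFTP", "Sharing"): "True"},
)
print(updated)
```

Note that configparser discards comments when it writes a file, so for the real configuration file, editing by hand may be the safer choice. Either way, the changes take effect only after running globus-connect-server-setup.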