Data Storage and Backups at CSAIL

TIG provides data storage for CSAIL members through the AFS and NFS network file systems and through Ceph object and block storage. Together, these services provide over 1 PB of local storage, with enterprise backup and off-site disaster recovery.

As a CSAIL lab member, you have access to multiple storage options. Choose the one that best fits your needs, location, and sharing requirements.

Quick Reference

Service         Best For                              Access              Quota
AFS             Secure, cross-platform file sharing   Network + remote    20 GB (can be increased up to 200 GB)
NFS             High-performance local access         CSAIL network only  Arranged per request
Dropbox         Cloud sync & sharing                  Any device/browser  500 GB per user
Google Storage  Collaboration & document sharing      Any device/browser  500 GB per user

AFS File System Services

AFS is a distributed file system that uses Kerberos for authentication. It is cross-platform and has built-in redundancy, fault tolerance, high availability, and backup/restore management. It also supports a much more granular access control model than standard UNIX modes, including user-defined access groups and, if desired, unauthenticated (anonymous) access.

Best for: Secure file sharing, cross-platform access, and fine-grained permission control.
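
As an illustration of the access control model described above, here is a minimal sketch that builds an OpenAFS "fs setacl" command line. It assumes the OpenAFS client tools are installed and that you hold valid tokens. The directory path and the group name jdoe:collaborators are hypothetical placeholders, not real CSAIL paths or groups. The sketch only constructs the command; running it is left to the caller.

```python
import shlex

# AFS directory rights: r(ead), l(ookup), i(nsert), d(elete), w(rite), k(lock), a(dminister)
VALID_RIGHTS = set("rlidwka")
# Shorthand names that "fs setacl" also accepts in place of rights letters
SHORTHAND = {"read", "write", "all", "none"}


def build_setacl_command(directory, principal, rights):
    """Return the argv list for an `fs setacl` call granting `rights` to `principal`.

    `principal` can be a user, a user-defined group such as "owner:groupname",
    or "system:anyuser" for unauthenticated access.
    """
    is_letters = bool(rights) and set(rights) <= VALID_RIGHTS
    if rights not in SHORTHAND and not is_letters:
        raise ValueError(f"unrecognized AFS rights: {rights!r}")
    return ["fs", "setacl", "-dir", directory, "-acl", principal, rights]


if __name__ == "__main__":
    # Hypothetical directory and group, for illustration only.
    cmd = build_setacl_command(
        "/afs/csail.mit.edu/u/j/jdoe/shared", "jdoe:collaborators", "rl"
    )
    print(shlex.join(cmd))  # pass `cmd` to subprocess.run to actually apply it
```

Granting "rl" to system:anyuser instead would make a directory readable without authentication, which corresponds to the anonymous access mentioned above.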


NFS File System Services

NFS (Network File System) provides high-performance network storage for workstations and servers at CSAIL. All users automatically have access to two scratch filesystems: /data/scratch (Stata) and /data/scratch-oc40 (Holyoke data center), each with a 1 TiB quota per user. Files are subject to deletion if not accessed for more than a year.
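
Because scratch files may be deleted once they go unaccessed for more than a year, it can help to check which of your files are approaching that limit. The sketch below lists files whose last access time is older than a threshold. It assumes that the cleanup policy is based on file access time (atime) and that your files live in a per-user directory under /data/scratch. Neither assumption is confirmed here, so adjust the path and threshold to match your setup.

```python
import os
import time

ONE_YEAR_SECONDS = 365 * 24 * 60 * 60


def find_stale_files(root, max_age_seconds=ONE_YEAR_SECONDS, now=None):
    """Yield (path, days_since_access) for files under `root` whose last
    access time is more than `max_age_seconds` in the past."""
    now = time.time() if now is None else now
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                atime = os.lstat(path).st_atime
            except OSError:
                continue  # file vanished or is unreadable; skip it
            age = now - atime
            if age > max_age_seconds:
                yield path, int(age // 86400)


if __name__ == "__main__":
    # Assumed layout: one directory per user under /data/scratch.
    scratch = os.path.join("/data/scratch", os.environ.get("USER", ""))
    for path, days in find_stale_files(scratch):
        print(f"{days:5d} days  {path}")
```

Passing a shorter threshold, for example 300 days, gives some warning before files reach the one-year mark.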

Security Warning: NFS has no built-in security. Data is transmitted unencrypted and access control relies on UNIX file permissions only. See [NFS Security] for important information.
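
Since access control on NFS relies on UNIX file permissions alone, one practical precaution is to audit your own directories for files that are open to all users. The following is a minimal sketch, not an official TIG tool: it finds entries under a directory that grant any permission to "other" and provides a helper that clears those bits. The function names are illustrative. Note that it checks the contents of the directory, not the top-level directory itself.

```python
import os
import stat


def find_exposed_paths(root):
    """Yield paths under `root` whose mode grants any access to 'other' users."""
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            try:
                mode = os.lstat(path).st_mode
            except OSError:
                continue  # entry vanished or is unreadable; skip it
            if stat.S_ISLNK(mode):
                continue  # permission bits on symlinks are not meaningful
            if mode & stat.S_IRWXO:
                yield path


def restrict_to_owner_and_group(path):
    """Clear read/write/execute bits for 'other', keeping owner and group bits."""
    mode = stat.S_IMODE(os.lstat(path).st_mode)
    os.chmod(path, mode & ~stat.S_IRWXO)
```

Reviewing the output of find_exposed_paths before calling restrict_to_owner_and_group avoids locking out collaborators who rely on world-readable files. Tightening permissions does not encrypt data in transit.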

NFS is only accessible from the CSAIL network.


Dropbox

IS&T has licensed Dropbox for Business for the entire MIT community. Dropbox provides convenient access to your data via Windows, macOS, Linux, mobile clients, and any web browser. With the Dropbox client, you can access your cloud data as a folder on your local disk.

Key features:

Enrollment is not automatic. Register at https://dropbox.mit.edu. Affiliates and temporary employees require a sponsor. See IS&T’s Dropbox Landing Page for complete details.

Best for: Cloud sync, data sharing, and backup across devices.


Google Workspace for Education

Google Workspace for Education is MIT’s cloud collaboration service. It provides registered users with Google apps and services for collaboration, separate from personal Google accounts.

Key features:

To request access: MIT Affiliates should have their full-time staff sponsor email servicedesk@mit.edu with the affiliate’s MIT email address.

For complete details, see IS&T’s Google Workspace Landing Page.


Data Security at CSAIL

This section covers how to determine how sensitive your data is and how to properly store and handle data at each security level, particularly Medium Risk data. See also MIT’s Written Information Security Program (WISP).


Backups

All TIG-provided network storage is backed up by default, though exceptions can be made by request.

Backup retention:


Revision Control

For source code and version control:


Questions? Contact TIG at help@csail.mit.edu or visit 32-270.