PROJECT

Section: CCB Collaboration Projects (7)
Last updated: Wed 10 Jan 16:53:42 GMT 2024
Index Return to Main Contents
 

DEPRECATION NOTICE

This CCB documentation has been superseded by our new website: https://lumin.imm.ox.ac.uk  

OVERVIEW

Collaboration projects are an easy, secure way for you to share your research data with other people on the cluster. Each project represents a single collaboration and you can have as many as you need! Data stored in a project is automatically given the correct permissions so that only other people in the project can see it.

You just pick the logical set of data and define the people it needs to be shared with, then we create the project for you and update the list of people as time goes on so that it's always correct. They're a secure, convenient way to share data.

Please note that only the PI who owns that data may apply for, or request modifications to, a collaboration project.  

WHY DO I NEED MY PI TO APPLY?

We require your PI to apply because they're the data controller on behalf of the University and that responsibility can't be delegated, and also because we need their agreement to our terms and conditions before we can store the data. As much as this may seem annoying, there are some really important things that the PI can do with that data at any time:

- Ask for a new person to be added to the project

- Request that the ownership of any data be transferred to another member of the project; this is especially useful when someone leaves the lab

- Ask for quota to be transferred from one project to another at no cost; this includes creating a new project from unused quota in another, as well as closing one project to increase the quota of another (assuming the data has been moved)

If you're the PI of a project and would like to do any of these, please contact us via help@imm.ox.ac.uk .  

HOW DO COLLABORATION PROJECTS WORK?

Each collaboration project has its own project space in /project. Here's an example of a project with just two members:

/project/ccbadmin/
                 /dtooke/
                 /emacmahon/
                 /shared/
                 /datashare/

Each project member gets their own directory - only the owner can add, remove or modify contents. Each project gets two additional directories: shared and datashare. Any project member can write data into shared; datashare is similar except its contents are publicly accessible from Internet - see datashare(7) for details. With the exception of datashare, all project data access is restricted to project members for confidentiality. Everything within a project is read-only readable by all other project members for collaboration.  

IMPORTANT SAFETY POINTS

It's a common misconception that the /shared/ directory allows anyone in the project to read and write the data. This is not the case. The data is still owned by the user who creates it and is only writable by them by default. This is deliberate. As much as it may seem attractive to have all data in the directory immediately writable by anyone in the group, that would be dangerous. Multiple people editing the same file at the same time runs a risk of unrecoverable data loss and multiple people with edit access to a directory allows any one of them to delete all the files with a single mistyped command. As such, we recommend against trying to "fix" this.

If you're absolutely positive that you want to make data editable by more than one person, and you're happy to risk permanent and irreversible data corruption and/or loss , the owner of the data can change the permissions as normal using chmod(1).  

HOW DO I JOIN AN EXISTING COLLABORATION PROJECT?

Please ask the PI in charge of the project to contact us via help@imm.ox.ac.uk and ask for your username to be added.  

SUCCESSFUL PROJECTS

There's a few things you can do to maximise the likeihood that your projects will be successful. We'd recommend you take the following points into account before you start.

- You should aim to create a collaboration project for a set of logically connected data, e.g. a specific research project. Creating a collaboration project for a list of people and then thinking about the data they may want to share will risk that you end up with poorly defined data boundries and have to unentangle the data at a future data.

- You should create a new collaboration project for each logically separate set of data. Adding different data to an existing collaboration project and then adding more people will likely lead to the same problems described above.

- Please tell us if the permissions on the data in your project aren't working for you. Trying to work around an issue on your own is more likely to make the problem worse and take longer overall.

- Please tell us if someone leaves the project and no longer needs access to the data. Leaving accounts with access to data they don't need is a security and privacy risk.

 

HOW DO I SET UP A COLLABORATION PROJECT?

Setting up a new collaboration project is quick and easy. Please remember that only PIs may request new projects.

1. Decide the purpose of the project. Each project should be for a single requirement, e.g. data for a lab or data for a research project. If different people need access to different logical data sets, these should be multiple projects. Simmarly, if the same people need access to different logical data sets, these should also be multiple projects

2. Choose a name for the project that isn't already in use in /project/

3. Decide which user accounts should have access to the data

4. Decide how much space you think you'll need (in multiples of 1 TB)

5. Follow the steps detailed in accounts(7).

 

WHAT ABOUT MY OTHER DIRECTORIES?

The collaboration projects are an addition to your existing options, not a replacement, meaning that you can have several different locations for your data. Our suggestion for organising your data would be the following:

/home/some_directory/USERNAME/ : ideal for small private data such as documents or configuration files which are private to you

/project/project1/USERNAME/ : used to store data which is part of project1 but won't typically be used by other people

/project/project1/shared/ : used to store data which is part of project1 and is likely to be used by multiple people

 

GETTING HELP

You can email the CCB team using the email address help@imm.ox.ac.uk. Using this address ensures your email is logged and assigned a tracking number, and will go to all the core team, which means the appropriate person or people will be able to pick it up.  

COPYRIGHT

This text is copyright University of Oxford and MRC and may not be reproduced or redistributed without permission.  

AUTHOR

Duncan Tooke <duncan.tooke@imm.ox.ac.uk>  

SEE ALSO

accounts(7), getquota(1)


 

Index

DEPRECATION NOTICE
OVERVIEW
WHY DO I NEED MY PI TO APPLY?
HOW DO COLLABORATION PROJECTS WORK?
IMPORTANT SAFETY POINTS
HOW DO I JOIN AN EXISTING COLLABORATION PROJECT?
SUCCESSFUL PROJECTS
HOW DO I SET UP A COLLABORATION PROJECT?
WHAT ABOUT MY OTHER DIRECTORIES?
GETTING HELP
COPYRIGHT
AUTHOR
SEE ALSO