
[WIP] Update Cambridge config for current CSD3 partitions #1102

Merged
RaqManzano merged 12 commits into nf-core:master from RaqManzano:update-cambridge-config
May 5, 2026

Conversation

@RaqManzano
Contributor

@RaqManzano RaqManzano commented Apr 29, 2026


name: Update Cambridge CSD3 Config
about: Updating Cambridge CSD3 cluster config

Summary

This PR updates the Cambridge CSD3 institutional profile to reflect the current partitions and simplifies the configuration logic based on feedback from the nf-core team.

Changes

  • update conf/cambridge.config with current partition limits:
    • icelake
    • icelake-himem
    • sapphire
  • set icelake as the default partition
  • keep --partition as a user override
  • infer walltime limits from the SLURM account name (see the sketch after this list):
    • accounts containing -SL3- get a 12.h cap
    • otherwise default to 36.h for SL1 / SL2
  • add schema validation ignores for config-specific parameters
  • refresh docs/cambridge.md:
    • update install instructions
    • document partition selection
    • explain screen / tmux usage
    • add a note recommending srun / sbatch for large runs where the Nextflow manager can become memory-heavy
  • add Cambridge to .github/CODEOWNERS
  • add Cambridge-specific module initialization and SLURM executor tuning to improve job submission stability on CSD3
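
A minimal sketch of the partition default, user override, and account-based walltime logic described above. The SLURM_ACCOUNT environment variable and the exact spellings here are assumptions for illustration; the merged conf/cambridge.config may differ in detail.

// Sketch only, not the merged config verbatim; SLURM_ACCOUNT is an assumed env var.
def slurmAccount = System.getenv('SLURM_ACCOUNT') ?: ''

// icelake is the default partition; --partition on the command line overrides it.
params.partition = 'icelake'
// Accounts containing -SL3- are capped at 12.h; SL1 / SL2 default to 36.h.
params.max_time  = slurmAccount.contains('-SL3-') ? 12.h : 36.h

process {
    executor = 'slurm'
    queue    = { params.partition }
}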

Acknowledgements

Many thanks to @pontus, @jfy133, @tdanhorn and @maxulysse for their initial feedback in the nf-core channel.

@RaqManzano RaqManzano self-assigned this Apr 29, 2026
Comment thread docs/cambridge.md Outdated
Member

@jfy133 jfy133 left a comment


Sorry for any duplicate comments!

Don't forget to add the config in: https://github.com/nf-core/configs/blob/master/.github/workflows/main.yml

Comment thread conf/cambridge.config
Comment on lines +20 to +22
// Compatibility with nf-core schema validation across pipeline versions.
schema_ignore_params = 'partition,project,max_memory,max_cpus,max_time,csd_time,csd_parts,csd_selected,validationSchemaIgnoreParams'
validationSchemaIgnoreParams = 'partition,project,max_memory,max_cpus,max_time,csd_time,csd_parts,csd_selected,schema_ignore_params,validationSchemaIgnoreParams'
Member


Are these two really needed? Line 26 seems to cover these already.

Contributor Author


We could, but these are kept for compatibility with older pipelines that still look for schema_ignore_params and validationSchemaIgnoreParams.
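
For illustration, one way to keep the two compatibility keys from drifting would be to define the ignore list once; this is a sketch of the idea, not necessarily how the merged config is written:

// Illustrative only: define the shared ignore list once, reuse it for both keys.
def csdIgnore = 'partition,project,max_memory,max_cpus,max_time,csd_time,csd_parts,csd_selected'
params.schema_ignore_params         = "${csdIgnore},validationSchemaIgnoreParams"
params.validationSchemaIgnoreParams = "${csdIgnore},schema_ignore_params,validationSchemaIgnoreParams"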

Comment thread conf/cambridge.config
}

// Description is overwritten with user specific flags
params.csd_time = {
Member


Can this not go in the main params block?

Contributor Author


I prefer to keep it separate from the main block, as it is not really a user-facing param.
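
As a pattern illustration only (the assignment style follows the diff above, but the closure body here is invented):

// Internal helper kept outside the user-facing params block; the body is hypothetical.
params.csd_time = { account ->
    account.contains('-SL3-') ? 12.h : 36.h
}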

@tavareshugo
Contributor

Thanks for this Raquel! A few comments and suggestions from my side:


cambridge.md

In the guidelines, I would give a specific recommendation for where to store the singularity cache.

On our HPC, it is definitely not recommended to store the cache directly in $HOME/ -- perhaps an explicit warning about this would be good to include.

You could also give a specific recommendation, for example: $HOME/rds/hpc-work/nxf-singularity-cache is a good place.
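
For instance, pointing Nextflow at that location from a config file could look something like this (singularity.cacheDir is the standard option; setting the NXF_SINGULARITY_CACHEDIR environment variable works too):

// Keep the Singularity image cache on RDS rather than under the $HOME quota.
singularity {
    enabled  = true
    cacheDir = "${System.getenv('HOME')}/rds/hpc-work/nxf-singularity-cache"
}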


I like your approach of defining the max parameters from the partition. At the risk of over-complicating, I wonder if there could be an "automatic" selection if the user doesn't specify anything. For example, in my config I have the following:

process {
  executor = 'slurm'
  clusterOptions = '--partition icelake'

  // Settings below are for CSD3 nodes detailed at
  //   https://docs.hpc.cam.ac.uk/hpc/index.html
  // Current resources (Jun 2023):
  //   icelake: 76 CPUs; 3380 MiB per cpu; 6760 MiB per cpu (himem)
  //   cclake: 56 CPUs; 3420 MiB per cpu; 6840 MiB per cpu (himem)
  // The values used below were chosen to be multiples of these resources
  // assuming a maximum of 2 retries

  // Using himem partition to ensure enough memory for single-CPU jobs
  withLabel:process_single {
      cpus   = { check_max( 1                  , 'cpus'    ) }
      memory = { check_max( 6800.MB * task.attempt, 'memory'  ) }
      time   = { check_max( 4.h  * task.attempt, 'time'    ) }
      clusterOptions = "--partition icelake-himem"
  }
  // 4 CPUs + 13GB RAM
  withLabel:process_low {
      cpus   = { check_max( 4     * task.attempt, 'cpus'    ) }
      memory = { check_max( 13.GB * task.attempt, 'memory'  ) }
      time   = { check_max( 4.h   * task.attempt, 'time'    ) }
      clusterOptions = "--partition icelake"
  }
  // 8 CPUs + 27GB RAM
  withLabel:process_medium {
      cpus   = { check_max( 8     * task.attempt, 'cpus'    ) }
      memory = { check_max( 27.GB * task.attempt, 'memory'  ) }
      time   = { check_max( 8.h   * task.attempt, 'time'    ) }
      clusterOptions = "--partition icelake"
  }
  // 12 CPUs + 40GB RAM
  withLabel:process_high {
      cpus   = { check_max( 12    * task.attempt, 'cpus'    ) }
      memory = { check_max( 40.GB * task.attempt, 'memory'  ) }
      time   = { check_max( 8.h   * task.attempt, 'time'    ) }
      clusterOptions = "--partition icelake"
  }
  // Going by chunks of 12h (2 retries should bring it to max of 36h)
  withLabel:process_long {
      time   = { check_max( 12.h  * task.attempt, 'time'    ) }
  }
  // A multiple of 3 should bring it to max resources on icelake-himem
  withLabel:process_high_memory {
      cpus   = { check_max( 25     * task.attempt, 'cpus'    ) }
      memory = { check_max( 170.GB * task.attempt, 'memory' ) }
      clusterOptions = "--partition icelake-himem"
  }
  withLabel:error_ignore {
      errorStrategy = 'ignore'
  }
  withLabel:error_retry {
      errorStrategy = 'retry'
      maxRetries    = 2
  }
}

So, I allow for 2 retries and increase the resources accordingly, to roughly reach the maximum resources of each partition.
The behaviour could then be something like:

  • If the user specifies --partition, then all jobs are submitted to that partition only
  • If the user doesn't specify anything, something like the above would kick in, where jobs are submitted either to icelake or icelake-himem depending on the process labels (see the sketch after this list).
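
One way to sketch that fallback, assuming params.partition is a user-facing param that is unset by default:

process {
    executor = 'slurm'
    // An explicit --partition wins; otherwise each label picks its own default.
    withLabel:process_single {
        clusterOptions = { "--partition ${params.partition ?: 'icelake-himem'}" }
    }
    withLabel:process_low {
        clusterOptions = { "--partition ${params.partition ?: 'icelake'}" }
    }
    // ...and so on for the remaining labels.
}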

But, totally fine if you prefer to leave these extra additions out - the config revision you did is already a great update to the previous config, thanks so much! 🙂

@RaqManzano
Contributor Author

Thanks for the comments @tavareshugo, I just changed $HOME to $HOME/rds; you were right, it was lazy writing on my side, thanks for calling that out! Regarding the "automatic" selection scenario, I already tried an implementation of it, and after discussing with the nf-core team we went for a simpler approach. I refined the docs with info about the partitions. Let me know if you agree; feel free to make changes.

Comment thread docs/cambridge.md
Contributor

@tavareshugo tavareshugo left a comment


Keeping the config "simpler" is probably a good idea indeed.

I've made a couple of minor suggestions to the markdown.

@RaqManzano RaqManzano merged commit 3a9a247 into nf-core:master May 5, 2026
152 of 161 checks passed
