Skip to content

Sync meeting 2023 09 25 with CernVM FS developers on Best Practices for CernVM FS on HPC tutorial

Kenneth Hoste edited this page Dec 8, 2023 · 3 revisions

Best Practices for CernVM-FS in HPC

Sync meeting (2023-09-25)

Attending:

  • CernVM-FS: Laura, Valentin
  • EESSI/MultiXscale: Lara, Alan, Bob, Kenneth

Notes

  • GitHub repository @ https://github.com/multixscale/cvmfs-tutorial-hpc-best-practices
  • Progress on tutorial contents
    • Introduction to CVMFS split up into subsections (see PR #14)
      • suggestions by Jakob/Lara largely taken into account
      • TODO:
        • revisit subsection on caching, needs more structure
        • flesh out "Terminology" + "Example repositories" subsections
        • visuals?
      • should also mention systems where CernVM-FS is used (like ComputeCanada, NERSC, EuroHPC Vega)
    • EESSI section: PR #12
      • (still) to be reviewed/merged [Kenneth]
    • "Accessing a repository" section [Bob?]
      • Installing + configuring CernVM-FS client
      • squid proxy (offline nodes + LRU cache in network)
    • Performance aspects
      • focused call on benchmarking with Alan, Laura, Kenneth
  • date for online tutorial?
    • Mon 4 Dec 2023 (13:30-17:00 CET with 30min break)
      • still realistic?
      • promote via CASTIEL2?
      • should be announced at least 1 month up front => end of Oct'23
      • announce via MultiXscale website + EuroHPC portal (https://hpc-portal.eu)
      • registration via UGent event manager?
  • next sync meetings
    • Thu 28 Sept'23 16:00 CEST on performance aspects (Alan, Lara, Kenneth)
    • Mon 23 Oct'23 14:00 CEST: go/no-go for tutorial on Mon 4 Dec'23
  • scenarios for performance benchmarks
    • performance data to collect
      • #files + data volume + bandwidth
      • timing for the command
    • software
      • GROMACS (# files?)
        • gmx --version
      • TensorFlow (~20sec startup?!)
        • python -c 'import tensorflow'
        • diff for x86_64 vs aarch64? (~1GB on aarch64, ~2GB on x86_64)
      • Python script with lots of imports as "extreme" example?
        • tensorflow, pandas, scipy, numpy, h5py, ...
    • CVMFS scenarios
    • comparison with common practice
      • compare with timings using software stack on local disk (like ext4) + GPFS/Lustre (@ HPC-UGent) at the end
    • other
      • loopback cache (offline nodes)
      • alien cache
    • impact of EESSI compat layer?
      • vs software installed on top of bare OS
      • try these experiments both with and without EESSI?
      • can we figure out how many files were pulled in for compat layer?
        • sudo cvmfs_talk -i pilot.eessi-hpc.org cache list

Sync meeting (2023-09-28)

Attending:

  • CernVM-FS: Laura
  • EESSI/MultiXscale: Alan, Lara, Kenneth

Notes

Clone this wiki locally