Prabod Rathnayaka prabod

## ddp_notes.md

      
              1 file
            
          
              26 forks
            
          
              18 comments
            
          
              188 stars
            
          
                TengdaHan
                / ddp_notes.md
            
            
              Last active
              July 22, 2024 17:55
            
              
                Multi-node-training on slurm with PyTorch
              
          
    Multi-node-training on slurm with PyTorch

What's this?


A simple note for how to start multi-node-training on slurm scheduler with PyTorch.
Useful especially when scheduler is too busy that you cannot get multiple GPUs allocated,
or you need more than 4 GPUs for a single job.
Requirement: Have to use PyTorch DistributedDataParallel(DDP) for this purpose.
Warning: might need to re-factor your own code.
Warning: might be secretly condemned by your colleagues because using too many GPUs.


## Attention.py

from keras import backend as K, initializers, regularizers, constraints
from keras.engine.topology import Layer


def dot_product(x, kernel):
    """
    Wrapper for dot product operation, in order to be compatible with both
    Theano and Tensorflow
    Args:

## k_shortest_paths.py
# -*- coding: utf-8 -*-
"""
A NetworkX based implementation of Yen's algorithm for computing K-shortest paths.
Yen's algorithm computes single-source K-shortest loopless paths for a
graph with non-negative edge cost. For more details, see:
http://networkx.github.io
http://en.m.wikipedia.org/wiki/Yen%27s_algorithm
"""
__author__ = 'Guilherme Maia <guilhermemm@gmail.com>'

	from keras import backend as K, initializers, regularizers, constraints
	from keras.engine.topology import Layer


	def dot_product(x, kernel):
	"""
	Wrapper for dot product operation, in order to be compatible with both
	Theano and Tensorflow
	Args:
	# -- coding: utf-8 --
	"""
	A NetworkX based implementation of Yen's algorithm for computing K-shortest paths.
	Yen's algorithm computes single-source K-shortest loopless paths for a
	graph with non-negative edge cost. For more details, see:
	http://networkx.github.io
	http://en.m.wikipedia.org/wiki/Yen%27s_algorithm
	"""
	__author__ = 'Guilherme Maia <guilhermemm@gmail.com>'