Training Deeper Models by GPU Memory Optimization on TensorFlow

@inproceedings{Meng2017TrainingDM,
  title={Training Deeper Models by GPU Memory Optimization on TensorFlow},
  author={Chen Jin Meng and Minmin Sun and Jun Yang and Minghui Qiu and Yang Gu},
  year={2017}
}
With the advent of big data, readily available GPGPUs, and progress in neural network modeling techniques, training deep learning models on GPUs has become a popular choice. However, due to the inherent complexity of deep learning models and the limited memory resources on modern GPUs, training deep models is still a nontrivial task, especially when the model size is…