Ultra-Fast and Stable Computation for Optimization Problems

Mainly about research on large-scale optimization problems, graph search, machine learning, digital twins, and related topics

CUDA 9 Features Revealed

August 25, 2017, 00:28:32 | Weblog
I am already using the CUDA 9 release candidate (RC).

CUDA 9 Features Revealed

At the 2017 GPU Technology Conference NVIDIA announced CUDA 9, the latest version of CUDA’s powerful parallel computing platform and programming model. In this post I’ll provide an overview of the awesome new features of CUDA 9.

Support for the Volta GPU architecture, including the new Tesla V100 accelerator;
Cooperative Groups, a new programming model for managing groups of communicating threads;
A new API (preview feature) for programming Tensor Core matrix multiply and accumulate operations on Tesla V100;
Faster library routines for linear algebra, image processing, FFTs, and more;
New algorithms in cuSolver and nvGraph;
New NVIDIA Visual Profiler support for Volta V100 as well as improved Unified Memory profiling features;
Improved compiler performance;
Support for C++14 in CUDA device code;
Expanded developer platform and host compiler support including Microsoft Visual Studio 2017, clang 3.9, PGI 17.1 and GCC 6.x.