المنتديات

Artificial Intelligence - Theory & Practice‏

Jorge Alberto Hernández C.‏

Deep learning with GPU‏

AI, Deep Learning & Neural Networks‏

Cognitive Computing & AI in Health/care‏

Maged N. Kamel Boulos‏

Robot Apocalypse Engineer‏

Michael Rainey‏

Disruptive Technologies of the 21st Century‏

Raja Mitra‏

Gideon Rosenblatt‏

Walter Di Carlo‏

Data Science & BI praktisch‏

Evert-Jan van Doorn‏

Big data, Data Science, AI & x-computing‏

Compiled Inteligence‏

Singularity, Transhumanism, & AI‏

Cristian Lorenzutti‏

AI and machine learning‏

Peter Speckmayer‏

Artificial Intelligence & Quantum Computing‏

Guillermo González de Garibay‏

Machine Learning and Artificial Intelligence‏

José Carlos Méndez de la Torre‏

Artificial Intelligence and Machine Learning‏

Machine Learning (AI): deep learning.‏

The World Of Software‏

Mahesh Paolini-Subramanya‏

IT / Data Mining / Data Science‏

Franz Graf‏

Disruptive Techs - AI, IoT, Cloud, AR & VR‏

Digital CRM‏

Big Data, Machine Learning and Visualization‏

AI-DeepLearning-MachineLearning‏

Neuroscience News‏

Analytics and Data Science‏

Mats Nilson‏

Modern Computing and Digital Transformation‏

Machine Learning ~ Artificial Intelligence‏

SEO‏

Cleaning Browser Tabs‏

Frank Nestel‏

Artificial Neural Networks/Machine Learning‏

Rakesh Warier‏

Artificial Intelligence & Machine Learning‏

Erik Jonker‏

AI and Machine Learning Project‏

AI and Deep Learning‏

Нейроинформатика, Автоматизация, Лингвистика‏

Ioannis Kourouklides‏

Machine Intelligence‏

Jarek Wilkiewicz‏

Robos & Intelligent Machines (AI, CogSci etc.)‏

Hilmar Hoffmann‏

Digital Marketing Agency‏

Md Soleman‏

Big Data and Machine Learning‏

Videos‏

Srinivasan Narayanan‏

Nazmi Asri‏

Artificial Intelligence & Machine Learning‏

Mansoor Ahmed‏

Deep Learning & Artificial Intelligence‏

Accelerating Technology‏

Deep Learning and Big Brain‏

Yinka Makanjuola‏

Useful codes and libraries‏

Arash Moaddel‏

Data Science && Machine Learning && Big Data‏

تحتوي المشاركة على مرفق

Derek Christensen

عام

1 من الأيام

College of Engineering‏

Derek Christensen on LinkedIn: "Team used #objectdetection Deep Learning #imageanalysis to manually annotate a training set of images & trained a Convolutional #neuralnetwork to tag insulators using inference & classify them as damaged or not. Used two #deeplearning platforms: Py-Faster-RCNN & #pytorch. https://lnkd.in/gMM8_C4 Thank you to: Black & Veatch, K-State Polytechnic, & Dr. William Hsu KDD Lab Team of https://lnkd.in/gDjNRK9"

linkedin.com

إضافة تعليق...

تحتوي المشاركة على مرفق

My Java

المالك

MyJava.in Discussion

2 من الأيام

PyTorch : An open source deep learning platform that provides a seamless path from research prototyping to production deployment.

#openSource #github #pytorch #neuralNetwork #autograd #gpu #numpy #deepLearning #tensor #python

pytorch/pytorch

github.com

تحتوي المشاركة على مرفق

My Java

المالك

MyJava.in Discussion

2 من الأيام

BigDL: Distributed Deep Learning Library for Apache Spark

#DeepLearning #Library #ApacheSpark #github #intelAnalytics #BigDL #bigData #hadoop #python #scala #keras #ai

intel-analytics/BigDL

github.com

تحتوي المشاركة على مرفق

My Java

المالك

MyJava.in Discussion

2 من الأيام

caffe2 : A New Lightweight, Modular, and Scalable Deep Learning Framework

#caffe2 #DeepLearning #Framework #machineLearning #ai #artificialintelligence #neuralNetworks #fbOpenSource #facebookarchive #pytorch #Tensor

Caffe2 Tutorials Overview

caffe2.ai

تحتوي المشاركة على مرفق

HGPU group

Deep learning with GPU

4 من الساعات

Analyzing GPU Tensor Core Potential for Fast Reductions

(Roberto Carrasco, Raimundo Vega, Cristóbal A. Navarro)

#CUDA #DeepLearning #DL #Performance

The Nvidia GPU architecture has introduced new computing elements such as the tensor cores, which are special processing units dedicated to perform fast matrix-multiply-accumulate (MMA) operations and accelerate Deep Learning applications. In this work we present the idea of using tensor cores for a different purpose such as the parallel arithmetic reduction problem, and propose a new GPU tensor-core based algorithm as well as analyze its potential performance benefits in comparison to a traditional GPU-based one. The proposed method, encodes the reduction of n numbers as a set of m×m MMA tensor-core operations (for Nvidia’s Volta architecture m=16) and takes advantage from the fact that each MMA operation takes just one GPU cycle. When analyzing the cost under a simplified GPU computing model, the result is that the new algorithm manages to reduce a problem of n numbers in T(n)=5*log_m^2(n) steps with a speedup of S=4/5*log_2(m^2).

https://hgpu.org/?p=18796

Analyzing GPU Tensor Core Potential for Fast Reductions

hgpu.org

إضافة تعليق...

تحتوي المشاركة على مرفق

Duc Haba

عام

4 من الأيام

The “Demystify Deep Learning for Executives” video part 2, “The ROI of Deep Learning,” is released. #deeplearning #video #artificialintelligence Enjoy. https://youtu.be/iJed_4h_XDA

إضافة تعليق...

تحتوي المشاركة على مرفق

HGPU group

Deep learning with GPU

4 من الساعات

TensorFlow Doing HPC

(Steven W. D. Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter)

#CUDA #TensorFlow #HPC #DeepLearning #DL #Package

TensorFlow is a popular emerging open-source programming framework supporting the execution of distributed applications on heterogeneous hardware. While TensorFlow has been initially designed for developing Machine Learning (ML) applications, in fact TensorFlow aims at supporting the development of a much broader range of application kinds that are outside the ML domain and can possibly include HPC applications. However, very few experiments have been conducted to evaluate TensorFlow performance when running HPC workloads on supercomputers. This work addresses this lack by designing four traditional HPC benchmark applications: STREAM, matrix-matrix multiply, Conjugate Gradient (CG) solver and Fast Fourier Transform (FFT). We analyze their performance on two supercomputers with accelerators and evaluate the potential of TensorFlow for developing HPC applications. Our tests show that TensorFlow can fully take advantage of high performance networks and accelerators on supercomputers. Running our TensorFlow STREAM benchmark, we obtain over 50% of theoretical communication bandwidth on our testing platform. We find an approximately 2x, 1.7x and 1.8x performance improvement when increasing the number of GPUs from two to four in the matrix-matrix multiply, CG and FFT applications respectively. All our performance results demonstrate that TensorFlow has high potential of emerging also as HPC programming framework for heterogeneous supercomputers.

https://hgpu.org/?p=18795