Machine Learning

Machine Learning Frameworks Comparison

In this post we compare popular machine learning frameworks like TensorFlow, Theano, Torch, Caffe, CNTK, MXnet, and more.

9 years ago • 4 min read

By Maciej

We need your help!

We're looking for content writers, hobbyists and researchers with a focus on Machine Learning to help build-out our community. Email hello@paperspace.com with a writing sample and tutorial ideas

When taking the deep-dive into Machine Learning (ML), choosing a framework can be daunting. You've probably heard the many names/acronyms that make-up the constellation of frameworks, toolkits, libraries, data sets, applications etc. but may be curious about how they differ, where they fall short and which ones are worth investing in. Since the field and surrounding technologies are relatively new, the most common concern amongst new users is understanding which of these frameworks has the most momentum. This article summarizes their major differences and attempts to contextualize them within a broader landscape.

TensorFlow was developed by the Google Brain Team for conducting research in machine learning and deep neural networks. Google recently moved away from Torch to TensorFlow which was a blow to other frameworks -- Torch and Theano in particular. Many describe TensorFlow as a more modern version of Theano after many important lessons about this new field/technology were learned over the years.

TensorFlow is relatively painless to setup and offers tutorials aimed at beginners that cover the theoretical underpinnings and practical application of neural networks. TensorFlow is slower than Theano and Torch but this is currently being addressed head on by Google and the open source community. TensorBoard is TensorFlow's visualization module which provides an intuitive view of your computation pipeline. Keras, a deep-learning library, was recently ported to run on TensorFlow which means any model written in Keras can now run on TensorFlow. Finally, it's worth mentioning that TensorFlow can run on a wide variety of hardware.

GPU acceleration: Yes
Languages/interfaces: Python, Numpy, C++
Platform: Cross platform
Maintainer: Google

Theano originated in 2007 at the University of Montreal at the widely renowned Institute for Learning Algorithms. Theano is powerful, extremely fast and flexible but is generally regarded as a low-level framework (eg error messages are known to be especially cryptic/unhelpful). As such, raw Theano is more of a research platform and ecosystem than a deep learning library. It is often used as an underlying platform for higher-level abstraction libraries that provide simple API wrappers into Theano. Some of the more popular libraries include Keras, Lasagne and Blocks. One of the downsides of Theano is that multi-GPU support still requires a workaround.

GPU acceleration: Yes
Languages/interfaces: Python, Numpy
Platform: Linux, Mac OS X and Windows
Maintainer: MILA lab at University of Montreal

Of all the common frameworks, Torch is probably the easiest to get up and running, especially if you are using Ubuntu. Originally developed at NYU in 2002, Torch is widely used by large tech companies like Facebook and Twitter and is also backed by NVIDIA. Torch is written in the scripting language called Lua which is easy to read but not nearly as common as languages like Python. Helpful error messages, a plethora of sample code/tutorials and the simplicity of Lua make Torch a great place to start.

GPU acceleration: Yes
Languages/interfaces: Lua
Platform: Linux, Android, Mac OS X, iOS and Windows
Maintainer: Ronan, Clément, Koray and Soumith

Caffe was developed for image classification/machine-vision leveraging Convolutional Neural Networks (CNNs). Caffe is perhaps best known for Model Zoo, a set of pre-trained models which you can use without writing any code.

Caffe is targeted towards those building applications while Torch and Theano are tailored for research. Caffe is not intended for non-computer vision deep-learning applications such as text, sound or time series data. Caffe can run on a variety of hardware and switching between CPU and GPU is set with a single flag. Caffe is slower than Theano and Torch.

GPU acceleration: Yes
Languages/interfaces: C, C++, Python, MATLAB, CLI
Platform: Ubuntu, Mac OS X, experimental Windows support
Maintainer: BVLC

Microsoft Cognitive Toolkit, also known as CNTK, is Microsoft’s open-source deep-learning framework. CNTK is better known in the speech community than in the general deep learning community though CNTK can be used for image and text training as well. CNTK supports a wide variety of algorithms like Feed Forward, CNN, RNN, LSTM, and Sequence-to-Sequence. It runs on many different hardware types including multiple GPUs.

GPU acceleration: Yes
Languages/interfaces: Python, C++, C# and CLI
Platform: Windows, Linux
Maintainer: Microsoft Research

Additional Frameworks

There are several other deep learning frameworks, including MXnet, Chainer, BidMach, Brainstorm, Kaldi, MatConvNet, MaxDNN, Deeplearning4j, Keras, Lasagne, Leaf, and others.

A comparison of GitHub activity:

public

Bash on Windows 10

public

Blog

Docs

Community

ML Showcase

Professional Services

Talk to an Expert

Bash on Windows 10

Windows 10

Additional Frameworks

Spread the word

Bash on Windows 10

Windows 10

Keep reading

Kolmogorov-Arnold Networks (KAN): Promising Alternative to Multi-Layer Perceptron?

Predictive Analysis for Sales: A Comprehensive Forecasting Approach 📈🕵🏼‍♂️👨🏼‍💻

Encoding Categorical Data with One-hot Encoding

Subscribe to our newsletter

Solutions

Product

Resources

Company