Skip to main content

FAISS: Facebook Just Released A New Open Source Library For Clustering Big Data


facebook open source
FAISS is an open-source library released by Facebook for similarity search and clustering high-dimensional data. This library finds application in complex datasets like images and videos which could not fit in RAM all at once.

With the advent of highly successful Machine Learning methods, there has been a boom in big datasets across varied domains. With these huge datasets, hardware becomes a bottleneck. Processing these datasets requires high memory bandwidth and processor capabilities. Furthermore, indexing the data points, clustering and search become highly demanding.
Researchers at Facebook AI Research or FAIR recently published a research paper describing an efficient design for clustering and similarity search. Their new algorithmic structure performs much faster than the previous state-of-art algorithms and utilises GPU for higher memory bandwidth and computational throughput.
Based on their research, they have created a library called FAISS and open-sourced it. Although the algorithms for clustering and similarity search are well-known, this library optimizes those algorithms to perform efficiently on GPUs. Some the algorithms implemented in the library include –
  • Fast K-Nearest Neighbour
  • QuickSelect
  • Warpselect
  • K-Means clustering
As a test of how the library performs, in the following figure, the first and the last image are given and the algorithm computes the intermediate transitional images from a collection of 95 million images.
FAISS Similarity Search

Top Features of FAISS Library –

  • Written in C++ with complete Python wrappers
  • Supports single/multiple GPUs
  • Highly Scalable, typically supports up to 100 dimensions
  • Built on BLAS and CUDA libraries
  • 8.5x faster performance than current state-of-art libraries
Here is the GitHub repo of the FAISS library. So what do you think about the new library? Share your thoughts with us in comments.

Comments

Follow Us

WHAT'S HOT

Best Gaming Linux Distros You Need To Try In 2017

Gaming on Linux scene is improving each year with better hardware support and increasing support from game developers. Apart from established distros like Ubuntu and Arch Linux, gamers are using gaming Linux distros like Steam OS to get a better experience. The other It’s gaming operating systems are Sparky Linux – Gameover

Create TOR’s Hidden .Onion URL Of Any Site With Free EOTK Tool

Want to create Tor’s hidden .onion URL of a website? A free tool named Enterprise Onion Toolkit (EOTK) can help you out with this process by creating a man-in-the-middle proxy between the website and users. It should be noted that his tool only adds a .onion URL for anonymous users; it doesn’t hide your server behind the Tor network.

No Network? This Tiny Device Helps Send Texts, Location From Your Smartphone

G oing out of cellular network coverage area is one of the many problems faced by trekkers, mountaineers, and off-roaders. Sometimes, the situation gets worse when the members of a group separate in an unknown location.