MULTIVAC PLATFORM

the platform of platforms!

LARGE-SCALE SCIENTIFIC DATA AT YOUR FINGERTIPS

IT'S ALL ABOUT DATA!

Multivac is a data-centric platform developed at Institut des Systèmes Complexes de Paris Île-de-France. It loves data! It hosts more than 14 billion data and counting. These documents are being stored inside large-scale databases and search engine clusters.

Multivac Platform offers a wide range of topics to cover most social science use cases. Some of ISC-PIF initiations such as Climate, Risk and Politic has been built and powered by Multivac Platform.

Multivac Platform updates its datasets in real-time and simultaneously gives access to them in real-time. The importance of real-time streaming is near to essential in most of our scientific projects and we hope it makes the same difference to other scientists.

Global Pulse Education

+2B

Global Pulse Employment

+2B

Music dataset

+1.5B

Geo-tagged dataset

+1B

Public stream dataset

+1B

IT logs dataset

+1B

Risk dataset

+320M

Scientific dataset

71M

Climate dataset

+80M

political dataset

+80M

News dataset

+50M

Wikipedia dataset

+12M

World beyond data

Multivac Platform is a platform of platforms! It offers scientific toolbox to dive into large-scale data for discovering and exploration. Platforms such as Data Science Lab, APIs engine and Scientific Dashboards.

MULTIVAC
DATA SCIENCE LAB

MULTIVAC Hadoop Cluster

We have designed and implemented Hadoop cluster over more than 30 servers inside our private Cloud. This gives us Hadoop components such as YARN, HDFS, Apache Spark, Apache Hive. etc.

Multivac DSL offers a large-scale Hadoop cluster with over 900 vCores, 1TB of memory and more than 100TB of distributed storage.

MULTIVAC Hadoop Notebooks

Multivac also offers interactive Hadoop notebooks by hosting multi-users/multi-tenants Apache Zeppelin and Hue.
Users can submit their codes and jobs over Multivac Hadoop Cluster by using Apache Spark interactive shell and Spark submit or Multivac hosted interactive notebooks in Scala, Java, Python, R and SQL.

MULTIVAC Hadoop Open Data

Multivac commits to make its important datasets to researchers and scientists over Multivac Hadoop Cluster in both format of RAW (JSON) and big SQL tables (Apache Hive). Users can run their jobs against Multivac Public Data easily!

Multivac

Interactive Spark Notebooks

WE’VE BUILT A WHOLE BUNCH OF STUFF TO BRING DATA SCIENCE AND BIG DATA HAND TO HAND RIGHT TO YOUR DOORSTEP.

MULTIVAC API ENGINE

Multivac Platform offers a complete set of REST APIs to communicate to its data repositories. This makes it easy to get out not only the raw data, but also the aggregated and processed results.

Multivac uses Swagger to design, build and document its RESTful APIs. Swagger is a powerful open source framework backed by a large ecosystem of tools. It also follows the Open API Initiative (OAI) to standardising on how Multivac REST APIs are described.

Take a look at the demo on the right and see how you can integrate Multivac APIs inside your code.

Multivac

API Engine

This is just an example of how to integrate Multivac Wiktionary API into your code!


var settings = {
  "async": true,
  "crossDomain": true,
  "url": "https://api.iscpif.fr/v2/pub/wikitionary/suggest?q=climate&lang=en&count=10",
  "method": "GET",
  "headers": {
    "cache-control": "no-cache"
  }
}
$.ajax(settings).done(function (response) {
  console.log(response);
});
                                    

$request = new HttpRequest();
$request->setUrl('https://api.iscpif.fr/v2/pub/wikitionary/suggest');
$request->setMethod(HTTP_METH_GET);

$request->setQueryData(array(
  'q' => 'climat',
  'lang' => 'en',
  'count' => '10'
));
$request->setHeaders(array(
  'cache-control' => 'no-cache'
));
try {
  $response = $request->send();

  echo $response->getBody();
} catch (HttpException $ex) {
  echo $ex;
}
                                    

import requests
url = "https://api.iscpif.fr/v2/pub/wikitionary/suggest"
querystring = {"q":"climat","lang":"en","count":"10"}
headers = {
    'cache-control': "no-cache"
    }
response = requests.request("GET", url, headers=headers, params=querystring)
print(response.text)
                                    

wget --quiet \
  --method GET \
  --header 'cache-control: no-cache' \
  --output-document \
  - 'https://api.iscpif.fr/v2/pub/wikitionary/suggest?q=climate&lang=en&count=10'
                                    

MULTIVAC IS A STATE OF THE ART PLATFORM, IT WAS CREATED WITH A GREAT VISION OF HOW BIG DATA CAN HELP AND ASSIST SCIENTISTS

MULTIVAC DASHBOARDS

Create beautiful visualisations by using Multivac Dashboards. It allows you to discover and explore your desired topics and subjects within 14 billion data!

In version 2.0 you will be able to save your dashboards, export the visualisations and the results.

Please use the little demo on your right to see what you can expect in near future! ;)

Note: Here we use Multivac suggestion API to build an autocomplete based on English Wiktionary dataset. It also visualises the very same query within Web of Science dataset with 52 million metadata.

Multivac

Dashboard


PROJECTS POWERED BY MULTIVAC PLATFORM

We are showcasing some of our scientific projects which were built and powered by using Multivac Platform. By the use of Multivac API Engine, these projects have access to both raw and aggregated data.

How to access

Who can access

- Limited access to ISC-PIF residents and partners. Multivac Platform is in beta!

What can and cannot do


CAN

- Access to Multivac Dashboards
- Access to Multivac API engine (secret token is required for private RESET APIs)
- Access to Multivac Hadoop cluster (only available to partners and residents)
- Access to Multivac Data for Hadoop (only available to partners and residents)

CANNOT

- Access to any raw data! You always get filtered, limited and aggregated results
- Direct access to any database, search engine or any other technology. Every request MUST go through Multivac API engine
- Please do not ask for any database dump nor the entire datasets! :-)



Access to Multivac Platform

Access to Multivac

request access to Multivac Platform

Request for Access

Core Team

Maziyar Panahi

Maziyar PANAHI

Project Manager
CNRS / ISC-PIF
David Chavalarias

David CHAVALARIAS

Scientific Manager
CNRS/EHESS, ISC-PIF/CAMS

Host Institutions

Sponsors



Community Users


Technology Stack