Yandex Data Proc
A service for processing multi-terabyte data arrays using such open-source tools as Apache Spark™, Apache Hadoop®, Apache HBase, Apache Hive, Apache Zeppelin,
and other Apache® ecosystem services.
and other Apache® ecosystem services.
Easy to use
Low cost
Full control of your cluster
AutoscalingPreview
Secure data storage
Workflow automation
We'll take care of most cluster maintenance
Independent control
Control on the Yandex.Cloud side
Questions and answers
What Apache® services are available in Yandex Data Proc?
Spark™, HDFS, YARN, Hive, HBase®, Oozie™, Sqoop™, Flume™, Tez®, and Zeppelin™.
Spark™, HDFS, YARN, Hive, HBase®, Oozie™, Sqoop™, Flume™, Tez®, and Zeppelin™.
Can anyone access my data?
Only you can manage access to your data using Yandex Resource Manager. Databases of different Yandex.Cloud customers are completely isolated from one another.
Only you can manage access to your data using Yandex Resource Manager. Databases of different Yandex.Cloud customers are completely isolated from one another.
Get started with Yandex Data Proc
Useful links
Apache, Apache Hadoop, Apache Spark, and Apache Oozie are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.