Getting Started

Welcome to getting started with Hydra! Hydra is a fault-tolerant, reactive streaming system capable of handling data ingestion, persistence, and replication in an efficient and scalable manner. Although there are many systems that can provide you with these types of features, Hydra aims to provide an extra layer of abstraction on top of existing messaging or streaming systems, and provide a unified way to both store and stream data. For laymen, Hydra can help you:

  • Store any form of data with consistent and guaranteed delivery
  • Validate the structure of your data
  • Attach useful metadata to your messages (timestamps are your friend!)
  • Stream data to various destinations, ranging from HDFS or S3 to Postgres or Hive (Postgres currently supported)
  • Much, much more!

Now we'll walk you through the various use cases of Hydra and provide links for further reading, troubleshooting, tutorials, and FAQs.

Where to go from here

What do you need to do?

If you're a developer, we recommend starting with our System Overview to get an idea of how Hydra is built and to learn more about some of the motivations behind the system, and some of the technologies we leverage (Scala, Akka, and Kafka, to name a few). If you want to get up and running as quickly as possible, check out our Hydra tutorials:

Need to learn how to send data to Hydra? Start with our tutorial.

Want to find out how to stream data out of Hydra? Start with our streaming tutorial.

What Else?

If you've already followed the tutorials, check out our API Documentation for more information about configuration options when using Hydra. If you're having trouble working with Hydra or need assistance troubleshooting, please refer to our FAQs to see if it helps with any of your problems!