Always know what to expect from your data.
What is Great Expectations?¶
Great Expectations is a framework that helps teams save time and promote analytic integrity with a new twist on automated testing: pipeline tests. Pipeline tests are applied to data (instead of code) and at batch time (instead of compile or deploy time).
Software developers have long known that automated testing is essential for managing complex codebases. Great Expectations brings the same discipline, confidence, and acceleration to data science and engineering teams.
Why would I use Great Expectations?¶
To get more done with data, faster. Teams use Great Expectations to
- Save time during data cleaning and munging.
- Accelerate ETL and data normalization.
- Streamline analyst-to-engineer handoffs.
- Monitor data quality in production data pipelines and data products.
- Simplify debugging data pipelines if (when) they break.
- Codify assumptions used to build models when sharing with distributed teams or other analysts.
See Workflow advantages to learn more about how Great Expectations speeds up data teams.
How do I get started?¶
It’s easy! Just use pip install:
$ pip install great_expectations
You can also clone the repository, which includes examples of using great_expectations.
$ git clone https://github.com/great-expectations/great_expectations.git $ pip install great_expectations/
How do I learn more?¶
For full documentation, visit [Great Expectations on readthedocs.io](http://great-expectations.readthedocs.io/en/latest/).
[Down with Pipeline Debt!](https://medium.com/@expectgreatdata/down-with-pipeline-debt-introducing-great-expectations-862ddc46782a) explains the core philosophy behind Great Expectations. Please give it a read, and clap, follow, and share while you’re at it.
For quick, hands-on introductions to Great Expectations’ key features, check out our walkthrough videos:
- [Introduction to Great Expectations](https://www.useloom.com/share/3eb1d429823744288c99ea26e2c4d443)
- [Using Distributional Expectations](https://www.useloom.com/share/c74b3e9c8dd349e9b8c4aa230cc4bedc)
What’s the best way to get in touch with the Great Expectations team?¶
[Issues on GitHub](https://github.com/great-expectations/great_expectations/issues). If you have questions, comments, feature requests, etc., [opening an issue](https://github.com/great-expectations/great_expectations/issues/new) is definitely the best path forward.
Great Expectations doesn’t do X. Is it right for my use case?¶
It depends. If you have needs that the library doesn’t meet yet, please [upvote an existing issue(s)](https://github.com/great-expectations/great_expectations/issues) or [open a new issue](https://github.com/great-expectations/great_expectations/issues/new) and we’ll see what we can do. Great Expectations is under active development, so your use case might be supported soon.