Wes McKinney is a software engineer at Cloudera. He is the creator of Python’s pandas library and the Ibis project, a committer to the Apache Parquet and Apache Arrow projects, and the author of the O'Reilly Media book, Python for Data Analysis. Previously, Wes was the founder and CEO of DataPad.
Next-generation Python Big Data Tooling, powered by Apache Arrow
Dealing with Data, Intermediate
The Python data stack has struggled to interoperate well with big data systems. Apache Arrow provides standard in-memory columnar data structures that will enable Python programmers to participate in big data problems in a more natural and performant way. This talk will discuss the Apache Arrow project itself and the state of the new tools being created to help Python work better with Apache Hadoop and Apache Spark.