Skip to content
forked from wesm/feather

Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow

License

Notifications You must be signed in to change notification settings

Intellifora/feather

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Feather: fast, interoperable data frame storage

Travis-CI Build Status Coverage Status

Feather is binary columnar serialization for data frames. It is designed to read and write data frames very efficiently, and to make it easy to share data across multiple data analysis languages. The initial version of Feather comes with bindings for python (written by Wes McKinney) and R (written by Hadley Wickham).

Feather uses the Apache Arrow columnar memory specification to representing binary data on disk in a way that can be read and written very rapidly. This is particularly important for encoding null/NA values and variable-length types like UTF8 strings. Feather is complementary to Apache Arrow because Arrow does not provide a file format, so Feather defines its own schemas and metadata for an on-disk representation.

Feather currently supports the following column types:

  • A wide range of numeric types (int8, int16, int32, int64, uint8, uint16, uint32, uint64, float, double).
  • Logical/boolean values.
  • Dates, times, and timestamps.
  • Factors/categorical variables that have fixed set of possible values.
  • UTF-8 encoded strings.
  • Arbitrary binary data.

All column types support NA/null values.

License and Copyrights

This library is released under the Apache License, Version 2.0.

See NOTICE for details about the library's copyright holders.

About

Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 70.6%
  • Python 13.7%
  • CMake 5.7%
  • R 4.5%
  • Shell 2.2%
  • C 1.9%
  • Perl 1.4%