.TH HDFSCLI\-AVRO "1" "October 2021" "" "User Commands"
.SH NAME
hdfscli\-avro \- an Avro extension for HdfsCLI
.SH SYNOPSIS
.B hdfscli\-avro schema
.RB [ \-a\fR\ \fIALIAS ]
.RB [ \-v ...]
.I HDFS_PATH
.P
.B hdfscli\-avro read
.RB [ \-a\fR\ \fIALIAS ]
.RB [ \-v ...]
.RB [ \-F\fR\ \fIFREQ \ |\  \-n\fR\ \fINUM ]
.RB [ \-p\fR\ \fIPARTS ]
.I HDFS_PATH
.P
.B hdfscli\-avro write
.RB [ \-fa\fR\ \fIALIAS ]
.RB [ \-v ...]
.RB [ \-C\fR\ \fICODEC ]
.RB [ \-S\fR\ \fISCHEMA ]
.I HDFS_PATH
.P
.B hdfscli\-avro
.BR \-L \ |\  \-h
.SH OPTIONS
.SS COMMANDS
.TP
.B schema
Pretty-print the schema of an Avro file.
.TP
.B read
Read an Avro file from HDFS and write its records as JSON to standard output.
.TP
.B write
Read JSON records from standard input and serialize them into a single Avro file
on HDFS.
.SS ARGUMENTS
.TP
.I HDFS_PATH
Remote path to an Avro file or to a directory containing Avro part-files.
.SS OPTIONS
.TP
.BR \-C\fR\ \fICODEC \  \-\-codec=\fICODEC
Compression codec.
Must be one of
.BR null ,
.BR deflate ,
or
.BR snappy .
The default is
.BR deflate .
.TP
.BR \-F\fR\ \fIFREQ \  \-\-freq=\fIFREQ
Probability, between 0 and 1, with which each record is sampled.
.TP
.BR \-L \  \-\-log
Show the path to the current log file and exit.
.TP
.BR \-S\fR\ \fISCHEMA \  \-\-schema=\fISCHEMA
Schema for serializing records.
If omitted, the schema is inferred from the first record.
.TP
.BR \-a\fR\ \fIALIAS \  \-\-alias=\fIALIAS
Alias of the namenode to connect to.
.TP
.BR \-f \  \-\-force
Overwrite any existing file.
.TP
.BR \-h \  \-\-help
Show a usage message and exit.
.TP
.BR \-n\fR\ \fINUM \  \-\-num=\fINUM
Maximum number of records to output.
.TP
.BR \-p\fR\ \fIPARTS \  \-\-parts=\fIPARTS
Part-files to read.
Specify a single number to select that many part-files at random, or a
comma-separated list of numbers to read only those part-files.
Use a number followed by a comma (e.g.
.BR 1, )
to read exactly that one part-file.
The default is to read all part-files.
.TP
.BR \-v \  \-\-verbose
Enable log output.
Can be specified up to three times (increasing verbosity each time).
.SH EXAMPLES
.EX
.B hdfscli\-avro\ schema\ /data/impressions.avro
.EE
.EX
.B hdfscli\-avro\ read\ \-a\ dev\ snapshot.avro\ >snapshot.jsonl
.EE
.EX
.B hdfscli\-avro\ read\ \-F\ 0.1\ \-p\ 2,3\ clicks.avro
.EE
.EX
.B hdfscli\-avro\ write\ \-f\ positives.avro
.EE
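.P
As a further sketch, the commands below illustrate the three forms of
.B \-p
described under OPTIONS, followed by a pipeline that feeds the JSON output of
.B read
into
.BR write .
The paths
.I events
and
.I sample.avro
are hypothetical placeholders, not files shipped with the tool.
.EX
# Read three part-files chosen at random.
hdfscli\-avro read \-p 3 events
# Read only part-files 2 and 3.
hdfscli\-avro read \-p 2,3 events
# Read exactly part-file 1 (note the trailing comma).
hdfscli\-avro read \-p 1, events
# Copy at most 100 records into a new snappy-compressed file.
hdfscli\-avro read \-n 100 events | hdfscli\-avro write \-f \-C snappy sample.avro
.EE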
.SH "SEE\ ALSO"
.BR hdfscli (1)