- In the `inputSpec` of `ioConfig`, `inputFormat` must be changed from `"org.apache.hadoop.hive.ql.io.orc.OrcNewInputFormat"` to `"org.apache.orc.mapreduce.OrcInputFormat"`.
- The ‘contrib’ extension supported a `typeString` property, which provided the schema of the ORC file. The types in it had to be correct, but notably not the column names, which made column renaming possible. In the ‘core’ extension, column renaming can be achieved with `flattenSpec`. For example, if the actual schema of the file is `struct<_col0:string,_col1:string>`, preserving the existing Druid schema requires a `flattenSpec` with one `path` field per column, mapping each `_colN` column to the name that `typeString` previously assigned to it.
- The ‘contrib’ extension supported a `mapFieldNameFormat` property, which provided a way to specify a dimension to flatten `OrcMap` columns with primitive types. This functionality has also been replaced with `flattenSpec`. For example, `"mapFieldNameFormat": "<PARENT>_<CHILD>"` for a dimension `nestedData_dim1` could, to preserve the Druid schema, be replaced with:
      "flattenSpec": {
        "fields": [
          {
            "type": "path",
            "name": "nestedData_dim1",
            "expr": "$.nestedData.dim1"
          }
        ]
      }
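
The column-renaming case from the `typeString` item above can be sketched the same way. This is a minimal, illustrative fragment rather than a complete spec: it assumes the file's actual schema is `struct<_col0:string,_col1:string>`, and the target names `time` and `name` are hypothetical placeholders for whatever names the old `typeString` assigned. The outer braces are there only to make the fragment valid JSON on its own; in a real spec the `flattenSpec` object sits inside the `parseSpec`.

```json
{
  "flattenSpec": {
    "fields": [
      {
        "type": "path",
        "name": "time",
        "expr": "$._col0"
      },
      {
        "type": "path",
        "name": "name",
        "expr": "$._col1"
      }
    ]
  }
}
```

Each `path` field reads the column selected by the JSONPath-style `expr` and exposes it under `name`, so the rest of the Druid schema (timestamp and dimension definitions) can keep referring to the old names.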