ODI - Hive and Complex JSON

Time for a nested JSON example using Hive external tables. ODI treats nested complex types in Hive the same way it treats types in other technologies such as Oracle, the type name is captured, not the definition - you can see XMLType or SDO_GEOMETRY as an example within the ODI Oracle technology. The Hive technology in ODI also has the ARRAY, STRUCT and MAP types seeded. For the nested JSON example I will use the example define in this JSON SerDe page here.

The external table definition is below, I have defined this in Hive and reverse engineered into ODI just like the previous post. Note it is using a different SerDe from the post here, when I tried using that Google SerDe the data was not projected properly (all data was projected as null...so beware of components used). Just like the previous post we need to add the jar whenever it is used (plus don't forget the ODI_HIVE_SESSION_JARS), either from Hive or ODI;

ADD JAR /home/oracle/json/hive-serde-1.0.jar;
CREATE EXTERNAL TABLE message (
messageid string,
messagesize int,
sender string,
recipients array<string>,
messageparts array<struct<
extension: string,
size: int
>>,
headers map<string,string>
)
ROW FORMAT SERDE 'com.proofpoint.hive.serde.JsonSerde'
LOCATION '/user/oracle/json_complex';

This external table has ARRAY fields, STRUCT fields and MAP fields, so we are going above and beyond simple types. The data I will use is the same as the referenced web page;

{
"messageId": "34dd0d3c-f53b-11e0-ac12-d3e782dff199",
"messageSize": 12345,
"sender": "alice@example.com",
"recipients": ["joe@example.com", "bob@example.com"],
"messageParts": [
{
"extension": "pdf",
"size": 4567
},
{
"extension": "jpg",
"size": 9451
}
],
"headers": {
"Received-SPF": "pass",
"X-Broadcast-Id": "9876"
}
}

Again, I will use the Hive RKM that I mentioned in the post here in order to reverse engineer the external table defined in Hive into ODI. Below you can see the table and how it is represented in ODI. You can see the recipients and messageparts columns are defined as ARRAY, and headers is a MAP.

We can view the data in ODI just like other tables - remember the external table is projected the JSON data from HDFS through Hive;

When the datastore is used in an interface these are the columns that are available in mapping, you can use any of the Hive functions or operators available on these complex types, plus constructors for building them. For example to retrieve the 1st element in the array the following Hive QL can be used;

select sender, recipients[0] from message;

You can build such expressions in ODI;

You just need to be aware of the datatypes you are using and the functions available. Haven't gone into complex SQL/Hive QL here, but you see the basic mechanics are pretty straightforward. One of the points that comes home here is the functionality level of the non-core pieces of Hadoop, so the first 2 JSON SerDe libraries I have used support different capabilities of JSON - so be aware.

ODI - Hive and Complex JSON

Trending Articles

Black Angus Grilled Artichokes

Obituaries for Friday, June 27, 2025

Raj Panchayat 3rd / Third Grade Teacher Revised Result 2012 Level 1-2...

Mace 6 FEN Loot 2.0 +60k pureped BO

Ilahi mera jee aaye/ Shaame Malang si Lyrics Translation

DD Kashir channel packaging bids invited by 29 june

HResult: 0x80240033 Context: uecGeneral Msg: The license terms of one or more...

spreading clines

Procedure for conduct of supplementary DPC

Srinagar Kitty’s brother dies at 67 due to Covid-19

Download: Mirraj Malifah – Chance Yako (Prod by_Bicko @Musiqhouse)

Born To Be Wild: Chicago Outfit Hit Squad Littered The Streets With Bodies...

A/L Technology Stream – Subject combinations, Syllabuses and Teacher guides

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Mp3 Download: Mdu - Nammer

libdevinfo を使ってネットワークインターフェイスデバイスの一覧を取得する

TAPERED BEAM DESIGN OUTPUT

Current scandal has roots in NPF saga

Re: How to fix error on printer HP Color LaserJet Pro MFP 3303 with event...

Re: My Sisters Plan For Me To Smell Her Feet (Fiction): Part 1,2,3 and 4!!!