Joined: 06 Jun 2008 Posts: 8701 Location: Dubuque, Iowa, USA
Quote:
Confusing ones are (UTF-8, UTF-16, UCS-2, Open (O), Only (J), Either (E), Graphic (G)).
I'm not sure what is confusing you -- the site you referenced states
Quote:
The data types J (only), E (either), O (open), and G (graphic) support DDS database files that use DBCS. The G (graphic) data type also supports DDS database files that use UCS-2 or UTF-16. The A (character) data type also supports database files that use UTF-8.
and since UTF-8, UTF-16, UCS-2 are forms of Unicode (which Java uses natively), you shouldn't have much of a conversion issue with them. The conversion table in the site you referenced tells you how to convert each type you are confused about.
Your group may have to undergo significant and expensive conversion processing to get the data from AS400 formats to something HIVE will recognize and deal with. The costs and time for this conversion should have been considered in the initial project planning.