Upload and replace newline characters from data file

Rafael Longo · Posted: Wed Aug 29, 2007 12:22 am

I know that the output dataset (after the conversion on the mainframe) is perfect because I used windows FTP to "get" the file from mainframe and the sequence is ok (0D 0A 0D 0A).

dick scherrer · Posted: Wed Aug 29, 2007 1:11 am

Hello,

Can you run the SELECT * query from a green-screen (3270) rather than a windows client?

Can you "unload" a few selected rows (ones with the embedded cr/lf) from teradata and see if the problem character is present?

Rafael Longo · Posted: Wed Aug 29, 2007 1:19 am

Hi, Dick...

Sorry, but I did not understand...

What do you mean by green-screen(3270)?

And what do you mean by "unload"? If you mean, exporting the data, I already did and, yes, the problem character is present...

It's driving me crazy...

dick scherrer · Posted: Wed Aug 29, 2007 1:38 am

Hi Rafael,

By green-screen, i mean a terminal that is used for TSO or CICS rather than a custom windows gui font-end application (in the "old days", the only device most people saw was a physical 3270 terminal which typically was a dark background with green characters). These days, a 3270 terminal emulator is how most people connect to the mainframe.

Whatever you use to connect to TSO or CICS should work as long as you can enter a query from that "terminal". Is there a way to run the SELECT query in batch and view the result in the output queue?

I suspect that my "unload" and your "export" are similar. Does teradata have the ability to export individual fields rather than whole rows/records?

Is there a way to hex-dump a row/record from teradata?

I'm trying to isloate when the "bad" character is introduced - during the load into teradata or just on the retrieval via the windows process.

Rafael Longo · Posted: Wed Aug 29, 2007 1:46 am

Ok.. understood.

Yes... I can run the query via Rumba/TSO/ISPF which I use to access mainframe. I will do that and let you know...

About the column exporting, yes, I did export it and the D5s are really there.

dick scherrer · Posted: Wed Aug 29, 2007 2:44 am

Hi Rafael,

Sounds like the teradata load is adding the extra character when it encounters 2 sets of cr/lf - might even be a "feature"..

You might try to "edit" the file before running the load and replacing the cr/lf's with spaces.

For a simple test, you could use the tso/ispf editor. For longterm (if it works), you could replace them with yojr sort product.

Rafael Longo · Posted: Wed Aug 29, 2007 7:41 pm

Good morning!

Dick,

I ran the SQL on the mainframe... The D5's are there...

Now I want to keep the cr/lfs... Even if I need to run an update in the database to eliminate the D5s... I don't want to do that though.. Very poor solution...

Any suggestions anyone?

dick scherrer · Posted: Wed Aug 29, 2007 8:12 pm

Hi Rafael,

You might try a modified version of my previous thought - edit the file replacing the 3-character cr/lf/N with cr/lf/space before loading. . . This would preserve the cr/lf but remove the unwanted N.

If that won't do what is needed let us know and we'll think some more

Rafael Longo · Posted: Wed Aug 29, 2007 8:20 pm

Hi, Dick...

Thanks for the support..

I think you misunderstood.. in the file, before the loading, I have only CR/LF. The N, or Õ, appear only on the data stored in teradata.

Let me know if I was the one who misunderstood your suggestion...

dick scherrer · Posted: Wed Aug 29, 2007 8:28 pm

My bad

Got ahead of myself.

Do you have teradata documentation? Is it possible that when cr/lf appears consecutively, the additional character is automatically inserted?

If you have support, you might open an issue with their support and see if this has happened to other teradata users.

If the data to be loaded has multiple consecutive sets of cr/lf, would changing the data to only contain 1 cr/lf work for your database data or are all of the cr/lf's needed?

Otherwise, your update within the database may be a way out.

Rafael Longo · Posted: Wed Aug 29, 2007 9:21 pm

Dick,

I just looked closer in the dataset on mainframe (Found out how to check the hex dump without ftping the data set to windows..

) and turns out that I did not describe this issue to you properly...

I am very very sorry for my mistake... Here is what really happens:

The original cr/lfs, which after the binary transfer and ebcdic conversion should be x'0d0a' in the converted dataset, are actually x'0d15'. Just to illustrate, look at the end of each record below:

Rafael Longo · Posted: Wed Aug 29, 2007 9:31 pm

From IBM documentation:

dick scherrer · Posted: Wed Aug 29, 2007 9:36 pm

Not to worry

Sometimes it takes a while to sort thru

Can you move forward with the new understanding or might something still need attention?

Rafael Longo · Posted: Wed Aug 29, 2007 9:43 pm

Now, I am looking for a way to make these x'0d15' become x'0d0a'. If you have any info to guide me, please let me know.

Another thing to think about: Look at my previous post. The CRLFs at the end of each record after the OCOPY conversion are x'0d15'. Shouldn't they be x'15' considering that originally they were x'0d0a'?

stodolas · Posted: Wed Aug 29, 2007 10:40 pm

Can't you just trim the last 2 bytes? They aren't really part of the data anyway.

Rafael Longo · Posted: Wed Aug 29, 2007 11:15 pm

Hi, stodolas,

Sure can...

But I posted the last two bytes just to illustrate... The same thing happens with all CRLF in the record. And there are others in the middle (Without them, this topic wouldn`t even exist...

).

dick scherrer · Posted: Wed Aug 29, 2007 11:49 pm

Hi Rafael,

Rafael Longo · Posted: Thu Aug 30, 2007 12:32 am

dick scherrer · Posted: Thu Aug 30, 2007 12:57 am

Hi Rafael,

Rafael Longo · Posted: Thu Aug 30, 2007 2:39 am

Guys,

One last thought before I go home and rest my mind...

What about specifying a custom translation table on the OCOPY step (instead of BPXFX311) that says "Wherever you find a x'0a' keep it as x'0a'!".

Is that possible?

I don't know how to create and specify a custom translation table... Can you guys show me how?

Talk to you guys in the morning...

Rafael

dick scherrer · Posted: Thu Aug 30, 2007 9:05 pm

Hi Rafael,

Good morning

Good idea asking about the custom translation table in a new topic - i looked around some last night, but did not find anything useful.

If a custom translation table does not happen, you should be able to do what you need with your sort product. . .

Rafael Longo · Posted: Thu Aug 30, 2007 9:19 pm

Thanks for all your time and help, Dick...

Once I find a solution, I shall get back to you on this...

Regards,
Rafael

Rafael Longo · Posted: Mon Sep 03, 2007 8:12 pm

Hello,

Looks like EBCDIC's NL (0x15) do not have a ASCII equivalent. Teradata SQL Assistant client (which I run from windows) translates the 0x15 to 0xD5 (Don't ask me why this conversion was chosen).

So, the problem occurs only when I issue the query through windows. When I issue it from Mainframe (Green-screen, right Dick?

), I get the 0x15's back (I know I said that I got the D5's from mainframe too, but I did something wrong before...).

The thing is that I changed the "Session character set" from ASCII to UTF8 in the ODBC driver that Teradata SQL assistant uses to log and the the 0xD5's started to display correctly... as a Line wrapping.

So, it looks that the issue was just a diplaying problem on the particular client that I was using on windows...

And my problem is solved! The binary upload and conversion through OCOPY did it...

Thanks guys.

dick scherrer · Posted: Mon Sep 03, 2007 10:59 pm

Cool

Good to hear it is working.