Read UTF-8 file in Cobol

sathyajes · New User Joined: 02 Mar 2006 Posts: 35 Location: Chennai

I have input file which transferred to Mainframe using FTP from Windows. this is comma separated text file. while transferring, i have used FTP commands as (QUOTE SITE ENCODING=MBCS QUOTE SITE MBDATACONN=(UTF-8,UTF-8). in mainframe i have stored as variable block file

Input file from Windows
9992,王清, ID card,1123233

in mainframe, it showed input as below
Õ]×......Xþ»W½eVýþ.+/ÈÑ?>/%.ñà.Ä/ÊÀ...................

in hex format
FIN-OWNER-RECORD =Õ]×......Xþ»W½eVýþ.+/ÈÑ?>/%.ñà.Ä/ÊÀ...................
4CCD6DEDCD6DCCDDC47EBB333332E88EB8E88246766666244266762333333333333333333
069506655909536940EFBF80012C7EB6855DECE149FE1C09403124C220102196301073331

Could you please help me how to locate comma delimiter(,) in cobol, i was trying as below codes, none of them worked.
IF FIN-OWNER-RECORD(1:INDEX-VAL) = N","
IF FIN-OWNER-RECORD(3:INDEX-VAL) = X'2C'

sergeyken · Posted: Wed May 12, 2021 5:51 pm

To avoid a lot of headaches in the future, any (text) file transmitted from Windows environment to mainframe needs to be translated from ASCII to EBCDIC.
Otherwise endless series of tricks and gimmicks shall be required at each step while working with this dataset in mainframe environment. Your example is only the first problem in this way out of hundreds or thousands of further problems with this approach.

P.S.
First of all, learn how to use code tags when posting your code

Robert Sample · Posted: Wed May 12, 2021 7:43 pm

sathyajes · New User Joined: 02 Mar 2006 Posts: 35 Location: Chennai

Thanks for your reply

As suggested, i have moved file to mainframe as EBCDIC file

during FTP, i have used as below
QUOTE SITE MBDATACONN=(UTF-8,IBM-937)

now in mainframe, the file looks as below
80012,g..f8.e..,National ID card,220102196301073331

Actual file as below from windows
80012,王清华,National ID card,220102196301073331

In cobol program, i have used the below code to convert
MOVE FUNCTION NATIONAL-OF(WS-CONTACT-NM,937) TO
WS-CONTACT-NM-NA --> national type variable
MOVE FUNCTION DISPLAY-OF(WS-CONTACT-NM-NA,937) TO
WS-CONTACT-NM-UTF8

Still chinse characters are not inserted in table correctly, it is showing as below
gÚÚÚÚÚÚÚÚ

Please help on this

sergeyken · Posted: Thu May 13, 2021 8:48 pm

enrico-sorichetti · Posted: Thu May 13, 2021 9:50 pm

when people ask these question

I wonder how the application design was done
requirements VS architecture VS implementation
with a bit of prototyping to confirm the approach for critical items

or not done as in this case

since we know nothing about the application requirements and architecture
the proper answer IMO should be ...
talk to the application analysts to find out