How to Read Comma Separated Values Using Fscanf for Matrix
NCL Dwelling house> Awarding examples> File IO || Data files for some examples
Example pages containing: tips | resources | functions/procedures
NCL: Reading CSV (comma-separated values) files
CSV files are ASCII files whose values are separated past commas or other separators (semicolons, spaces, etc).
One fashion to read these files is using a combination of these functions:
- asciiread
- str_get_field, str_sub_str,
- str_match_ic_regex, str_match_ic_regex, str_match_ind_regex, str_ind_match_ic_regex
- tointeger, tofloat, todouble
You tin get abroad with merely using asciiread if yous have a file of all numbers, and no mix of numbers and characters in any one field.
For examples of reading/writing other types of ASCII files, see:
- Writing CSV files
- Writing ASCII information
- Reading ASCII information
Sometimes there are multiple CSV files. Rather that looping over many files, it may be advantageous to create 1 CSV file contaning all the data. This is readily accomplished via the unix/linux cat control:
cat file01.csv file02.cvs .... >! FILE.csv or cat file*.csv .... >! FILE.csv
The '>!' are the *nix redirect operator ( > ) and the overwrite operator ( ! ).
csv_1.ncl: Shows how to read a unproblematic CSV file (example1.csv) that contains all integers.
asciiread is used to read the table every bit strings kickoff so we can become the number of rows and columns. The values are and then converted to integers using tointeger.
csv_2.ncl: Shows how to read a CSV file (example2.csv) that contains a mix of strings, integers, and fields.
asciiread is used to read the table equally strings and then str_get_field is used to read the desired fields. There's a mix of integer and bladder fields, so tointeger and tofloat are used to convert from strings to numeric values.
csv_3.ncl: Shows how to read a CSV file (example3.csv) that contains fields that are all enclosed in double quotes.
asciiread is used to read the table as strings, str_get_field is used to read the desired fields, and str_sub_str is used to remove all the double quotes.
csv_5.ncl: CSV files with blank cells are very common in "the real earth". The function str_split_csv makes it easy to correctly read CSV files with empty fields equally missing values. Notation: The tofloat returns an _Fillvalue=9.96921e+36. The script manually changes this to a 'nicer' _FillValue.
The input file, examination-with-missing.csv, contains:
168.0 ,157.ane 165.5 ,145.8 164.0 ,163.3 169.7 ,169.7 182.viii ,168.three 158.2 ,170.v 155.8 , 168.8 , 176.0 , 211.v ,200.5 214.v ,211.6 216.7 ,195.7 219.0 ,193.7 227.5 ,147.5 243.3 ,107.7 146.8 ,72.viii
The output would look similar:
(0) 168 157.1 (1) 165.5 145.eight (ii) 164 163.three (3) 169.7 169.7 (4) 182.8 168.3 (5) 158.ii 170.5 (6) 155.8 -9999 (vii) 168.8 -9999 (eight) 176 -9999 (9) 211.5 200.5 (ten) 214.v 211.vi (11) 216.vii 195.7 (12) 219 193.7 (13) 227.5 147.5 (fourteen) 243.iii 107.7 (fifteen) 146.8 72.8
csv_6.ncl: Shows how to read a CSV file (479615.NorthDakota.csv) and extract all strings with a user specified string using str_match_ic_regex . Write the selected data to an ascii file via asciiwrite.
The original code was posted by Karin Meier-Fleischer (DKRZ) in response to an ncl-talk question.
csv_7.ncl: Read the CSV files (479615.NorthDakota.csv) and (479615.latlon.csv) and excerpt all strings with a user specified appointment ('yyyymm') string using str_match_ic_regex . The 2nd ascii file is read for the latitude and longitudes of the locations. Write the selected data to an ascii file via asciiwrite.
Plot the random stations on a map for yyyymm. This csv file only has x stations with data. Hence, the graphics are a bit crude.
csv_9.ncl: Shows how to read a CSV file (tAL.csv) which contains daily data from fourteen stations concatenated together. A sample:
"StationID","Year","Month","Twenty-four hours","Julian Twenty-four hour period","Precip","Lat","Long" 11084,1950,1,1,2433284.195625,0,31.0581,-87.0547 <=== initial station ID ..... 11084,2011,12,31,2455928.79375,0,31.0581,-87.0547 12813,1950,1,one,2433284.195625,0,30.5467,-87.8808 <=== new station ID ..... 12813,2011,12,31,2455928.79375,0.0508,30.5467,-87.8808 13160,1950,one,1,2433284.195625,0,32.8347,-88.1342 <=== new station ID .....
readAsciiTable is used to input the information. NCL'southward ind office is used to select data blocks associated with each station. For demonstration, a simple procedure (could be a function) is used to calculate a few simple statistcs.
ascii_delim_new.ncl: Shows how to read a CSV file (asc5.txt) that contains header information, and employ this data to write the data to a NetCDF file.
The script is rather lengthy because it does some error checking of types.
In guild to write fields to a netCDF file, the netCDF field (variable) names cannot comprise any tabs or spaces. Hence this script removes white spaces from the get-go and finish of whatsoever field names and converts other white infinite to underscores ('_'). String or graphic symbol values for the fields themselves are not modified.
If y'all desire to use this script for your own purposes, you will need to modify the script to bespeak 1) the input ASCII file name, ii) the number of fields, 3) the delimiter, 4) the type of each field, and 5) whether the field contains missing values.
To change either one for your own data file, first search for the lines:
;============================================================ ; Main lawmaking ;============================================================
The lines you need to modify follow shortly:
filename = "asc5.txt" ; ASCII file to read. nfields = 6 ; # of fields delimiter = "," ; field delimiter var_types = new(nfields,string) var_msg = new(nfields,string) var_strlens = new(nfields,integer) ; var to hold string lengths, ; just in case. . . . var_msg = "" ; Default to no missing var_msg(3) = "-999" ; Corresponds to field #4 var_types = "integer" ; Default to integer var_types(1:two) = "float" ; 2nd and third fields var_types(4) = "character" ; Corresponds to field #five
Change "var_types" to whatever the types of your fields are, and "var_msg" to what the missing value should be (an empty string indicates no missing value).
The in a higher place lawmaking is defaulting all variable types to "integer", and then changing the 2d and 3rd fields to type "bladder" and the fifth field to type "graphic symbol" (which in this case is existence used as a character assortment). The only field that will contain a missing value is the quaternary field.
The allowable variable types are "integer", "float", "double", "string", or "grapheme". Annotation that if y'all read in a variable as a string, it won't go written to the netCDF file because only character arrays tin can be written to a netCDF file.
andersondayinexce1983.blogspot.com
Source: https://www.ncl.ucar.edu/Applications/read_csv.shtml
Post a Comment for "How to Read Comma Separated Values Using Fscanf for Matrix"