astropy:docs
  • Index
  • Modules

Navigation

  • next »
  • « previous |
  • Astropy v5.2.1 »
  • ASCII Tables (astropy.io.ascii) »
  • Fixed-Width Gallery

Fixed-Width Gallery¶

Fixed-width tables are those where each column has the same width for every row in the table. This is commonly used to make tables easy to read for humans or Fortran codes. It also reduces issues with quoting and special characters, for example:

Col1   Col2    Col3 Col4
---- --------- ---- ----
 1.2   "hello"    1    a
 2.4 's worlds    2    2

There are a number of common variations in the formatting of fixed-width tables which astropy.io.ascii can read and write. The most significant difference is whether there is no header line (FixedWidthNoHeader), one header line (FixedWidth), or two header lines (FixedWidthTwoLine). Next, there are variations in the delimiter character, like whether the delimiter appears on either end (“bookends”), or if there is padding around the delimiter.

Details are available in the class API documentation, but the easiest way to understand all of the options and their interactions is by example.

Reading¶

Fixed Width¶

Nice, typical, fixed-format table:

>>>
>>> from astropy.io import ascii
>>> table = """
... # comment (with blank line above)
... |  Col1  |  Col2   |
... |  1.2   | "hello" |
... |  2.4   |'s worlds|
... """
>>> ascii.read(table, format='fixed_width')
<Table length=2>
  Col1     Col2
float64    str9
------- ---------
    1.2   "hello"
    2.4 's worlds

Typical fixed-format table with col names provided:

>>>
>>> table = """
... # comment (with blank line above)
... |  Col1  |  Col2   |
... |  1.2   | "hello" |
... |  2.4   |'s worlds|
... """
>>> ascii.read(table, format='fixed_width', names=['name1', 'name2'])
<Table length=2>
 name1    name2
float64    str9
------- ---------
    1.2   "hello"
    2.4 's worlds

Weird input table with data values chopped by col extent:

>>>
>>> table = """
...   Col1  |  Col2 |
...   1.2       "hello"
...   2.4   sdf's worlds
... """
>>> ascii.read(table, format='fixed_width')
<Table length=2>
  Col1    Col2
float64   str7
------- -------
    1.2    "hel
    2.4 df's wo

Table with double delimiters:

>>>
>>> table = """
... || Name ||   Phone ||         TCP||
... |  John  | 555-1234 |192.168.1.10X|
... |  Mary  | 555-2134 |192.168.1.12X|
... |   Bob  | 555-4527 | 192.168.1.9X|
... """
>>> ascii.read(table, format='fixed_width')
<Table length=3>
Name  Phone       TCP
str4   str8      str12
---- -------- ------------
John 555-1234 192.168.1.10
Mary 555-2134 192.168.1.12
 Bob 555-4527  192.168.1.9

Table with space delimiter:

>>>
>>> table = """
...  Name  --Phone-    ----TCP-----
...  John  555-1234    192.168.1.10
...  Mary  555-2134    192.168.1.12
...   Bob  555-4527     192.168.1.9
... """
>>> ascii.read(table, format='fixed_width', delimiter=' ')
<Table length=3>
Name --Phone- ----TCP-----
str4   str8      str12
---- -------- ------------
John 555-1234 192.168.1.10
Mary 555-2134 192.168.1.12
 Bob 555-4527  192.168.1.9

Table with no header row and auto-column naming:

Use header_start and data_start keywords to indicate no header line.

>>>
>>> table = """
... |  John  | 555-1234 |192.168.1.10|
... |  Mary  | 555-2134 |192.168.1.12|
... |   Bob  | 555-4527 | 192.168.1.9|
... """
>>> ascii.read(table, format='fixed_width',
...            header_start=None, data_start=0)
<Table length=3>
col1   col2       col3
str4   str8      str12
---- -------- ------------
John 555-1234 192.168.1.10
Mary 555-2134 192.168.1.12
 Bob 555-4527  192.168.1.9

Table with no header row and with col names provided:

Second and third rows also have hanging spaces after final “|”. Use header_start and data_start keywords to indicate no header line.

>>>
>>> table = ["|  John  | 555-1234 |192.168.1.10|",
...          "|  Mary  | 555-2134 |192.168.1.12|  ",
...          "|   Bob  | 555-4527 | 192.168.1.9|  "]
>>> ascii.read(table, format='fixed_width',
...            header_start=None, data_start=0,
...            names=('Name', 'Phone', 'TCP'))
<Table length=3>
Name  Phone       TCP
str4   str8      str12
---- -------- ------------
John 555-1234 192.168.1.10
Mary 555-2134 192.168.1.12
 Bob 555-4527  192.168.1.9

Fixed Width No Header¶

Table with no header row and auto-column naming. Use the ``fixed_width_no_header`` format for convenience:

>>>
>>> table = """
... |  John  | 555-1234 |192.168.1.10|
... |  Mary  | 555-2134 |192.168.1.12|
... |   Bob  | 555-4527 | 192.168.1.9|
... """
>>> ascii.read(table, format='fixed_width_no_header')
<Table length=3>
col1   col2       col3
str4   str8      str12
---- -------- ------------
John 555-1234 192.168.1.10
Mary 555-2134 192.168.1.12
 Bob 555-4527  192.168.1.9

Table with no delimiter with column start and end values specified:

This uses the col_starts and col_ends keywords. Note that the col_ends values are inclusive so a position range of zero to five will select the first six characters.

>>>
>>> table = """
... #    5   9     17  18      28    <== Column start / end indexes
... #    |   |       ||         |    <== Column separation positions
...   John   555- 1234 192.168.1.10
...   Mary   555- 2134 192.168.1.12
...    Bob   555- 4527  192.168.1.9
... """
>>> ascii.read(table, format='fixed_width_no_header',
...                 names=('Name', 'Phone', 'TCP'),
...                 col_starts=(0, 9, 18),
...                 col_ends=(5, 17, 28),
...                 )
<Table length=3>
Name   Phone      TCP
str4    str9     str10
---- --------- ----------
John 555- 1234 192.168.1.
Mary 555- 2134 192.168.1.
 Bob 555- 4527  192.168.1

Table with no delimiter with only column start or end values specified:

If only the col_starts keyword is given, it is assumed that each column ends where the next column starts, and the final column ends at the same position as the longest line of data.

Conversely, if only the col_ends keyword is given, it is assumed that the first column starts at position zero and that each successive column starts immediately after the previous one.

The two examples below read the same table and produce the same result.

>>>
>>> table = """
... #1       9        19                <== Column start indexes
... #|       |         |                <== Column start positions
... #<------><--------><------------->  <== Inferred column positions
...   John   555- 1234 192.168.1.10
...   Mary   555- 2134 192.168.1.123
...    Bob   555- 4527  192.168.1.9
...    Bill  555-9875  192.255.255.255
... """
>>> ascii.read(table,
...                 format='fixed_width_no_header',
...                 names=('Name', 'Phone', 'TCP'),
...                 col_starts=(1, 9, 19),
...                 )
<Table length=4>
Name   Phone         TCP
str4    str9        str15
---- --------- ---------------
John 555- 1234    192.168.1.10
Mary 555- 2134   192.168.1.123
 Bob 555- 4527     192.168.1.9
Bill  555-9875 192.255.255.255

>>> ascii.read(table,
...                 format='fixed_width_no_header',
...                 names=('Name', 'Phone', 'TCP'),
...                 col_ends=(8, 18, 32),
...                 )
<Table length=4>
Name   Phone        TCP
str4    str9       str14
---- --------- --------------
John 555- 1234   192.168.1.10
Mary 555- 2134  192.168.1.123
 Bob 555- 4527    192.168.1.9
Bill  555-9875 192.255.255.25

Fixed Width Two Line¶

Typical fixed-format table with two header lines with some cruft:

>>>
>>> table = """
...   Col1    Col2
...   ----  ---------
...    1.2xx"hello"
...   2.4   's worlds
... """
>>> ascii.read(table, format='fixed_width_two_line')
<Table length=2>
  Col1     Col2
float64    str9
------- ---------
    1.2   "hello"
    2.4 's worlds

reStructuredText table:

>>>
>>> table = """
... ======= ===========
...   Col1    Col2
... ======= ===========
...   1.2   "hello"
...   2.4   's worlds
... ======= ===========
... """
>>> ascii.read(table, format='fixed_width_two_line',
...                 header_start=1, position_line=2, data_end=-1)
<Table length=2>
  Col1     Col2
float64    str9
------- ---------
    1.2   "hello"
    2.4 's worlds

Text table designed for humans and test having position line before the header line:

>>>
>>> table = """
... +------+----------+
... | Col1 |   Col2   |
... +------|----------+
... |  1.2 | "hello"  |
... |  2.4 | 's worlds|
... +------+----------+
... """
>>> ascii.read(table, format='fixed_width_two_line', delimiter='+',
...                 header_start=1, position_line=0, data_start=3, data_end=-1)
<Table length=2>
  Col1     Col2
float64    str9
------- ---------
    1.2   "hello"
    2.4 's worlds

Writing¶

Fixed Width¶

Define input values ``dat`` for all write examples:

>>>
>>> table = """
... | Col1 |  Col2     |  Col3 | Col4 |
... | 1.2  | "hello"   |  1    | a    |
... | 2.4  | 's worlds |  2    | 2    |
... """
>>> dat = ascii.read(table, format='fixed_width')

Write a table as a normal fixed-width table:

>>>
>>> ascii.write(dat, format='fixed_width')
| Col1 |      Col2 | Col3 | Col4 |
|  1.2 |   "hello" |    1 |    a |
|  2.4 | 's worlds |    2 |    2 |

Write a table as a fixed-width table with no padding:

>>>
>>> ascii.write(dat, format='fixed_width', delimiter_pad=None)
|Col1|     Col2|Col3|Col4|
| 1.2|  "hello"|   1|   a|
| 2.4|'s worlds|   2|   2|

Write a table as a fixed-width table with no bookend:

>>>
>>> ascii.write(dat, format='fixed_width', bookend=False)
Col1 |      Col2 | Col3 | Col4
 1.2 |   "hello" |    1 |    a
 2.4 | 's worlds |    2 |    2

Write a table as a fixed-width table with no delimiter:

>>>
>>> ascii.write(dat, format='fixed_width', bookend=False, delimiter=None)
Col1       Col2  Col3  Col4
 1.2    "hello"     1     a
 2.4  's worlds     2     2

Write a table as a fixed-width table with no delimiter and formatting:

>>>
>>> ascii.write(dat, format='fixed_width',
...                  formats={'Col1': '%-8.3f', 'Col2': '%-15s'})
|     Col1 |            Col2 | Col3 | Col4 |
| 1.200    | "hello"         |    1 |    a |
| 2.400    | 's worlds       |    2 |    2 |

Fixed Width No Header¶

Write a table as a normal fixed-width table:

>>>
>>> ascii.write(dat, format='fixed_width_no_header')
| 1.2 |   "hello" | 1 | a |
| 2.4 | 's worlds | 2 | 2 |

Write a table as a fixed-width table with no padding:

>>>
>>> ascii.write(dat, format='fixed_width_no_header', delimiter_pad=None)
|1.2|  "hello"|1|a|
|2.4|'s worlds|2|2|

Write a table as a fixed-width table with no bookend:

>>>
>>> ascii.write(dat, format='fixed_width_no_header', bookend=False)
1.2 |   "hello" | 1 | a
2.4 | 's worlds | 2 | 2

Write a table as a fixed-width table with no delimiter:

>>>
>>> ascii.write(dat, format='fixed_width_no_header', bookend=False,
...                  delimiter=None)
1.2    "hello"  1  a
2.4  's worlds  2  2

Fixed Width Two Line¶

Write a table as a normal fixed-width table:

>>>
>>> ascii.write(dat, format='fixed_width_two_line')
Col1      Col2 Col3 Col4
---- --------- ---- ----
 1.2   "hello"    1    a
 2.4 's worlds    2    2

Write a table as a fixed width table with space padding and ‘=’ position_char:

>>>
>>> ascii.write(dat, format='fixed_width_two_line',
...                  delimiter_pad=' ', position_char='=')
Col1        Col2   Col3   Col4
====   =========   ====   ====
 1.2     "hello"      1      a
 2.4   's worlds      2      2

Write a table as a fixed-width table with no bookend:

>>>
>>> ascii.write(dat, format='fixed_width_two_line', bookend=True, delimiter='|')
|Col1|     Col2|Col3|Col4|
|----|---------|----|----|
| 1.2|  "hello"|   1|   a|
| 2.4|'s worlds|   2|   2|

Custom Header Rows¶

The fixed_width and fixed_width_two_line formats normally include a single initial row with the column names in the header. However, it is possible to customize the column attributes which appear as header rows. The available column attributes are name, dtype, format, description and unit. This is done by listing the desired the header rows using the header_rows keyword argument.

::
>>>
>>> from astropy.table.table_helpers import simple_table
>>> dat = simple_table(size=3, cols=4)
>>> dat["a"].info.unit = "m"
>>> dat["d"].info.unit = "m/s"
>>> dat["b"].info.format = ".2f"
>>> dat["c"].info.description = "C column"
>>> ascii.write(
...    dat,
...    format="fixed_width",
...    header_rows=["name", "unit", "format", "description"],
... )
|     a |       b |        c |     d |
|     m |         |          | m / s |
|       |     .2f |          |       |
|       |         | C column |       |
|     1 |    1.00 |        c |     4 |
|     2 |    2.00 |        d |     5 |
|     3 |    3.00 |        e |     6 |

In this example the 1st row is the dtype, the 2nd row is the name, and so forth. You must supply the name value in the header_rows list in order to get an output with the column name included.

A table with non-standard header rows can be read back in the same way, using the same list of header_rows:

>>>
>>> txt = """\
... | int32 | float32 |      <U4 | uint8 |
... |     a |       b |        c |     d |
... |     m |         |          | m / s |
... |       |     .2f |          |       |
... |       |         | C column |       |
... |     1 |    1.00 |        c |     4 |
... |     2 |    2.00 |        d |     5 |
... |     3 |    3.00 |        e |     6 |
... """
>>> dat = ascii.read(
...     txt,
...     format="fixed_width",
...     header_rows=["dtype", "name", "unit", "format", "description"],
... )
>>> dat.info
<Table length=3>
name  dtype   unit format description
---- ------- ----- ------ -----------
a   int32     m
b float32          .2f
c    str4                 C column
d   uint8 m / s

The same idea can be used with the fixed_width_two_line format:

>>>
>>> txt = """\
...     a       b        c     d
... int64 float64      <U1 int64
...     m                  m / s
... ----- ------- -------- -----
...     1    1.00        c     4
...     2    2.00        d     5
...     3    3.00        e     6
... """
>>> dat = ascii.read(
...     txt,
...     format="fixed_width_two_line",
...     header_rows=["name", "dtype", "unit"],
... )
>>> dat
<Table length=3>
  a      b     c     d
  m                m / s
int64 float64 str1 int64
----- ------- ---- -----
    1     1.0    c     4
    2     2.0    d     5
    3     3.0    e     6

Note that the two_line in the fixed_width_two_line format name refers to the default situation where the header consists two lines, a row of column names and a row of separator lines. This is a bit of a misnomer when using header_rows.

Page Contents

  • Fixed-Width Gallery
    • Reading
      • Fixed Width
      • Fixed Width No Header
      • Fixed Width Two Line
    • Writing
      • Fixed Width
      • Fixed Width No Header
      • Fixed Width Two Line
    • Custom Header Rows
«

Page Source   Back to Top

© Copyright 2011–2023, The Astropy Developers.
Created using Sphinx 5.3.0.   Last built 02 Feb 2023.