CodeHilite¶
Summary¶
The CodeHilite extension adds code/syntax highlighting to standard Python-Markdown code blocks using Pygments.
This extension is included in the standard Markdown library.
Setup¶
Step 1: Download and Install Pygments¶
You will also need to download and install the Pygments package on your
PYTHONPATH
. The CodeHilite extension will produce HTML output without
Pygments, but it won’t highlight anything (same behavior as setting
use_pygments
to False
).
Step 2: Add CSS Classes¶
You will need to define the appropriate CSS classes with appropriate rules. The CSS rules either need to be defined in or linked from the header of your HTML templates. Pygments can generate CSS rules for you. Just run the following command from the command line:
pygmentize -S default -f html -a .codehilite > styles.css
If you are using a different css_class
(default: .codehilite
), then
set the value of the -a
option to that class name. The CSS rules will be
written to the styles.css
file which you can copy to your site and link from
your HTML templates.
If you would like to use a different theme, swap out default
for the desired
theme. For a list of themes installed on your system (additional themes can be
installed via Pygments plugins), run the following command:
pygmentize -L style
See Pygments’ excellent documentation for more details. If no language is defined, Pygments will attempt to guess the language. When that fails, the code block will not be highlighted.
See Also
GitHub user richeland has provided a number of different CSS style
sheets which work with Pygments along with a preview of each theme.
The css_class
used is .highlight
. Therefore, one would need to override the
css_class
option when using richeland’s CSS styles. However, the
Python-Markdown project makes no guarantee that richeland’s CSS styles will
work with the version of Pygments you are using. To ensure complete
compatibility, you should generate the CSS rules from your own installation
of Pygments.
Syntax¶
The CodeHilite extension follows the same syntax as regular Markdown code blocks, with one exception. The highlighter needs to know what language to use for the code block. There are three ways to tell the highlighter what language the code block contains and each one has a different result.
Note
The format of the language identifier only effects the display of line numbers
if linenums
is set to None
(the default). If set to True
or False
(see Usage below) the format of the identifier has no effect on the
display of line numbers – it only serves as a means to define the language
of the code block.
Shebang (with path)¶
If the first line of the code block contains a shebang, the language is derived from that and line numbers are used.
#!/usr/bin/python
# Code goes here ...
Will result in:
1 2 |
|
Shebang (no path)¶
If the first line contains a shebang, but the shebang line does not contain a
path (a single /
or even a space), then that line is removed from the code
block before processing. Line numbers are used.
#!python
# Code goes here ...
Will result in:
1 |
|
Colons¶
If the first line begins with three or more colons, the text following the colons identifies the language. The first line is removed from the code block before processing and line numbers are not used.
:::python
# Code goes here ...
Will result in:
# Code goes here ...
Certain lines can be selected for emphasis with the colon syntax. When using Pygments’ default CSS styles, emphasized lines have a yellow background. This is useful to direct the reader’s attention to specific lines.
:::python hl_lines="1 3"
# This line is emphasized
# This line isn't
# This line is emphasized
Will result in:
# This line is emphasized
# This line isn't
# This line is emphasized
Note
hl_lines
is named for Pygments’ option meaning “highlighted lines”.
When No Language is Defined¶
CodeHilite is completely backwards compatible so that if a code block is
encountered that does not define a language, the block is simply wrapped in
<pre>
tags and output.
# Code goes here ...
Will result in:
# Code goes here ...
Lets see the source for that:
<div class="codehilite"><pre><code># Code goes here ...
</code></pre></div>
Note
When no language is defined, the Pygments highlighting engine will try to guess
the language (unless guess_lang
is set to False
). Upon failure, the same
behavior will happen as described above.
Usage¶
See Extensions for general extension usage. Use codehilite
as the
name of the extension.
See the Library Reference for information about configuring extensions.
The following options are provided to configure the output:
-
linenums
: An alias to Pygments’linenos
formatter option. Possible values areTrue
for yes,False
for no andNone
for auto. Defaults toNone
.Using
True
will force every code block to have line numbers, even when using colons (:::
) for language identification.Using
False
will turn off all line numbers, even when using shebangs (#!
) for language identification. -
guess_lang
: Automatic language detection. Defaults toTrue
.Using
False
will prevent Pygments from guessing the language, and thus highlighting blocks only when you explicitly set the language. -
css_class
: An alias to Pygmentscssclass
formatter option. Set CSS class name for the wrapper<div>
tag. Defaults tocodehilite
. -
pygments_style
: Pygments HTML Formatter Style (ColorScheme
). Defaults todefault
.Note
This is useful only when
noclasses
is set toTrue
, otherwise the CSS styles must be provided by the end user. -
noclasses
: Use inline styles instead of CSS classes. Defaults toFalse
. -
use_pygments
: Specifies the use of Pygments in generating the output.If
True
(the default) and Pygments is available, CodeHilite will use Pygments to analyze and format the output. Additionally, if using Pygments >= 2.4, the output will be wrapped in<code>
tags, whereas earlier versions will not.Otherwise, Pygments will not be used. If a language is defined for a code block, it will be assigned to the
<code>
tag as a class in the manner suggested by the HTML5 spec and may be used by a JavaScript library in the browser to highlight the code block. See thelang_prefix
option to customize the prefix. -
lang_prefix
: The prefix prepended to the language class assigned to the HTML<code>
tag. Default:language-
. -
pygments_formatter
: This option can be used to change the Pygments formatter used for highlighting code blocks. By default, this is set to the string'html'
, which means it’ll use the defaultHtmlFormatter
provided by Pygments.This can be set to a string representing any of the other default formatters, or set to a formatter class (or any callable).
The code’s language is always passed to the formatter as an extra option
lang_str
, with the value formatted as{lang_prefix}{lang}
. If the language is unspecified, the language guessed by Pygments will be used. While this option has no effect to the Pygments’s builtin formatters, a user can make use of the language in their custom formatter. See an example below.To see what formatters are available and how to subclass an existing formatter, please visit Pygments documentation on this topic.
-
Any other Pygments’ options:
All other options are accepted and passed on to Pygments’ lexer and formatter. Therefore, valid options include any options which are accepted by the html formatter or whichever lexer the code’s language uses. Invalid options are ignored without error.
A trivial example:
markdown.markdown(some_text, extensions=['codehilite'])
To keep the code block’s language in the Pygments generated HTML output, one can provide a custom Pygments formatter
that takes the lang_str
option. For example,
from pygments.formatters import HtmlFormatter
from markdown.extensions.codehilite import CodeHiliteExtension
class CustomHtmlFormatter(HtmlFormatter):
def __init__(self, lang_str='', **options):
super().__init__(**options)
# lang_str has the value {lang_prefix}{lang}
# specified by the CodeHilite's options
self.lang_str = lang_str
def _wrap_code(self, source):
yield 0, f'<code class="{self.lang_str}">'
yield from source
yield 0, '</code>'
some_text = '''\
:::python
print('hellow world')
'''
markdown.markdown(
some_text,
extensions=[CodeHiliteExtension(pygments_formatter=CustomHtmlFormatter)],
)
The formatter above will output the following HTML structure for a code block:
<div class="codehilite">
<pre>
<code class="language-python">
...
</code>
</pre>
</div>