Documentation

strGo to source

A sequence of Unicode codepoints.

You can iterate over the grapheme clusters of the string using a for loop. Grapheme clusters are basically characters but keep together things that belong together, e.g. multiple codepoints that together form a flag emoji. Strings can be added with the + operator, joined together and multiplied with integers.

Typst provides utility methods for string manipulation. Many of these methods (e.g., split, trim and replace) operate on patterns: A pattern can be either a string or a regular expression. This makes the methods quite versatile.

All lengths and indices are expressed in terms of UTF-8 bytes. Indices are zero-based and negative indices wrap around to the end of the string.

You can convert a value to a string with the str constructor.

Example

#"hello world!" \
#"\"hello\n  world\"!" \
#"1 2 3".split() \
#"1,2;3".split(regex("[,;]")) \
#(regex("\\d+") in "ten euros") \
#(regex("\\d+") in "10 euros")

Escape sequences

Just like in markup, you can escape a few symbols in strings:

Constructor
Question mark

Converts a value to a string.

If you wish to convert from and to Unicode code points, see the to-unicode and from-unicode functions.

ExpandView example
#str(10) \
#str(4000, base: 16) \
#str(2.7) \
#str(1e8) \
#str(<intro>)

value
int or float or decimal or str or label or bytes or version or type
RequiredPositional
Question mark

The value that should be converted to a string.

baseDefault: 10

The base (radix) to display integers in, between 2 and 36.

Definitions
Question mark

lenGo to source

The length of the string in UTF-8 encoded bytes.

self.len() → int

firstGo to source

Extracts the first grapheme cluster of the string.

Returns the provided default value if the string is empty or fails with an error if no default value was specified.

self.first(default: str) → str

default

A default value to return if the string is empty.

lastGo to source

Extracts the last grapheme cluster of the string.

Returns the provided default value if the string is empty or fails with an error if no default value was specified.

self.last(default: str) → str

default

A default value to return if the string is empty.

atGo to source

Extracts the first grapheme cluster after the specified index. Returns the default value if the index is out of bounds or fails with an error if no default value was specified.

self.at(
int,default: any,
) → any

index
int
RequiredPositional
Question mark

The byte index. If negative, indexes from the back.

default
any

A default value to return if the index is out of bounds.

sliceGo to source

Extracts a substring of the string. Fails with an error if the start or end index is out of bounds.

self.slice() → str

start
int
RequiredPositional
Question mark

The start byte index (inclusive). If negative, indexes from the back.

end
none or int
Positional
Question mark
Default: none

The end byte index (exclusive). If omitted, the whole slice until the end of the string is extracted. If negative, indexes from the back.

count

The number of bytes to extract. This is equivalent to passing start + count as the end position. Mutually exclusive with end.

clustersGo to source

Returns the grapheme clusters of the string as an array of substrings.

self.clusters() → array

codepointsGo to source

Returns the Unicode codepoints of the string as an array of substrings.

self.codepoints() → array

to-unicodeGo to source

Converts a character into its corresponding code point.

ExpandView example
#"a".to-unicode() \
#("a\u{0300}"
   .codepoints()
   .map(str.to-unicode))
str.to-unicode(str) → int

character
str
RequiredPositional
Question mark

The character that should be converted.

from-unicodeGo to source

Converts a unicode code point into its corresponding string.

ExpandView example
#str.from-unicode(97)
str.from-unicode(int) → str

value
int
RequiredPositional
Question mark

The code point that should be converted.

normalizeGo to source

Normalizes the string to the given Unicode normal form.

This is useful when manipulating strings containing Unicode combining characters.

#assert.eq("é".normalize(form: "nfd"), "e\u{0301}")
#assert.eq("ſ́".normalize(form: "nfkc"), "ś")
self.normalize(form: str) → str

formDefault: "nfc"

VariantDetails
"nfc"Canonical composition where e.g. accented letters are turned into a single Unicode codepoint.
"nfd"Canonical decomposition where e.g. accented letters are split into a separate base and diacritic.
"nfkc"Like NFC, but using the Unicode compatibility decompositions.
"nfkd"Like NFD, but using the Unicode compatibility decompositions.

containsGo to source

Whether the string contains the specified pattern.

This method also has dedicated syntax: You can write "bc" in "abcd" instead of "abcd".contains("bc").

self.contains(strregex) → bool

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

starts-withGo to source

Whether the string starts with the specified pattern.

self.starts-with(strregex) → bool

pattern
str or regex
RequiredPositional
Question mark

The pattern the string might start with.

ends-withGo to source

Whether the string ends with the specified pattern.

self.ends-with(strregex) → bool

pattern
str or regex
RequiredPositional
Question mark

The pattern the string might end with.

findGo to source

Searches for the specified pattern in the string and returns the first match as a string or none if there is no match.

self.find(strregex) → nonestr

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

positionGo to source

Searches for the specified pattern in the string and returns the index of the first match as an integer or none if there is no match.

self.position(strregex) → noneint

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

matchGo to source

Searches for the specified pattern in the string and returns a dictionary with details about the first match or none if there is no match.

The returned dictionary has the following keys:

ExpandView example: Shape of the returned dictionary
#let pat = regex("not (a|an) (apple|cat)")
#"I'm a doctor, not an apple.".match(pat) \
#"I am not a cat!".match(pat)
ExpandView example: Different kinds of patterns
#assert.eq("Is there a".match("for this?"), none)
#"The time of my life.".match(regex("[mit]+e"))
self.match(strregex) → nonedictionary

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

matchesGo to source

Searches for the specified pattern in the string and returns an array of dictionaries with details about all matches. For details about the returned dictionaries, see above.

ExpandView example
#"Day by Day.".matches("Day")
self.matches(strregex) → array

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

replaceGo to source

Replace at most count occurrences of the given pattern with a replacement string or function (beginning from the start). If no count is given, all occurrences are replaced.

self.replace() → str

pattern
str or regex
RequiredPositional
Question mark

The pattern to search for.

replacement
str or function
RequiredPositional
Question mark

The string to replace the matches with or a function that gets a dictionary for each match and can return individual replacement strings.

The dictionary passed to the function has the same shape as the dictionary returned by match.

count

If given, only the first count matches of the pattern are replaced.

trimGo to source

Removes matches of a pattern from one or both sides of the string, once or repeatedly and returns the resulting string.

pattern
none or str or regex
Positional
Question mark
Default: none

The pattern to search for. If none, trims white spaces.

at

Can be start or end to only trim the start or end of the string. If omitted, both sides are trimmed.

repeatDefault: true

Whether to repeatedly removes matches of the pattern or just once. Defaults to true.

splitGo to source

Splits a string at matches of a specified pattern and returns an array of the resulting parts.

When the empty string is used as a separator, it separates every character (i.e., Unicode code point) in the string, along with the beginning and end of the string. In practice, this means that the resulting list of parts will contain the empty string at the start and end of the list.

self.split(nonestrregex) → array

pattern
none or str or regex
Positional
Question mark
Default: none

The pattern to split at. Defaults to whitespace.

revGo to source

Reverse the string.

self.rev() → str