Why does Typst allow minus signs in identifiers?

Hi. It seems Typst allows minus signs in variable names.

#import "@local/commute:0.2.0": node, arr, commutative-diagram

How is this differentiated from regular minus operations? Is it true that I need to add spaces around operators? I think that’s not true since I use / without spaces often. What’s the exact rules controlling this bizarre behavior?

It’s not bizarre, it’s just the naming convention Typst embraces called “kebab-case” because it seemed a little nicer for a typesetting system than the usual “snake_case”.

It is differentiated the same way as most operations are: To refer to a variable with a hyphen in its name, you need to be in code mode and spaces around the hyphen aren’t needed, the parser can take care of this.

4 Likes

Interesting to know that only - isn’t treated the same as other infix operators.

Was rationale given for adopting this convention? All my typst functions
are still in snake case. It feels bizzare to have an identifier with - in it, so was wondering why Typst made this the convention.

1 Like

It is less bizarre if you remember that ‘-’ is actually a hyphen, which is very commonly used in many languages to combine words.

There is also precedent in languages like lisp and css for hyphenated identifiers. Personally, I like it, it looks better than underscore, and it’s imo probably the most natural way to create compound words, since that’s how you do it in human language.

Come to think of it, my own name contains a hyphen, so I would say it’s an extremely natural part of an identifier!

If we set aside the long-established convention of programming languages prohibiting hyphens in identifiers, I’d concede that hyphens might be the better choice over underscores for readability and alignment with natural language. However, coding languages have historically rejected this trend, and I think Typst made a misstep by breaking from it—even if, in theory, it’s a defensible move. Hyphenated identifiers in Typst code, like my-variable, don’t feel like traditional ‘code’ to me; they add a subtle layer of friction for programmers accustomed to underscores, making the transition less intuitive. It’s not a dealbreaker—hardly the end of the world—but I’m sticking with underscores in silent protest, hoping the Typst community might eventually lean back toward convention. I’ll keep it up until a colleague I’m collaborating with on a Typst document calls me out and forces the issue.

1 Like

I’ll also add that as a marketing ploy, that is it easier for a newcomer to Typst with coding background to have that “a-ha!” moment of the power of combining markup with a programming language, if that programming language looks like other languages they have seen.

1 Like

This seems to me to be clearly a personal preference. When I think about identifiers I think about:

  • Naming things whatever I want without following any kind of convention
  • Camelcase
  • Snake case
  • Kebab case
  • Are emojis supported? What about UTF-8?

I don’t see why using underscores is more correct than dashes. Yes, there is a historical preference for them, but that was born from a time when it wasn’t even clear that ASCII would be the character encoding of choice going forward. We aren’t restricted in the same way anymore.

Also, as I understand it, Typst isn’t aiming to reduce the learning curve for programmers. Some cherry-picked quotes from typst.app/home:

A more productive workflow for science.

For Rocket Scientists.
And the rest of us, too.

There is also this:

uses familiar programming constructs instead of hard-to-understand macros.

But that is directly contrasting Latex which I don’t think is written for programmers either.

I’d also like to point out that the character we are talking about has many uses. In the title of this post it is called a minus sign. I have also heard people call it a hyphen, a stroke, or a dash. The single character fulfills multiple roles. Yes, it’s a bit annoying that you can end up with something like #let x = num-a - 1. The same character has two completely different uses. But that’s not the fault of Typst but of how (English) language uses that character (plus, how many characters do you want on your keyboard?).

Anyways, that’s my opinion on this non-critical aspect of a typesetting system that I enjoy using.

Edit: For completeness, the standard used by Typst is uax31 (source forum post by kevio).

2 Likes

Kebab case may be a bit niche but still, apart form the already mentioned CSS and Lisp it’s used at least in COBOL (hum), Forth and Raku (Perl 6), and also modern Lisp variants like Clojure. It has two great advantages:

  • Easier to read than all the other options
  • Easier to type than snake case (in most keyboard layouts you need shift for _ but not for -)

and one great downside:

  • It’s ambiguous with subtraction in languages that have infix operators

I personnally love the Typst solution, which is to require spaces around the minus operator in code. Because this makes code even more readable. I think the biggest issue with kebab case in Typst is the inconsistency in math mode where #a-b and func(a-b: xxx) are parsed as one word but a-b alone is parsed as a - b. But it’s not a big deal either. I think the two benefits mentioned above make it well worth it.