Status update of my tsc port
I'm the creator of swc, a web build tool written in Rust. And I'm working at Vercel to make web development as fast as it can be. My go-to language is Rust, but I selected Go for the new TypeScript type checker.
- Related: I'm porting tsc to Go
Porting TypeScript compiler is hard
Every web developer wants a faster TypeScript type checker.
But there's only one implementation of it because implementing it is way too hard.
TypeScript does not have specification and tsc
is the only thing which can be called a reference.
But well... tsc
is a moving target, and the velocity is far from common sense.
Definitely, there's no other word which can represent it correctly.
Let's compare v4.7.2
and v4.8.2
.
These versions are the first official stable releases of 4.7
and 4.8
respectively.
There are 311 commits and 569635 lines of codes are changed.
Yes, that’s correct.
311 commits. In one minor version bump.
Following tsc
with swiftness, and with custom reimplementation is nearly impossible.
That's why I decided to port tsc
to Go, not Rust.
As TypeScript and Go shares lots of common properties, I can rewrite almost all TypeScript code in Go line-by-line.
But while trying to do it, I found a better methodology.
My methodology
Porting line-by-line is easy. But... there are so many lines.
TypeScript on release-4.9 is 📦 v4.9.1-beta via v16.17.0 took 2s
❯ tokei src
===============================================================================
Language Files Lines Code Comments Blanks
===============================================================================
JSON 32 9076 9032 0 44
Markdown 2 59 0 37 22
TypeScript 538 346496 270968 42109 33419
===============================================================================
Total 572 355631 280000 42146 33485
===============================================================================
The src
directory of TypeScript contains 346496 lines of TypeScript.
Line-by-line porting is easy, but 346496 * (line-by-line porting) is definitely not easy nor funny.
No one, at least on this planet, wants to port hundreds of thousands of lines of code on one's own. As I'm one of the people living on this planet, I looked for another way. It should be much less cumbersome way than line-by-line port.
Then I got an idea.
If we want a line-by-line port, computers can help. All about computers are automation. Why can't we apply it here?
We can write a compiler which takes TypeScript compiler as an input, and emits a new TypeScript compiler as an output. Of course, output should be written in Go.
Yeah, this is insane.
This is way more insane than velocity of tsc
.
But one thing we should note is that following tsc
becomes viable if we transpile tsc
to Go.
I'm an experienced compiler developer. I have been maintaining swc for several years. So I didn't need to study about compiler internals, and I just started writing a compiler.
ts2go
: The transpiler
TypeScript contains a file named checker.ts
, which is super big.
Guessing from the name of the file, I expected this to contain core validation logic. I chose TypeScript for the language of TypeScript transpiler, because I can access type information while transpiling.
Of course, emitting a Go file which can be consumed by go build
was a hard task.
But it was something doable.
I simply invested lots of time.
After fixing all syntax errors, checker.gen.go
had 100K+ compile errors.
tsfix
: Go autofix tool
I found that what I can do from TypeScript compiler is quite limited, because I can't access any information from custom Go code.
So I investigated.
While searching, I found that golang/x/tools/analysis
supports --fix
, and it supports using the type information..
Lint tool with autofix can be used to modify code automatically.
So it was almost a perfect fit.
Once I implemented it, it worked!
I had to deal with some bugs, but it was fine.
It perform various operations. It modifies type definitions, automatically convert types, and perform much more, based on the CLI option.
Status
While working on it, I found that it would be much better if I generate more files.
So I did so.
Also, I improved ts2go
and tsfix
over time, and I reduced the number of compile errors.
1298
Per file:
32 ts/internal/checker/binder.gen.go
713 ts/internal/checker/checker.gen.go
2 ts/internal/checker/checker_base.go
17 ts/internal/checker/core.go
1 ts/internal/checker/factory_node_factory.go
31 ts/internal/checker/factory_utilities.gen.go
117 ts/internal/checker/moduleNameResolver.gen.go
79 ts/internal/checker/parser.gen.go
6 ts/internal/checker/path.gen.go
86 ts/internal/checker/program.gen.go
23 ts/internal/checker/scanner.gen.go
181 ts/internal/checker/utilities.gen.go
9 ts/internal/checker/utilitiesPublic.gen.go
1 ts/internal/checker/visitor_public.go
From 100K+ compile errors, to 1298 compile errors. Still, there are lots of tasks to do. But once the number become small enough, I can switch to manual implementation, which is quite simple and fast. It requires applying same manual patch for each update but it will allow faster release and I'll improve automatic conversion rate over time.
Moving forward
For now, my focus continues to be swc, in particular making its minifier production-ready and ensuring a successful integration into Next.js And because this is still my private project, so I can't use working hours for this. Thus, I can't guarantee when this will be ready.