Jonah Bernard

i want to build systems that advance civilization and make it easier for humans to live

i'm currently exploring llm inference

hi! i'm jonah bernard, i'm a senior at cornell studying computer science

i am interested in the world and in computing systems

i am currently exploring llm inference with a focus on optimizing software for different hardware backends

i am interested in building technology systems that advance civilization and make us more productive, safer, and lead more meaningful lives

please reach out to me if you want to chat!

Experience

AMD

Software Engineer Intern

Summer 2026

Cubic Transportation Systems

Software Engineer Intern

Summer 2025

Worked on building technology systems that advance civilization and make it easier for humans to live. Contributed to various software engineering projects.

Education

Cornell University

B.S. in Computer Science

2022-2026

Senior studying computer science. Exploring LLM inference with a focus on optimizing software for different hardware backends.

open source work is important to me as it makes it easier for young people to bring their ideas to life

alongside my open-source contributions, i write technical blogs about topics to help newcomers contribute more effectively to ai infrastructure

Understanding the SGLang Scheduler

my deep dive into the inner workings of sglang's famous scheduler

coming soon

My open-source contributions

SGLang

My tech stack

these are the tools and technologies i reach for when building, debugging, and exploring

Languages

Python

my daily driver for ml infra, scripting, and most of my sglang work

C++

for low-level work

CUDA

writing custom kernels for nvidia gpus, mostly around moe and lora

Triton

use it to write kernels for both AMD and NVIDIA gpus

Metal (MSL)

writing kernels for apple silicon as part of the sglang port

OCaml

picked it up at cornell

Java

my first programming language

ML & Inference

SGLang

the inference engine i contribute to most actively

PyTorch

model definitions, custom ops, and most experimentation

vLLM

reference point when i'm thinking about scheduler and kv-cache design

Hugging Face

model weights, tokenizers, and quick benchmarking

ONNX

portable model format for moving across runtimes and hardware

ONNX Runtime

running onnx models with hardware-specific execution providers

Tools

MacBook Pro M1

my daily machine for development

git

all of my open-source work flows through it

tmux

how i keep long-running training and bench jobs alive

Claude Code / Cursor / Antigravity

i switch between them depending on the most recent updates

Docker

reproducing inference environments across machines

LLVM

wrote a few compiler passes

MLIR

wrote a few compiler passes

Linux

everyone uses linux

Bash

necessary to do work on my mac

LangChain

used it to build an agent to help me contribute to sglang

PostgreSQL

my default for relational storage

MinIO

my default object store

i'm a software engineer because i want to make the world a better place through technology

to ensure i know the best way to do this, i carve out time to learn about law, history and business

JonahBernard

|

Experience

AMD

Software Engineer Intern

Cubic Transportation Systems

Software Engineer Intern

Education

Cornell University

B.S. in Computer Science

Understanding the SGLang Scheduler

My open-source contributions

SGLang

Major projects

Full List of my Prs to SGLang

personal projects

My tech stack

Languages

ML & Inference

Tools

Jonah
Bernard