jpeg-go

a from-scratch jpeg encoder and decoder written in go. no external dependencies, just the standard library.

built this while writing a paper on image compression - i wanted to really understand what's happening under the hood. ai did the heavy lifting :)

what's in here

jpeg-go/
├── main.go                 # example program
├── go.mod
└── jpeg/
    ├── constants.go        # markers, quant tables, huffman tables
    ├── color.go            # rgb to ycbcr, subsampling
    ├── dct.go              # discrete cosine transform
    ├── quantization.go     # quantization, zigzag, rle
    ├── huffman.go          # huffman coding
    ├── encoder.go          # jpeg encoder
    └── decoder.go          # jpeg decoder

usage

encoding

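package main

import (
    "image/png"
    "log"
    "os"

    "jpeg-go/jpeg"
)

func main() {
    // read and decode the source png
    file, err := os.Open("input.png")
    if err != nil {
        log.Fatal(err)
    }
    img, err := png.Decode(file)
    file.Close()
    if err != nil {
        log.Fatal(err)
    }

    // write it back out as a jpeg
    output, err := os.Create("output.jpg")
    if err != nil {
        log.Fatal(err)
    }
    defer output.Close()

    encoder := jpeg.NewEncoder(75) // quality 1-100
    encoder.Encode(output, img)
}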

decoding

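package main

import (
    "image/png"
    "log"
    "os"

    "jpeg-go/jpeg"
)

func main() {
    // read and decode the source jpeg
    file, err := os.Open("input.jpg")
    if err != nil {
        log.Fatal(err)
    }
    img, err := jpeg.DecodeImage(file)
    file.Close()
    if err != nil {
        log.Fatal(err)
    }

    // save the decoded pixels as a png
    output, err := os.Create("output.png")
    if err != nil {
        log.Fatal(err)
    }
    defer output.Close()

    if err := png.Encode(output, img); err != nil {
        log.Fatal(err)
    }
}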

quality settings

1-10: crunchy as hell, but tiny files
50: decent balance
75: the sweet spot for most stuff
90-100: basically lossless looking, bigger files

how it works

the whole point of jpeg is that humans are way better at seeing brightness differences than color differences. so we can throw away a lot of color info and nobody notices.

1. rgb → ycbcr

first we split the image into brightness (Y) and color (Cb, Cr). this lets us treat them separately.
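here's a minimal sketch of that conversion for a single pixel, using the full-range equations from the JFIF spec. the names `rgbToYCbCr` and `clamp8` are made up for this example and aren't necessarily what color.go uses.

```go
package main

import "fmt"

// clamp8 rounds v to the nearest integer and clamps it to 0..255.
func clamp8(v float64) uint8 {
	if v < 0 {
		return 0
	}
	if v > 255 {
		return 255
	}
	return uint8(v + 0.5)
}

// rgbToYCbCr converts one pixel with the full-range JFIF equations.
// (hypothetical helper, not taken from this repo)
func rgbToYCbCr(r, g, b uint8) (y, cb, cr uint8) {
	rf, gf, bf := float64(r), float64(g), float64(b)
	y = clamp8(0.299*rf + 0.587*gf + 0.114*bf)
	cb = clamp8(128 - 0.168736*rf - 0.331264*gf + 0.5*bf)
	cr = clamp8(128 + 0.5*rf - 0.418688*gf - 0.081312*bf)
	return y, cb, cr
}

func main() {
	fmt.Println(rgbToYCbCr(255, 255, 255)) // white: max brightness, neutral color
	fmt.Println(rgbToYCbCr(255, 0, 0))     // pure red: Cr gets pushed to the top
}
```

grays always land on Cb = Cr = 128, which is why the two color channels are so compressible for low-saturation images. the standard library's `color.RGBToYCbCr` in `image/color` does the same job with integer math if you want something to compare against.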

2. chroma subsampling

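since eyes don't care much about color resolution, we shrink Cb and Cr to 1/4 of their original size (half the resolution in each direction, aka 4:2:0). Y stays at full size, so the three planes together go from 3 units of data to 1.5. that's already 50% smaller and you can barely tell.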
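a rough sketch of the shrinking step, assuming each output sample is the average of a 2x2 group. that filter choice and the name `subsample420` are assumptions for illustration; color.go might do it differently (just picking one sample per group is also common).

```go
package main

import "fmt"

// subsample420 halves a row-major w x h chroma plane in both directions by
// averaging each 2x2 group. odd edges reuse the last row or column.
// (hypothetical helper, not taken from this repo)
func subsample420(plane []uint8, w, h int) ([]uint8, int, int) {
	ow, oh := (w+1)/2, (h+1)/2
	out := make([]uint8, ow*oh)
	for y := 0; y < oh; y++ {
		y0, y1 := 2*y, 2*y+1
		if y1 >= h {
			y1 = h - 1
		}
		for x := 0; x < ow; x++ {
			x0, x1 := 2*x, 2*x+1
			if x1 >= w {
				x1 = w - 1
			}
			sum := int(plane[y0*w+x0]) + int(plane[y0*w+x1]) +
				int(plane[y1*w+x0]) + int(plane[y1*w+x1])
			out[y*ow+x] = uint8((sum + 2) / 4) // rounded average
		}
	}
	return out, ow, oh
}

func main() {
	cb := []uint8{
		10, 20, 30, 40,
		30, 40, 50, 60,
	}
	small, w, h := subsample420(cb, 4, 2)
	fmt.Println(small, w, h) // 8 samples in, 2 samples out
}
```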

3. 8x8 blocks

the image gets chopped into 8x8 pixel blocks. each one is processed on its own.
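real images are rarely a multiple of 8 wide or tall, so the edge blocks need filling in somehow. the sketch below repeats the nearest edge pixel, which is one common choice; that padding rule and the name `blockAt` are assumptions, not something confirmed from encoder.go.

```go
package main

import "fmt"

// blockAt copies the 8x8 block at block coordinates (bx, by) out of a
// row-major w x h plane. samples past the right or bottom edge repeat the
// nearest edge pixel. (hypothetical helper, not taken from this repo)
func blockAt(plane []uint8, w, h, bx, by int) [8][8]uint8 {
	var blk [8][8]uint8
	for y := 0; y < 8; y++ {
		sy := by*8 + y
		if sy >= h {
			sy = h - 1
		}
		for x := 0; x < 8; x++ {
			sx := bx*8 + x
			if sx >= w {
				sx = w - 1
			}
			blk[y][x] = plane[sy*w+sx]
		}
	}
	return blk
}

func main() {
	// a 10x10 plane needs 2x2 blocks; three of them are partly padding
	w, h := 10, 10
	plane := make([]uint8, w*h)
	for i := range plane {
		plane[i] = uint8(i)
	}
	blocksX, blocksY := (w+7)/8, (h+7)/8
	fmt.Println("blocks:", blocksX, "x", blocksY)
	fmt.Println("first row of the bottom-right block:", blockAt(plane, w, h, 1, 1)[0])
}
```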

4. dct (the math part)

each block goes through a discrete cosine transform. this converts pixel values into frequency components - basically "how much low frequency stuff vs high frequency stuff is in this block".

the cool thing is most of the important visual info ends up in the low frequencies (top-left of the transformed block). the high frequency stuff (bottom-right) is usually small values we can safely throw away.
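to make that concrete, here's the textbook forward DCT written straight from the formula. it's the slow O(n^4) version, shown as a sketch only; dct.go may well use a faster factorization, and `dct8x8` is a name picked for this example. jpeg subtracts 128 from each sample before the transform (the level shift), which the example does in `main`.

```go
package main

import (
	"fmt"
	"math"
)

// dct8x8 is the direct 2D DCT-II on a level-shifted 8x8 block.
// out[v][u] is the coefficient for horizontal frequency u and vertical
// frequency v, so out[0][0] is the DC term in the top-left corner.
// (hypothetical helper, not taken from this repo)
func dct8x8(in [8][8]float64) [8][8]float64 {
	var out [8][8]float64
	for v := 0; v < 8; v++ {
		for u := 0; u < 8; u++ {
			sum := 0.0
			for y := 0; y < 8; y++ {
				for x := 0; x < 8; x++ {
					sum += in[y][x] *
						math.Cos(float64(2*x+1)*float64(u)*math.Pi/16) *
						math.Cos(float64(2*y+1)*float64(v)*math.Pi/16)
				}
			}
			cu, cv := 1.0, 1.0
			if u == 0 {
				cu = 1 / math.Sqrt2
			}
			if v == 0 {
				cv = 1 / math.Sqrt2
			}
			out[v][u] = 0.25 * cu * cv * sum
		}
	}
	return out
}

func main() {
	var blk [8][8]float64
	for y := range blk {
		for x := range blk[y] {
			blk[y][x] = 200 - 128 // flat bright block, level-shifted by 128
		}
	}
	out := dct8x8(blk)
	fmt.Printf("dc = %.1f, one ac term = %.6f\n", out[0][0], out[0][1])
}
```

a perfectly flat block has no detail at all, so everything except the DC term comes out as (numerically) zero.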

5. quantization (the lossy part)

this is the only step that actually throws information away. we divide each dct value by the matching number from a quantization table and round the result. lots of the high-frequency values become zero, which is what makes the later steps so effective.

lower quality = bigger divisors = more zeros = smaller file = more artifacts
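here's a sketch of how a quality number can turn into divisors. the base table is the example luminance table from ITU-T T.81 Annex K, and the scaling rule is the IJG (libjpeg) convention. whether quantization.go scales its tables exactly this way is an assumption, and `scaleTable` / `quantize` are names made up for the example.

```go
package main

import (
	"fmt"
	"math"
)

// baseLuma is the example luminance table from ITU-T T.81 Annex K (row-major).
var baseLuma = [64]int{
	16, 11, 10, 16, 24, 40, 51, 61,
	12, 12, 14, 19, 26, 58, 60, 55,
	14, 13, 16, 24, 40, 57, 69, 56,
	14, 17, 22, 29, 51, 87, 80, 62,
	18, 22, 37, 56, 68, 109, 103, 77,
	24, 35, 55, 64, 81, 104, 113, 92,
	49, 64, 78, 87, 103, 121, 120, 101,
	72, 92, 95, 98, 112, 100, 103, 99,
}

// scaleTable applies the IJG quality scaling: quality 50 keeps the base
// table, lower values grow the divisors, higher values shrink them.
func scaleTable(base [64]int, quality int) [64]int {
	if quality < 1 {
		quality = 1
	}
	if quality > 100 {
		quality = 100
	}
	scale := 200 - 2*quality
	if quality < 50 {
		scale = 5000 / quality
	}
	var out [64]int
	for i, b := range base {
		q := (b*scale + 50) / 100
		if q < 1 {
			q = 1
		}
		if q > 255 {
			q = 255
		}
		out[i] = q
	}
	return out
}

// quantize divides each coefficient by its table entry and rounds.
func quantize(coef [64]float64, table [64]int) [64]int {
	var out [64]int
	for i, c := range coef {
		out[i] = int(math.Round(c / float64(table[i])))
	}
	return out
}

func main() {
	for _, q := range []int{10, 50, 90} {
		t := scaleTable(baseLuma, q)
		fmt.Printf("quality %d: dc divisor %d, highest-frequency divisor %d\n", q, t[0], t[63])
	}
	var coef [64]float64
	coef[0], coef[1], coef[63] = -415.4, 30.2, 3.1
	fmt.Println(quantize(coef, scaleTable(baseLuma, 50)))
}
```

the decoder multiplies by the same table to get approximate values back; whatever rounding error was introduced here is gone for good.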

6. zigzag ordering

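we read the 8x8 block in a zigzag pattern starting from top-left, so the values come out roughly ordered from low frequency to high frequency. since quantization zeroed most of the high frequencies, the zeros end up bunched together at the end.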
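the scan order can be generated by walking the anti-diagonals and flipping direction each time. this is just a sketch; the repo most likely stores the order as a fixed 64-entry table instead, and `zigzagOrder` is a hypothetical name.

```go
package main

import "fmt"

// zigzagOrder returns, for each position in the zigzag scan, the row-major
// index (row*8 + col) of the coefficient that goes there.
// (hypothetical helper, not taken from this repo)
func zigzagOrder() [64]int {
	var order [64]int
	i := 0
	for s := 0; s <= 14; s++ { // s = row + col, one anti-diagonal at a time
		for t := 0; t <= s; t++ {
			row, col := t, s-t // odd diagonals run down and to the left
			if s%2 == 0 {
				row, col = s-t, t // even diagonals run up and to the right
			}
			if row < 8 && col < 8 {
				order[i] = row*8 + col
				i++
			}
		}
	}
	return order
}

func main() {
	order := zigzagOrder()
	fmt.Println(order[:10])

	// reorder a block: scanned[i] is the i-th coefficient in zigzag order
	var block, scanned [64]int
	block[0], block[1], block[8] = 50, -3, 7
	for i, idx := range order {
		scanned[i] = block[idx]
	}
	fmt.Println(scanned[:5])
}
```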

7. run-length encoding

now we encode it as "5 zeros then a 12, 3 zeros then a -4, etc". way more compact than storing all those zeros individually.
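a sketch of that idea for the 63 AC coefficients of one block (the DC value is coded separately in jpeg). two details come from the jpeg format itself: a run can be at most 15, so 16 zeros in a row get their own marker (ZRL), and trailing zeros are replaced by a single end-of-block marker (EOB). the `rlePair` type and `runLength` name are made up for this example.

```go
package main

import "fmt"

// rlePair says "skip Run zeros, then comes Value".
// {15, 0} is ZRL (16 zeros) and {0, 0} is EOB (the rest is all zeros).
type rlePair struct {
	Run   int
	Value int
}

// runLength encodes zigzag-ordered AC coefficients as run/value pairs.
// (hypothetical helper, not taken from this repo)
func runLength(ac []int) []rlePair {
	last := -1 // index of the last nonzero coefficient
	for i, v := range ac {
		if v != 0 {
			last = i
		}
	}
	var out []rlePair
	run := 0
	for i := 0; i <= last; i++ {
		if ac[i] == 0 {
			run++
			if run == 16 {
				out = append(out, rlePair{Run: 15, Value: 0}) // ZRL
				run = 0
			}
			continue
		}
		out = append(out, rlePair{Run: run, Value: ac[i]})
		run = 0
	}
	if last < len(ac)-1 {
		out = append(out, rlePair{Run: 0, Value: 0}) // EOB
	}
	return out
}

func main() {
	ac := make([]int, 63)
	ac[5], ac[9] = 12, -4
	fmt.Println(runLength(ac)) // 5 zeros then 12, 3 zeros then -4, then EOB
}
```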

8. huffman coding

finally, everything gets huffman encoded. common patterns get short codes, rare ones get longer codes. standard entropy compression stuff.
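jpeg files don't store a huffman tree. they store how many codes exist for each bit length plus the symbols in order, and the actual codes are rebuilt from that. the sketch below does the rebuild for the standard luminance DC table from ITU-T T.81 Annex K. that constants.go holds this exact table is an assumption, and `buildCodes` / `huffCode` are names made up for the example.

```go
package main

import "fmt"

// huffCode is one code word: the low Len bits of Bits.
type huffCode struct {
	Bits uint16
	Len  int
}

// buildCodes assigns canonical codes. counts[i] is the number of codes of
// length i+1 and symbols lists the symbols in order of increasing length.
// (hypothetical helper, not taken from this repo)
func buildCodes(counts [16]int, symbols []byte) map[byte]huffCode {
	codes := make(map[byte]huffCode, len(symbols))
	code, k := uint16(0), 0
	for length := 1; length <= 16; length++ {
		for n := 0; n < counts[length-1]; n++ {
			codes[symbols[k]] = huffCode{Bits: code, Len: length}
			code++
			k++
		}
		code <<= 1 // moving to the next length appends a zero bit
	}
	return codes
}

func main() {
	// standard luminance DC table: symbols are size categories 0..11
	counts := [16]int{0, 1, 5, 1, 1, 1, 1, 1, 1}
	symbols := []byte{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11}
	codes := buildCodes(counts, symbols)
	for _, s := range symbols {
		c := codes[s]
		fmt.Printf("category %2d -> %0*b\n", s, c.Len, c.Bits)
	}
}
```

small categories (small changes in brightness between neighboring blocks) are the common case, so they get the 2 and 3 bit codes, while the rare big jumps get up to 9 bits.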

building

go build -o jpeg-compressor
./jpeg-compressor

this'll generate some test images at different quality levels so you can see the difference.

references

ITU-T T.81 (the actual jpeg spec)
JFIF spec
IJG documentation
this video got me interested in building this
