【george hotz archive】George Hotz | Researching | multiGPU with HIP (or maybe without HIP) | HSA | HIP Graph | Part 1



george hotz archive :George Hotz | Researching | multiGPU with HIP (or maybe without HIP) | HSA | HIP Graph | Part 1

George Hotz | Researching | multiGPU with HIP (or maybe without HIP) | HSA | HIP Graph | Part 1

Date of the stream 20 Jan 2024.
from $1250 buy https://comma.ai/shop/comma-3x & best ADAS system in the world https://openpilot.comma.ai
Live-stream chat added as Subtitles/CC – English (Twitch Chat) – at the bottom – Show Transcript

Sources:
– https://hsafoundation.com/wp-content/uploads/2021/02/HSA-PRM-1.2.pdf
Follow for notifications:
– https://twitch.tv/georgehotz
Support George:
– https://twitch.tv/subs/georgehotz
Pre-order tinybox:
– https://buy.stripe.com/5kAaGL6lk9uX9nW144 (https://tinygrad.org/)

Chapters:
00:00:00 intro
00:00:20 no warning, linkedin ban, child prodigy
00:02:25 torchrl
00:03:20 tinybox pre-order, AI computer, lambda labs
00:05:20 lambdalabs vs tinybox
00:12:20 7302p vs 7532 epyc
00:15:00 tinybox raspberry pi for ML
00:16:10 not buying Apple Vision Pro
00:16:45 fastdd github, Meta buying H100
00:18:10 twitch removing content warning, linkedin post
00:20:15 linkedin worst dating site
00:20:35 selling rolls royce, money
00:21:25 George not a good fit for twitter, culture war
00:23:00 Peter Thiel dinner party, e/acc
00:25:55 better processor and it got slower
00:27:35 drive faster than dev 0
00:30:00 boost frequency, perplexity
00:33:50 bios, ipmi, epyc boost, not boosting
00:44:40 btop, pcie 4 vs 5
00:46:25 direct democracy
00:47:40 boost speed
00:50:10 hip graph is not fast
00:52:20 ROCm 6.0, Llama-2-70b slow, single thread
00:52:55 single thread, multithread, multiprocess tinygrad
00:53:55 ggml, tinygrad long term goal, universal
00:55:00 event, block slow
00:56:40 GPU queue sync, multiprocess
00:58:35 writing your own GPU driver, userspace
00:58:54 AMD HIP, clone of CUDA
01:00:30 finding HIP graph code
01:02:50 spinlocks, multiprocessing, GPU driver
01:03:25 how do GPUs work?
01:06:00 prebuilding the queues, hip semaphores
01:09:20 rdna 3 instruction set
01:12:40 so much complexity, micro engine scheduler
01:14:40 Alex
01:19:55 reading the code to send packets
01:22:45 hate free stream
01:23:30 amd gpu scheduling
01:32:50 perplexity valuation, how to value a company
01:34:00 HSA queue
01:35:30 perplexity fast, GPT4 slow, anthropic
01:39:00 HSA level 0
01:41:30 HSA runtime book, anna’s archive
01:45:30 no copyright infringement intended
01:46:20 AQL packets
01:52:55 piano
01:53:20 AMD is for people who likes to get twice as much GPUs for their money
01:54:50 tinygrad pay per token API
01:58:30 replacing HIP support with HSA support
02:00:20 Nvdia vs AMD datacenter, customer GPU architecture
02:02:30 secret good version of openpilot joke
02:03:40 HIP does not use DMA engine
02:05:00 bit blit
02:14:20 rocm-bandwidth-test
02:17:00 hca kmt api amd
02:17:55 Alex
02:20:08 the hidden song
02:23:30 going on a journey
02:27:55 real completion events
02:31:30 hsa example of kernel dispatch
02:32:00 cool that AMD is so open
02:35:40 just using HSA, HSA rabbit hole, hsa foundation
02:37:50 the chapel language with 0 github stars
02:39:40 HSA Programmer’s Reference Manual
02:40:20 linkedin post
02:42:40 the weather people, if you could design a country, deep state
02:45:45 conservatism, progressivism quote
02:49:00 Alex
02:50:00 thinking from first principles, experiments hard
02:50:20 has anyone heard about HSA foundation
02:52:50 scientific computing people, OpenMP, OpenACC
02:55:20 AMD extensions
02:56:15 traveling salesman, 2^n algorithm, scientific computing funding
02:59:10 leslie greengard
02:59:40 deep learning revolution
03:01:00 tinygrad experiment, complexity dysfunction of governance
03:01:38 misunderstanding of how software is developed today
03:02:00 compression is intelligence
03:02:20 complexity management instead complexity reduction
03:02:40 spacex rocket landing controls genius
03:05:10 complex systems, twitter
03:06:10 software 0 cost to replication
03:07:40 twitter acquisition best political dollar ever spend
03:09:30 making the tinybox good
03:09:46 making money off OSS
03:10:10 pre-order tinyboxes
03:11:10 etched.com, tenstorrent.com
03:13:50 tenstorrent offering a card to George
03:14:15 respect to tenstorrent, intel tier
03:15:05 extropic.ai
03:16:40 science grants, fundamental research that needs to be done
03:17:40 bullish on perplexity
03:18:40 atomicsemi.com
03:19:30 ranking startups, tenstorrent open source
03:21:00 tinygrad factorization
03:23:30 hammer.lol, berkshirehathaway.com
03:24:40 stop using javascript
03:27:20 apple.com website, feross.org
03:30:10 lana_lux 5k viewers

Official George Hotz communication channels:
– https://geohot.com
– https://twitter.com/realGeorgeHotz
– https://instagram.com/georgehotz
– https://tinygrad.org
– https://geohot.github.io/blog
– https://github.com/geohot

We archive George Hotz and comma.ai videos for fun.
Follow for notifications:
– https://twitter.com/geohotarchive

Thank you for reading and using the SHOW MORE button.
We hope you enjoy watching George’s videos as much as we do.
See you at the next video.