
↑ Michael Bukatin ↩🇺🇦

@ComputingByArts

Dataflow matrix machines (neuromorphic computations with linear streams). Julia, Python, Clojure, C, Processing. Shaders, ambient, psytrance, 40hz sound.


Check out our new preprint! arxiv.org/abs/2410.02543 We find that Diffusion Models are Evolutionary Algorithms! By viewing evolution as denoising, we show they share the same mathematical foundation. We then propose Diffusion Evolution (1/n)

Tweet Image 1
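A loose numpy sketch of the "evolution as denoising" view from the preprint above (my own toy illustration, not the paper's Diffusion Evolution algorithm; the fitness function, weighting, and schedule are all made up):

```python
import numpy as np

def fitness(x):
    # Toy objective, assumed only for this illustration: a single peak at the origin.
    return np.exp(-np.sum(x ** 2, axis=-1))

def evolution_as_denoising(pop_size=256, dim=2, steps=100, seed=0):
    rng = np.random.default_rng(seed)
    pop = rng.normal(0.0, 3.0, size=(pop_size, dim))   # start from "pure noise"
    for t in range(steps):
        alpha = (t + 1) / steps                        # crude noise schedule
        f = fitness(pop)
        # Each individual "denoises" toward a fitness-weighted average of nearby
        # individuals, a rough evolutionary analogue of an x0 estimate in diffusion.
        d2 = np.sum((pop[:, None, :] - pop[None, :, :]) ** 2, axis=-1)
        w = np.exp(-d2 / 2.0) * f[None, :]
        w /= w.sum(axis=1, keepdims=True)
        x0_hat = w @ pop
        pop = alpha * x0_hat + (1 - alpha) * pop + (1 - alpha) * rng.normal(size=pop.shape)
    return pop

print("best fitness:", fitness(evolution_as_denoising()).max())
```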


↑ Michael Bukatin ↩🇺🇦 Reposted

If I finetune my LM just on responses, without conditioning on instructions, what happens when I test it with an instruction? Or if I finetune my LM just to generate poems from poem titles? Either way, the LM will roughly follow new instructions! Paper: arxiv.org/pdf/2409.14254

Tweet Image 1
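A minimal sketch of the two training recipes being contrasted above (hypothetical tokenizer and helpers, not the paper's code; -100 is the usual "ignore this token in the loss" convention):

```python
def build_instruction_tuning_example(tokenizer, instruction, response):
    # Standard instruction tuning: condition on the instruction, loss on the response.
    prompt_ids = tokenizer.encode(f"Instruction: {instruction}\nResponse: ")
    response_ids = tokenizer.encode(response + tokenizer.eos_token)
    return {
        "input_ids": prompt_ids + response_ids,
        "labels": [-100] * len(prompt_ids) + response_ids,
    }

def build_response_only_example(tokenizer, response):
    # Response-only tuning: the instruction is never seen during finetuning at all;
    # the surprising finding is that the model still roughly follows instructions later.
    ids = tokenizer.encode(response + tokenizer.eos_token)
    return {"input_ids": ids, "labels": list(ids)}
```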

This is today. This is one of the strongest illustrations of the thesis, "LLMs already know almost everything and can do almost everything, they just need to be unhobbled". I'll repost a Twitter thread by the author, which is a good starting point.

📢 Join us tomorrow at 10 AM PST for the next DLCT talk featuring @johnhewtt! He’ll dive into "Instruction Following without Instruction Tuning"—exploring innovative approaches to model training and task generalization.

Tweet Image 1


"We recorded this conversation in person. In order to protect Gwern’s anonymity, we created this avatar. This isn’t his voice. This isn’t his face. But these are his words." dwarkeshpatel.com/p/gwern-branwen

Dwarkesh's attention to detail is why his podcasts stand out so much. Here Gwern brings up his essay from 2009, and Dwarkesh references a quote from it to transition to the next question.

Tweet Image 1


↑ Michael Bukatin ↩🇺🇦 Reposted

People learning JAX, feel free to reach out if the learning feels too steep; hopefully we can flatten it out. Also, check out the JAX LLM Discord for help from the community: discord.gg/m9NDrmENe2

This has been and will continue to be my recommendation for anyone in this position. Learn JAX and sign up for sites.research.google/trc/about/ It's one of the best things Google has ever done. You can do meaningful research for free, but the learning curve is steep. Strap in.
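For anyone wondering what the learning curve actually looks like, here is a tiny self-contained JAX example (my own toy snippet, unrelated to the Discord or TRC): the core workflow is writing pure functions and composing them with grad and jit.

```python
import jax
import jax.numpy as jnp

def loss(params, x, y):
    w, b = params
    return jnp.mean((x @ w + b - y) ** 2)

grad_fn = jax.jit(jax.grad(loss))        # compiled gradient of a pure function

key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (32, 3))
y = x @ jnp.array([1.0, -2.0, 0.5]) + 0.1
params = (jnp.zeros(3), 0.0)

for _ in range(200):                     # plain SGD on a toy linear regression
    grads = grad_fn(params, x, y)
    params = jax.tree_util.tree_map(lambda p, g: p - 0.1 * g, params, grads)

print("learned weights:", params[0])
```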



"Convolutional Differentiable Logic Gate Networks", NeurIPS oral, arxiv.org/abs/2411.04732

I am thrilled to announce that we have 3 accepted papers @NeurIPSConf, including an Oral 🎉. As of today, they are all available on arXiv. A big thanks to my co-authors @StefanoErmon @HildeKuehne @sutter_tobias @OliverDeussen @julianwelzel0 and Christian Borgelt! @StanfordAILab

Tweet Image 1
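For context, a minimal numpy sketch of what a single differentiable logic gate looks like in this line of work, as I understand it (a simplified relaxation, not the authors' code): inputs are probabilities in [0, 1], the gate keeps logits over the 16 two-input boolean functions, and its output is the softmax-weighted mixture of their real-valued relaxations.

```python
import numpy as np

def real_valued_ops(a, b):
    # Probabilistic relaxations of all 16 two-input boolean functions.
    return np.stack([
        np.zeros_like(a),        # FALSE
        a * b,                   # AND
        a - a * b,               # A AND NOT B
        a,                       # A
        b - a * b,               # NOT A AND B
        b,                       # B
        a + b - 2 * a * b,       # XOR
        a + b - a * b,           # OR
        1 - (a + b - a * b),     # NOR
        1 - (a + b - 2 * a * b), # XNOR
        1 - b,                   # NOT B
        1 - b + a * b,           # A OR NOT B
        1 - a,                   # NOT A
        1 - a + a * b,           # NOT A OR B
        1 - a * b,               # NAND
        np.ones_like(a),         # TRUE
    ])

def soft_gate(a, b, logits):
    w = np.exp(logits - logits.max())
    w /= w.sum()                 # softmax over the 16 candidate gates
    return np.tensordot(w, real_valued_ops(a, b), axes=1)

logits = np.random.randn(16)
print(soft_gate(np.array(0.9), np.array(0.2), logits))
```

After training, each gate is discretized to its argmax operation, giving a network of hard logic gates at inference time.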


Is there a prediction market for the final @arcprize top score when it closes on Nov 10? (It's currently at 55.5 out of 85 and keeps increasing at a solid clip, but one week does not seem to be enough to close the remaining gap.)


Looks great! "Project Sid: Many-agent simulations toward AI civilization" The paper etc in the github repository:

What will a world look like with 100 billion digital human beings? Today we share our tech report on Project Sid – a glimpse at the first AI agent civilization (powered by our new PIANO architecture). github.com/altera-al/proj… 1/8



This looks really cool ("Fourier Head: Helping Large Language Models Learn Complex Probability Distributions"):

LLMs are powerful sequence modeling tools! They not only can generate language, but also actions for playing video games, or numerical values for forecasting time series. Can we help LLMs better model these continuous "tokens"? Our answer: Fourier series! Let me explain… 🧵(1/n)
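A rough numpy sketch of the idea as I read the abstract (simplified, not the authors' implementation; in particular I just clip the density to keep it positive): instead of a plain linear-softmax head over m bins, learn Fourier-series coefficients of a density on [-1, 1], evaluate that density at the bin centers, and normalize to get a smooth categorical distribution.

```python
import numpy as np

def fourier_head_probs(coeffs_a, coeffs_b, num_bins):
    # coeffs_a, coeffs_b: learned cosine/sine coefficients (e.g., output of a small MLP).
    centers = np.linspace(-1, 1, num_bins)
    density = np.ones_like(centers)                  # constant term of the series
    for k, (a_k, b_k) in enumerate(zip(coeffs_a, coeffs_b), start=1):
        density += a_k * np.cos(np.pi * k * centers) + b_k * np.sin(np.pi * k * centers)
    density = np.clip(density, 1e-6, None)           # crude way to keep it positive
    return density / density.sum()

probs = fourier_head_probs(coeffs_a=[0.5, 0.1], coeffs_b=[-0.2, 0.0], num_bins=32)
print(probs.shape, probs.sum())
```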



TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters paper: arxiv.org/abs/2410.23168 repo: github.com/Haiyang-W/Toke… 🤗: huggingface.co/Haiyang-W TokenFormer introduces a scalable Transformer architecture that uses attention not only between input tokens but…
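A rough numpy sketch of the token-parameter attention idea as the abstract describes it (simplified and untested against the repo; I believe the real model uses a modified normalization rather than a plain softmax): a linear projection is replaced by attention from input tokens to learnable key/value parameter tokens, so capacity can grow by appending more parameter tokens.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def pattention(x, param_keys, param_values):
    # x: (seq_len, d_in); param_keys: (n_params, d_in); param_values: (n_params, d_out)
    scores = x @ param_keys.T / np.sqrt(x.shape[-1])
    return softmax(scores) @ param_values

rng = np.random.default_rng(0)
x = rng.normal(size=(10, 64))
pk, pv = rng.normal(size=(128, 64)), rng.normal(size=(128, 64))
y = pattention(x, pk, pv)     # plays the role of a 64 -> 64 "linear" layer
print(y.shape)                # (10, 64)
```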



It looks like this is the new text-to-image leader: recraft.ai/blog/recraft-i… @recraftai

Select comparisons between Recraft V3, FLUX1.1 [pro], Midjourney v6.1 and Stable Diffusion 3.5 Large. All images sourced from the Artificial Analysis Image Arena.

Tweet Image 1
Tweet Image 2
Tweet Image 3
Tweet Image 4


Please release the large expensive models! We would appreciate 3.5 Opus @AnthropicAI

This post is unavailable.

New Claude 3.5 Sonnet is weird. Stellar official results, but... The most discerning users are unhappy. Has something bad happened AFTER the progress measurements were taken, or is it the case that discerning users are always in conflict with averages?

So, it's really weird. I am also seeing this degradation report: x.com/VictorTaelin/s…



Introducing sCMs: our latest consistency models with a simplified formulation, improved training stability, and scalability. sCMs generate samples comparable to leading diffusion models but require only two sampling steps. openai.com/index/simplify…
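A schematic Python sketch of generic two-step consistency sampling (the general recipe from the consistency-models literature, not OpenAI's sCM code; the sigma values are placeholders): apply the consistency function once from pure noise, re-noise the estimate to an intermediate level, and apply it once more.

```python
import numpy as np

def two_step_sample(consistency_fn, shape, sigma_max=80.0, sigma_mid=0.8, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.normal(size=shape) * sigma_max           # start from pure noise
    x0 = consistency_fn(x, sigma_max)                # step 1: jump straight to data
    x_mid = x0 + rng.normal(size=shape) * sigma_mid  # re-noise to an intermediate level
    return consistency_fn(x_mid, sigma_mid)          # step 2: refine

# Dummy stand-in for a trained consistency function, just to make this runnable.
sample = two_step_sample(lambda x, sigma: x / (1.0 + sigma), shape=(4, 4))
print(sample.shape)
```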



"Decomposing The Dark Matter of Sparse Autoencoders" arxiv.org/abs/2410.14670 github.com/JoshEngels/SAE… By @JoshAEngels, Logan Riggs lesswrong.com/users/elriggs, and @tegmark

I’m excited about our new paper on mapping concepts in artificial neural networks with sparse autoencoders: we find that map errors exhibit remarkable structure, splitting into categories, which I’m optimistic can be leveraged to further improve “artificial neuroscience”:
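My reading of the decomposition, as a rough numpy sketch (based on the abstract, not the authors' code): split the SAE reconstruction error into the part that is linearly predictable from the input activations and the residual "dark matter".

```python
import numpy as np

def decompose_sae_error(acts, recons):
    # acts: (n, d) model activations; recons: (n, d) SAE reconstructions of them.
    err = acts - recons                                 # SAE reconstruction error
    X = np.hstack([acts, np.ones((acts.shape[0], 1))])  # add an intercept column
    W, *_ = np.linalg.lstsq(X, err, rcond=None)         # linear map: acts -> error
    linear_part = X @ W
    dark_matter = err - linear_part                     # unexplained residual
    frac_linear = 1.0 - dark_matter.var() / err.var()
    return linear_part, dark_matter, frac_linear
```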



Looks great! And here is "Depth dweller: translating art code between Processing and Wolfram languages using ChatGPT o1 and 4o" by @superflow: community.wolfram.com/groups/-/m/t/3…

a=(x,y,d=5*cos(o=mag(k=x/8-25,e=y/8-25)/3))=>[(q=x/2+k/atan(9*cos(e))*sin(d*4-t))*sin(c=d/3-t/8)+200,(y/4+5*o*o+q)/2*cos(c)+200] t=0,draw=$=>{t||createCanvas(w=400,w);background(6).stroke(255,96);for(t+=PI/60,y=99;++y<300;)for(x=99;++x<300;)point(...a(x,y))} //#つぶやきProcessing


