A comparison of training the exact same model with all 3 approaches: 1) Pure PyTorch: github.com/rasbt/pytorch-… (25 min) 2) PyTorch + Fabric: github.com/rasbt/pytorch-… (1.7 min) 3) PyTorch + Trainer: github.com/rasbt/faster-p… (2.7 min; the 1 extra min is for logging + checkpointing)
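For context, a minimal sketch of what the "PyTorch + Fabric" variant looks like (the model and data below are placeholders, not the exact setup from the linked repos): Fabric wraps a plain PyTorch training loop and takes over device placement with only a few extra lines.

```python
import torch
from lightning.fabric import Fabric

# Hypothetical toy model and data; the linked repos train a real model.
fabric = Fabric(accelerator="auto", devices=1)
fabric.launch()

model = torch.nn.Linear(100, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
model, optimizer = fabric.setup(model, optimizer)  # moves the model to the right device

x = torch.randn(32, 100, device=fabric.device)
y = torch.randint(0, 2, (32,), device=fabric.device)

loss = torch.nn.functional.cross_entropy(model(x), y)
fabric.backward(loss)  # replaces loss.backward()
optimizer.step()
optimizer.zero_grad()
```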
Easy to miss in the @pytorch 2.0 release notes: they've added a small but useful feature. torch.device, which previously just returned a device object, can now be used as a context manager. 0/8
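In practice, the new context manager looks like this (a minimal sketch; the device string is chosen just for illustration):

```python
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# PyTorch >= 2.0: a torch.device can be used as a context manager, so tensors
# and module parameters created inside the block land on that device directly.
with device:
    layer = torch.nn.Linear(128, 10)  # weights allocated on `device`
    x = torch.randn(4, 128)           # also on `device`

print(layer.weight.device, x.device)
```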
What an awesome week for open source and the PyTorch ecosystem with three big launches! - PyTorch 2.0 - Lightning Trainer 2.0 for PyTorch - Fabric for PyTorch! Just updated my "faster PyTorch" article to include the latest tools! 🔗 sebastianraschka.com/blog/2023/pyto…
Btw if you have an hour to spare on the weekend, I am covering how to use the Trainer to fit PyTorch models in the newly released Unit 5: lightning.ai/pages/courses/…
But note that vision transformers (ViTs) are not free from any inductive biases! ViTs focus more on global relationships due to the patchification & self-attention mechanism, which often leads to the perception that they act as low-pass filters, emphasizing (or recognizing)…
Similar to fully-connected networks, the ViT architecture (and transformer architecture in general) lacks the inductive bias for spatial invariance/equivariance that convolutional networks have. Consequently, ViTs require more data for pretraining to acquire useful "priors" from…
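A minimal sketch of the patchification step mentioned above (shapes and layer sizes are illustrative, not from a specific paper): the image is chopped into fixed-size patches by a single strided convolution, and any spatial structure beyond that must be learned from data via position embeddings and global self-attention rather than a built-in locality prior.

```python
import torch
import torch.nn as nn

# ViT-style patch embedding: a 224x224 RGB image becomes 14x14 = 196 patches
# of size 16x16, each projected to a 768-dim token. There is no convolutional
# weight sharing beyond this one layer, so spatial priors are not baked in.
patch_embed = nn.Conv2d(in_channels=3, out_channels=768, kernel_size=16, stride=16)

img = torch.randn(1, 3, 224, 224)
tokens = patch_embed(img).flatten(2).transpose(1, 2)
print(tokens.shape)  # torch.Size([1, 196, 768])
```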
Plot twist: they didn’t disclose any details because there was actually no innovation to report, just a bit more finetuning, which would have looked underwhelming given all the hype. By not sharing any details they made GPT-4 seem like a bigger innovation/deal than it really is.
GPT-4 was interesting for a hot second, I'll admit. But today is a new day: time to move on and get back to discussing research and open source.
Exactly! This is a great example of how CVPR publicity restrictions very effectively prevent unfair public visibility on social media for research from prestigious institutions like the Ivy League. Oh wait…
For a hot second, I was wondering how relevant weight decay still is. Instead of asking ChatGPT, I ran a simple experiment (*when a picture says more than a thousand words*)
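A minimal sketch of the kind of comparison described above (the model, data, and hyperparameters are placeholders; the original experiment's details aren't shown here): train the same architecture twice, once with and once without weight decay, and compare the resulting validation curves.

```python
import torch

def make_model():
    # Hypothetical tiny model, seeded so both runs start from identical weights.
    torch.manual_seed(123)
    return torch.nn.Linear(100, 10)

model_wd, model_no_wd = make_model(), make_model()

# The only difference between the two runs is the weight_decay setting.
opt_wd = torch.optim.AdamW(model_wd.parameters(), lr=1e-3, weight_decay=0.01)
opt_no_wd = torch.optim.AdamW(model_no_wd.parameters(), lr=1e-3, weight_decay=0.0)
```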
Some unsolicited writing advice when you are trying to fit things into an 8-page paper limit / 1-page rebuttal limit:
- "... without any use of bias units ..." → "... without using bias units ..."
- "Does LayerNorm have an effect on ...?" → "Does LayerNorm affect ...?"
- "The theorem does not…"
Classical Theory: garbage in garbage out Minor domain shift: gold in garbage out Diffusion models: garbage in gold out
New AI research & news everywhere! A short post on my personal approach to keeping up with things. sebastianraschka.com/blog/2023/keep…
🤔 Want to improve your PyTorch model's training performance without sacrificing accuracy? Learn how you can cut training time on a single GPU from 22.53 mins to 2.75 mins and maintain prediction accuracy🤯🚀 Check out this blog by @rasbt: lightning.ai/pages/communit… #PyTorch…
This blog on attention and the intuition by @rasbt is so well written! Bonus: It also has the code to run with. sebastianraschka.com/blog/2023/self…
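For readers who want the gist without opening the post, here is a minimal sketch of scaled dot-product self-attention (variable names and sizes are illustrative, not taken from the blog):

```python
import torch

torch.manual_seed(0)
d = 64                                     # embedding dimension
x = torch.randn(1, 10, d)                  # (batch, seq_len, embed_dim)

# Learnable projections in a real model; random matrices here for illustration.
W_q, W_k, W_v = (torch.randn(d, d) for _ in range(3))
q, k, v = x @ W_q, x @ W_k, x @ W_v

scores = q @ k.transpose(-2, -1) / d**0.5  # pairwise similarity, scaled
weights = torch.softmax(scores, dim=-1)    # each row sums to 1
context = weights @ v                      # attention-weighted values
print(context.shape)                       # torch.Size([1, 10, 64])
```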
Reviewers: If you ask the authors to do something and they follow through successfully, or you made a claim the authors successfully refuted, then you need to be prepared to change your recommendation to positive. #ICML2023 1/3
We're excited to release Lit-LLaMA🦙, a minimal, optimized rewrite of LLaMA for training and inference licensed under Apache 2.0 🎉 Check out the repo👉👉 github.com/Lightning-AI/l…
I am old enough to remember people cheering AI when it was defeating the human Go champion
Want to get into AI? My book is a 775-page journey from the fundamentals of machine learning to finetuning large language models. Today is the last day to catch the "Machine Learning with PyTorch & Scikit-Learn" book during the Spring Sale – 25% off! 🌱📚 amazon.com/Machine-Learni…
I didn't sign "the letter". Current AI poses lots of risks, but describing these systems as "ever more powerful digital minds" that no one can control is likely to make the problem even worse. What's needed: more transparency and better public discourse.
Speaking of better public discourse, @MelMitchell1 has an excellent newsletter/blog on the state & problems of large language models. (It's free from attention-seeking headlines, and thus probably not as popular as it should be.) Highly recommended: aiguide.substack.com
I didn't sign "the letter". Current AI poses lots of risks, but describing these systems as "ever more powerful digital minds" that no one can control is likely to make the problem even worse. What's needed: more transparency and better public discourse.
Open Source Sunday! Just released a new version of MLxtend: rasbt.github.io/mlxtend/ Featuring - a snappier ExhaustiveFeatureSelector - the H-Mine frequent pattern mining algorithm - multiprocessing for plot_decision_regions. Thanks to contributors Fatih Sen, Nima Sarajpoor & others
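A quick usage sketch for the ExhaustiveFeatureSelector mentioned above (dataset and estimator are illustrative; see the docs linked above for the exact API):

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from mlxtend.feature_selection import ExhaustiveFeatureSelector

X, y = load_iris(return_X_y=True)

# Evaluate every feature subset of size 1-3 with 5-fold cross-validation.
efs = ExhaustiveFeatureSelector(
    LogisticRegression(max_iter=1000),
    min_features=1,
    max_features=3,
    scoring="accuracy",
    cv=5,
)
efs = efs.fit(X, y)
print(efs.best_idx_, efs.best_score_)
```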