Omiita @omiita_atiimo Twitter Profile

Omiita

@omiita_atiimo

ML Engineer / M. Eng. / Book: "Vision Transformer入門"(https://t.co/V9mJEOQp7q) / Blog: https://t.co/nEStqseyZK

243Posts 6KFollowers 0Following

Similar User

@hillbig

@ogawa_yutaro_22

@CVpaperChalleng

@ai_scholar

@DL_Hacks

@SaitohKoki

@goto_yuta_

@stateofai_ja

@ImAI_Eruel

@PreferredNetJP

@sammy_suyama

@shinmura0

@AkiraTOSEI

@icoxfog417

@mi141

Omiita Reposted

OpenAI

@OpenAI

12 Sep

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…

Introducing OpenAI o1

Source: https://t.co/peKzzKX1bu

Omiita Reposted

elvis

@omarsar0

23 Jul

Llama 3.1 is here! 8B, 70B, and 405B versions are available.

Omiita

@omiita_atiimo

18 Apr

Llama3(70B/8B)出ました。 llama.meta.com/llama3/ • 70BはClaude 3 Sonnetに勝ってる模様 • 8BもMistral/Gemmaを圧倒していて、期待を上回る性能 • さらに進行形で400Bも学習中とのこと。すでにClaude 3 Opusにも並びそうなベンチマーク結果を叩き出している

Omiita

@omiita_atiimo

11 Mar

LLMをチューニングしたい人にとって、かなり有益な内容でした。非常に勉強になりました。レポ: github.com/hiroshi-matsud… #NLP2024

hiroshi matsuda

@hmtd223

11 Mar

本日13:00にスタートする #NLP2024 で「チュートリアル３：作って学ぶ日本語大規模言語モデル」の講師を私が担当します。日本語LLMの成り立ちについて、学習・推論の実行方法を含めて解説します。 anlp.jp/nlp2024/#tutor…

GitHub - hiroshi-matsuda-rit/NLP2024-tutorial-3: NLP2024 チュートリアル３作って学ぶ日本語大規模言語モデル - 環境構築手順とソースコード...

Source: https://t.co/ikOlE2SDb9

Omiita Reposted

OpenAI

@OpenAI

15 Feb

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. openai.com/sora Prompt: “Beautiful, snowy…

Omiita

@omiita_atiimo

23 Oct 2023

非常に楽しかった #abcillm

Omiita

@omiita_atiimo

28 Jun 2023

中国語のベンチマークで、GPT-4超えモデルが出てきたとのこと。ユーザーによる報告のため鵜呑みにできないかもしれないが、中国語ではGPT-4/ChatGPT超えモデルがいくつか出ているのはすごい。（中国語を主言語としていないGPT-4/ChatGPTもすごいのだが。） cevalbenchmark.com/static/leaderb… （↑PC推奨）

Omiita

@omiita_atiimo

17 May 2023

rinnaからも36億パラメータの日本語LLMが出てたのか！しかもinstruction-tuningしたバージョンも出してくれている。（ありがとうございます！）日本語LLM界隈、最近きてるな huggingface.co/rinna/japanese…

rinna/japanese-gpt-neox-3.6b-instruction-sft · Hugging Face

Source: https://t.co/IdEc0Wemrr

Omiita

@omiita_atiimo

17 May 2023

日本語で事前学習されたLLMは激アツすぎる。しかも商用利用可能！ありがとうございます！

サイバーエージェント　広報＆IR

@CyberAgent_PR

17 May 2023

当社が開発した「最大68億パラメータの日本語LLM」を商用利用可能なライセンスで公開いたしました。本モデルをベースにチューニングを行うことで、対話型AI等の開発が可能です。今後もモデル公開や産学連携を通し、国内における自然言語処理技術の発展に貢献してまいります。 cyberagent.co.jp/news/detail/id…

Omiita

@omiita_atiimo

6 May 2023

ついにLLaMA-7Bと同等の性能を持つ商用利用可能なLLM「MPT-7B」が登場！ブログ：mosaicml.com/blog/mpt-7b デモ：huggingface.co/spaces/mosaicm… 下図はZero-shot性能の結果です。StableLM / Pythia / Cerebrasなど最近の商用利用可能なLLMと比べてもMPT-7Bがかなり良いことが分かります。

Omiita

@omiita_atiimo

4 May 2023

なんと第4刷！ありがとうございます！ Self-AttentionやVision Transformerの仕組みをゼロから理解したい方はぜひ！

技術評論社販売促進部

@gihyo_hansoku

21 Apr 2023

【好評につき第4刷】片岡裕雄さん監修，山本晋太郎さん，徳永匡臣さん，箕浦大晃さん，邱玥さん，品川政太朗さん執筆の『Vision Transformer入門』の増刷が決定！注目のViTのしくみと応用先がわかるとともに，コンピュータビジョン分野の最新状況を概観できます。gihyo.jp/book/2022/978-…