Fun

Arbeits

AI时代个人生存/摸鱼探索指南.Beta by 余一.Dev
AI变革公司/产业实践探索：从2023 年报，看中国上市公司怎么使用生成式AI.
qdaily_backup by LampScript: 好奇心日报备份计划。
WorkingTime by WorkerLivesMatter: 公司作息表，文档链接(离线版本)，如何看待近期在互联网人之间流传的「公司作息表」？各大公司上下班时间的真实情况是怎样的？。
行行查

Art & Design

Animation
- Live2D: a software technology that allows you to create dynamic expressions that breathe life into an original 2D illustration.
- Anime-Girls-Holding-Programming-Books by layn
- ChineseBQB: Chinese sticker pack,More joy / 表情包的博物馆, Github最有毒的仓库, 中国表情包大集合, 聚欢乐~, homepage
- SCOTT PARTRIDGE: 美国艺术家©Scott Partridge 创作的世界鸟类平面化图鉴。
- XKCD中文站: 一个奇奇怪怪的科学漫画站，中文1051 篇，英文2721 篇。
- Refined-Anime-Text: 一份包含超过一百万条、约4400万个 GPT-4/3.5 token的、全新合成的文本数据集的动漫主题子集。
- ZuoMeme: 在线制作MEME梗图生成器。
Art and Design
- WikiArt: 收录世界名画的维基网站.
- ArtStation: 新年重磅！Artstation 大师级艺术课程全部免费开放
Decoration
- Rich: a Python library for rich text and beautiful formatting in the terminal.
- RunCat_for_windows: A cute running cat animation on your windows taskbar.
Front-End
- Streamlit: 算法不会前端，也可以做出好看的界面-Streamlit工具
- Gradio: Gradio: 让机器学习算法秒变小程序
Github
- Github.io:
- Hello Github
- GitDown: Github文件夹下载.
- Design your Github.io style
- ithub-readme-stats by Anurag Hazra: Github首页动态生成状态图之一.
- Metrics: Github首页动态生成状态图之二.
- Moe-counter: Github首页动态生成状态图之三，多种风格可选的萌萌计数器，demo.
- emoji by Lee Reilly: 可在Markdown使用的开源emoji素材库.
- Emoji大全
- Emoji: emoji terminal output for python.
- Your3dEmoji
- Fluentui-Emoji: a collection of familiar, friendly, and modern emoji from microsoft.
- WikiEmoji
- AI Emoji Generator
ICON
- feather by Feather with homepage: 简洁开源的icon素材库.
- iconfont+(注册): 幻灯片制作矢量图素材库.
- free-font: 2020年最全的免费可商用字体.
- Lucide: beautiful & consistent icon toolkit made by the community, open-source project and a fork of feather icons.
- 得意黑 Smiley Sans: 一款在人文观感和几何特征中寻找平衡的中文黑体.
- Logo Galleria: 在线logo生成工具，默认免费免登录.
Machine Learning & Neural Networks
- Neural Network Architecture Diagrams: diagrams for visualizing neural network architecture (created with diagrams.net)
- Python Graphs(paper): A LIBRARY FOR REPRESENTING PYTHON PROGRAMS AS GRAPHS FOR MACHINE LEARNING.
- ML Visuals: 有了这个机器学习画图神器，论文、博客都可以事半功倍了！.
- 22个神经网络结构设计及可视化开源工具整理分享 | 我不爱机器学习 2023-02-16
Python
- 详解可视化神器 seaborn，制作图形又快又美！
- 从图片中提取颜色然后绘制成可视化图表
- DearPyGui: a fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies.
- Aquarel: a lightweight templating engine and wrapper around Matplotlibs' rcparams to make styling plots simple.
- CuteCharts: 又一款超酷的可视化神器.
- invisible-watermark: python library for invisible image watermark (blind image watermark).
- BertViz: BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.), see intro.
UI
- Textual with code: a TUI (Text User Interface) framework for python inspired by modern web development.

Biology

blog:
- 语言模型生成了自然界不存在的蛋白质，图灵奖得主LeCun：蛋白质编程来了 | 机器之心 2022-12-23

Book

Some-Many-Books: Dujltqzv个人收藏书籍电子版列表。

City Life & Transport

Self-building
- HowToCook: 程序员在家做饭方法指南。
- HowToLiveLonger: 程序员延寿指南。
- HumanSystemOptimization: 健康学习到150岁——人体系统调优不完全指南。
- RehabilitationGuide: 为程序员群体提供简单可靠的颈椎病/腰突康复指南。
- 三联生活周刊：低潮生活躺平指南｜40条小建议
School & University Life
- university-information: 一些大学的生活质量。
City Living

Computer Language

《The Go Programming Language》(GO语言圣经，GOPL): 中文版.
Go-Course by Karan Pratap Singh: master the fundamentals and advanced features of the Go programming language.
ugo-compiler-book by 凹语言™ : µGo语言实现(从头开发一个迷你Go语言编译器)，使用Go版本与Rust版本。
OneFile: 汇集了仅一个文件，好玩的开源项目，访问页面。
深入理解函数式编程（上）, 深入理解函数式编程（下）。

Culture & History

全历史: 以知识图谱为核心引擎的历史知识网站.
小鸡词典: 网络流行语速查百科.
萌娘百科: 由MediaWiki软件支持的非商业ACGN主题在线百科全书, “秉承‘万物皆可萌’的精神原则”.
Wayback Machine: 网页时光机，输入任意网址，恢复显示网址对应的页面。

Economy & Finance

如何快速了解一个行业？| 方法论 from 很帅的投资客（微信号：shuai_investor）
境内可用的AI分析工具 from 很帅的投资客（微信号：shuai_investor）

Language

VocabularyMap by Niannian Zhang: 将学习过的英语单词词根及频率较高单词通过思维导图的形式不断联想，扩充词汇量，帮助英语学习者解决记单词的困难。
English-level-up-tips-for-Chinese by byoungd: 英语进阶指南。
Awesome-Japanese by Siye Sam Yu.
Grammarly: 语法检查App.
日语语法指南(Learn Japanese): 本教程原文为 Tae Kim 所写的《Japanese Grammar Guide》, 资源2。
Effective Language Learning.
English-Writing: enhance your English writing.
乡音: 汇集各地方言，内容可在线听，也可录制上传自己的声音。

Laws & Regulations

TopJudge
中国法律服务网: 智能法律咨询

Literature & Writing

豆坟
THUAIPoet (Jiuge, 九歌)/THUNLP-MT
expert_readed_books by 0voice
aichpoem by wangjiezju1988
Style-Transfer-in-Text by Zhenxin Fu
Internet Archive: a non-profit library of millions of free books, movies, software, music, websites, and more.
Ren'Py: The Ren'Py Visual Novel Engine.
lifeRestart: 人生重开模拟器.
喵语言(原理相关：零宽字符加密).
pua-lang: a dialect of The Monkey Programming Language.
智能创作助手 with paper.
AFFiNE: there can be more than Notion and Miro. Affine is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable and ready to use, see homepage.
Twinejs: a tool for telling interactive, nonlinear stories.
书格: 一个自由开放的在线古籍图书馆。致力于开放式分享、介绍、推荐有价值的古籍善本，并鼓励将文化艺术作品数字化归档。发布的书籍主要以高清彩色影像版本 PDF 格式，大部分书籍书籍单页宽度在 1400 像素以上，跨页宽度在 2400 像素以上。书籍刊行年代有从宋元珍本，明清善本到近代刊本。
CSFDB 中文科幻数据库
中国哲学书电子化计划
全球最大的中文非虚构图书馆藏(仅限LLM公司使用)

Math & Computer

Repository
- DL4MATH - Reading List: reading list for research topics in mathematical reasoning and artificial intelligence.
- 24个运筹学优化算法包汇总
Tutorial
- Everything You Always Wanted To Know About Mathematics by Brendan W. Sullivan.
- The-Art-of-Linear-Algebra by Kenji Hiranabe: Graphic notes on Gilbert Strang's "Linear Algebra for Everyone".
- AI4Math？IJCAI2023最新《数学推理中的深度学习》教程，详述深度学习数学推理最新进展与未来展望，243页ppt | 专知 2023-08-29
Research
Competition
- “希望杯”全国数学邀请赛
- 数学新星网

Medical

ML for Health Lab Pub by van_der_Schaar Lab

Music & Instrument & Voice

DANGO（试用&付费）: 团子AI-人工智能提取伴奏人声.
声音汇: 普惠长笛教学，一站式乐器购买平台。
国际乐谱库（使用教程）.
OMNIZART:a python library that aims for democratizing automatic music transcription, i.e., acoustic music to midi-format file.
AIComposer: MIDI Mashups: Using Machine Learning to Generate Unique Musical Scores.
music21: a python-based toolkit for computer-aided musicology.
Python MIDI: a python-based toolkit to deal with midi-format files.
Midi-to-MusicSheet
Text-to-Music
- Mubert-Text-to-Music
- Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation.
- 当歌曲创作遇上大模型，无所不能的AI音乐家SongComposer | InternLM 2024-03-09
- 词曲创作只需几秒，「AI作曲家」Suno引爆音乐圈，第一手体验和攻略来了 | 机器之心 2024-03-25
- 音乐ChatGPT时刻来临！「天工SkyMusic」音乐大模型今日启动邀测 | 机器之心 2024-04-02
- 浙大发布歌曲合成工具Prompt-Singer，歌手性别风格均可控！ | 夕小瑶科技说 2024-04-02
- 音乐界Sora隆重发布！效果炸裂，超越Suno！根据指令生成定制音乐，原创续歌样样行！前谷歌Deepmind人员创建 | 夕小瑶科技说 2024-04-11
Speech Synthesis
Text-to-Speech
- VITS:
  - code: https://github.com/jaywalnut310/vits
  - project: https://jaywalnut310.github.io/vits-demo/index.html
  - paper: VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech.
- SoftVC VITS Singing Voice Conversion
- VITS-fast-fine-tuning
- VoiceBox:
  - paper: Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.
  - 语音领域的GPT时刻：Meta 发布「突破性」生成式语音系统，一个通用模型解决多项任务 | 机器之心 2023-06-17
- Branchformer:
  - 上海声通团队在WeNet中开源Branchformer | WeNet步行街 2023-06-27
- kNN-VC:
  - code: https://bshall.github.io/knn-vc/
  - paper: Voice Conversion With Just Nearest Neighbors
  - 支持跨语言、人声狗吠互换，仅利用最近邻的简单语音转换模型有多神奇 | 机器之心 2023-07-02
- XTTS:
  - code: https://github.com/coqui-ai/TTS
  - code(huggingface): https://huggingface.co/spaces/coqui/xtts
  - note: 开放音频基础模型，只需 3 行代码即可实现跨语言和多语言语音生成，支持3秒克隆、跨语言语音克隆、24khz质量。
- StableTTS:
  - code: https://github.com/KdaiP/StableTTS.
  - note: 轻量TTS模型，专为汉语和英语语音生成服务，参数仅有 10M。

Recommender system

deep-recommender-system: 深度学习在推荐系统中的应用及论文小结。
fun-rec: 推荐系统基础、推荐系统进阶和推荐系统应用教程，来自Datawhale。
DeepMatch: a deep matching model library for recommendations & advertising.
Recommend-System-tf2.0: 原理解析及代码实战，推荐算法也可以很简单。
Reco-papers: classic papers and resources on recommendation.
RecBole: a unified, comprehensive and efficient recommendation library.
AlgoNotes: “浅梦学习笔记”公众号文章汇总。
推荐系统百面百搭: 作者们根据个人面试和经验总结出的推荐系统(RES) 面试准备的学习笔记与资料。
推荐系统设计模式：大厂框架解析.

Reinforcement Learning

Survey & Tutorial:

Vision

AwesomeAnimeResearch by SerialLain3170
Text2All: a comprehensive list of resources about text-guided generative models.
满足你的各种绘图需求，生成式AI的8种新玩法
Neural Style Transfer
- AI-Art: about PyTorch (and PyTorch Lightning) implementation of Neural Style Transfer, Pix2Pix, CycleGAN, and Deep Dream.
- deepdream and its various approaches/tutorials: pytorch-deepdream, neural-dream, deep-dream-pytorch.
Generative Adversarial Network (GAN)
- Review
- Github List
- Github Lib
  - PyTorch-GAN
  - Mimicry
  - pix2pix-pytorch/pytorch-CycleGAN-and-pix2pix: 一对一有监督图像风格迁移
  - PyTorch-CycleGAN/DualGAN: 一对一无监督图像风格迁移
  - stargan: 一对多有监督图像风格迁移
  - UGATIT-pytorch: 高质量图像风格转换
  - animeGAN: 动漫风格迁移
  - StackGAN-Pytorch: 根据文字生成图片
  - BigGAN-PyTorch
- Application (for fun or horror?)
- GAN Model
  - awesome-pretrained-stylegan2
Image Caption
- 2021年7月：让机器学会看图说话：Image Caption任务最新综述(From Show to Tell: A Survey on Image Captioning)
- awesome-image-captioning
- Awesome-Visual-Captioning
- Awesome-Captioning
- ImageCaptioning.pytorch
- bili2text: Bilibili视频转文字，一步到位，输入链接即可使用.
Text-to-Image
- 2022年6月：文本生成图像这么火，你需要了解这些技术的演变
- Text2Art: Generate art from text with AI (VQGAN+CLIP).
- Awesome-Text-to-Image
- awesome-text-to-image-studies
- arbitrary-text-to-image-papers
- VQ-VAE: 超越BigGAN，DeepMind提出「史上最强非GAN生成器」VQ-VAE-2
- VQ-GAN(code, paper)
- DallEval
- Dream Fields and its official repository
- FuseDream
- GLIDE: 缩小规模，OpenAI文本生成图像新模型GLIDE用35亿参数媲美DALL-E(paper)
- DALL-E
  - dalle-mini
  - DALLE-pytorch
  - ru-dolph
  - dalle-playground
  - min(DALL·E): a minimal implementation of DALL·E Mini.
- CLIP : contrastive language-image pretraining.
  - open_clip
  - Chinese-CLIP
  - CLIPasso
  - ChieseClip: 中文「大大大大大」模型开源开放！从吟诗作画写代码到蛋白质预测全都有，源代码可编程API均奉上
- DALLE2-pytorch: implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch.
- Discoart: Create Disco Diffusion artworks in one line.
- PARTI(code): Pathways Autoregressive Text-to-Image model.
- Imagen(paper): Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding.
  - AI作画新高度！谷歌发布imagen，效果惊艳全场
  - extra: MinImagen(doc): a minimal implementation of the imagen text-to-image model.
- Modelverse: GAN、扩散模型应有尽有，CMU出品的生成模型专属搜索引擎Modelverse来了
- 基于GPT-2提示词训练的小项目: code & data.
- DreamFace: 上科大等发布DreamFace：只需文本即可生成「超写实3D数字人」
NeRF
- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
- Neural Radiance Fields (NeRF，神经辐射场)介绍
- NeRF神经渲染技术盘点
- 计图：5秒训好NeRF！已开源
- 从点云到NeRF，多伦多大学CSC 2547课程全面讲解3D计算机视觉
- 浅谈如何基于深度方法进行三维重建
- 大卫复活！英伟达再造「神经朗基罗」，3D重建肌肉纹理肉眼可见 | 新智元 2023-06-02
CityDreamer
- code: https://github.com/hzxie/city-dreamer
- paper
- demo: https://haozhexie.com/project/city-dreamer
- CityDreamer：一键生成无边界的3D城市
ViT(Vision in Transformer)
- 简单实现 ViT
- CVPR 2022 | 道高一尺，魔高一丈，ConvNet还是ViT？
- Awesome-Transformer-in-CV
- How GPT3 Works - Visualizations and Animations
- VIT-Pytorch
- Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Diffusion Model
- What are Diffusion Models?
- Diffusion Models专栏文章汇总：入门与实战
- 由浅入深了解Diffusion Model
- guided-diffusion
- diffusers
- Awesome-Diffusion-Models
- diffusion_tutorial: 扩散模型实例教程集
- 从大一统视角理解扩散模型（Diffusion Models）
- 生成扩散模型漫谈：条件控制生成结果
- NLP技术前沿动手实践：基于Huggingface的AIGC自动作图原理解析、实践与推理加速开源实操
- 妙鸭=SD + Lora? 对 SD+LoRA 的一些探索与验证
- paper:
- Understanding Diffusion Models: A Unified Perspective
- All about Flow
  - 一文详解基于流的深度生成模型
  - 生成扩散模型漫谈（一）：DDPM = 拆楼 + 建楼
  - 生成扩散模型漫谈（二）：DDPM = 自回归式VAE
  - 生成扩散模型漫谈：统一扩散模型（理论篇）
  - 扩散模型初探：原理及应用
- Waifu-Diffusion
  - 微博1，微博2
  - ChromedSets的推和教程
  - hakurei的开源
  - Waifu Diffusion Demo (Huggingface试玩)
- Stable-Diffusion(NovelAI ver.)
  - latent-diffusion by CompVis
  - stable-diffusion by CompVis
  - stable-diffusion-webui
  - Stability.AI Easy Diffusion
  - Stable Diffusion Tutorial
  - one more thing(s):
    - DreamBooth
    - Textual Inversion
  - blog:
- Stable-Diffusion(Chinese ver.)
  - 太乙
  - AltDiffusion
  - AltCLIP
  - Stable-Diffusion-Pokemon
  - MochiDiffusion: Run Stable Diffusion on Mac natively.
  - Stable Diffusion-XL: demo.
  - blog:
- A Beginner’s Guide to Prompt Design for Text-to-Image Generative Models
- ContralNet and its Web UI.
  - ControlNet大更新：仅靠提示词就能精准P图，保持画风不变，网友：效果堪比定制大模型 | 量子位 2023-05-15
- ChilloutMix with its huggingface page, with LoRA: Low-Rank Adaptation of Large Language Models.
- Shape·E(paper): OpenAI文本生成3D模型再升级，数秒完成建模，比Point·E更好用 | 机器之心 2023-05-14
- TextDiffuser
  - code: https://github.com/microsoft/unilm/tree/master/textdiffuser
  - paper:https://arxiv.org/abs/2305.10855
  - homepage: https://jingyechen.github.io/textdiffuser/
  - demo: https://huggingface.co/spaces/microsoft/TextDiffuser
  - blog: 2023年09月25日：无惧图像中的文字，TextDiffuser提供更高质量文本渲染
- CV-oriented Search Engine
  - Modelverse: GAN、扩散模型应有尽有，CMU出品的生成模型专属搜索引擎Modelverse来了
  - StockAI: 新型AI图片库，会根据你的查询自动创建图片，可免费下载。
  - Lexica: Stable Diffusion生成图片的搜索引擎，最新升级支持文本提示图片生成。
  - Atlas: 非结构化资源检索工具。
  - awesome-ai-painting: AI数字绘画资料汇总。
  - CivitAL: C站，模型分享社区。
- Stable LM
  - code: https://github.com/stability-AI/stableLM/
  - demo: https://huggingface.co/spaces/stabilityai/stablelm-tuned-alpha-chat
  - blog:
    - Stable Diffusion的开发商Stability AI开源大语言模型Stable LM | AI 共存派 2023-04-20
- UniDiffuser
  - code: https://github.com/thu-ml/unidiffuser
  - paper: One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale.
  - blog:
    - 清华朱军团队开源首个基于Transformer的多模态扩散大模型，文图互生、改写全拿下 | 机器之心 2023-03-13
- CoDi
  - code: https://github.com/microsoft/i-Code/tree/main/i-Code-V3
  - paper: Any-to-Any Generation via Composable Diffusion
  - homepage: https://codi-gen.github.io/
  - blog:
    - 可组合扩散模型主打Any-to-Any生成：文本、图像、视频、音频全都行 | 机器之心 2023-05-23
    - 「大一统」大模型论文爆火，4种模态任意输入输出，华人本科生5篇顶会一作，网友：近期最不可思议的论文 | 量子位 2023-05-28
- World-from-Eyes:
  - 眼球反射解锁3D世界，黑镜成真！马里兰华人新作炸翻科幻迷 | 新智元 2023-06-17
- DragDiffusion:
  - DragGAN重磅开源！扩散模型版的DragDiffusion也来了！ | CVer 2023-06-29
- Stable Diffusion 3
  - Stable Diffusion 3突然发布！与Sora同架构，一切都更逼真了 | 量子位 2024-02-23
  - Stable Diffusion 3更多隐藏功能曝光：文字可更改图片细节 | 量子位 2024-02-23
Text-to-Video
- 图像生成卷腻了，谷歌全面转向文字→视频生成，两大利器同时挑战分辨率和长度
- Phenaki with paper
- 一句话拍大片，导演末日来了！Gen-2震撼发布，科幻日系二次元统统拿捏
- 也看文本生成短视频开源项目Open Chat Video Editor：从依赖数据集到具体实现逻辑解析
- W.A.L.T：李飞飞谷歌破局之作！用Transformer生成逼真视频，下一个Pika来了？ | 新智元 2023-12-12
Segmentation
- SAM(Segment Anything):
  - code: https://github.com/facebookresearch/segment-anything
  - paper: Segment Anything.
  - demo: https://segment-anything.com/
  - extra:
    - Awesome-Segment-Anything
  - blog:
    - CV不存在了？Meta发布CV届的GPT模型「SAM」，可以分割一切 | 夕小瑶的卖萌屋 2023-04-07
    - CV大一统模型的第一步！Segment Anything Model 最全解读！ | Datawhale 2023-04-07
- SALT(Segment Anything Labelling Tool):
  - code: https://github.com/anuragxel/salt
- Image2Paragraph:
  - code: https://github.com/showlab/Image2Paragraph
  - blog:
    - 从Blip2到Segment Anything视觉语义金字塔+ChatGPT= 把图片变文本段落， 8G显存即可Run | 我爱计算机视觉 2023-04-17
- SAM-Adapter:
  - code: https://github.com/tianrun-chen/SAM-Adaptor-PyTorch
  - paper: SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, and More.
  - demo
  - blog:
    - SAM无法分割一切？SAM-Adapter：首次让SAM在下游任务适应调优！ | CVer 2023-04-19
- SEEM:
  - demo: https://huggingface.co/spaces/xdecoder/SEEM
  - paper: Segment Everything Everywhere All at Once.
  - blog:
    - 华人团队颠覆CV！SEEM完美分割一切爆火，一键分割「瞬息全宇宙」 | PaperWeekly 2023-04-24
Consistency models
- code: https://github.com/openai/consistency_models
- blog: 图像生成终结扩散模型，OpenAI「一致性模型」加冕！GAN的速度一步生图，高达18FPS | 新智元 2023-04-13
LLaVA
- code: https://github.com/haotian-liu/LLaVA
- data: https://huggingface.co/datasets/liuhaotian/LLaVA-Instruct-150K
- model: https://huggingface.co/liuhaotian/LLaVA-13b-delta-v0
- paper: Visual Instruction Tuning
- demo
- blog:
  - Visual Instruction Tuning: 用LLaVA近似多模态GPT-4 | PaperWeekly 2023-04-19
- extra:
  - Awesome-Visual-Instruction-Tuning - Latest Papers and Datasets on Visual Instruction Tuning.
SPAE
- code: https://github.com/google-research/magvit/
- paper: SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
- blog:
  - 谷歌新作SPAE：GPT等大语言模型可以通过上下文学习解决视觉任务 | CVer 2023-07-08
AnimateAnyone
- blog:
  - 兵马俑跳《科目三》，是我万万没想到的 | 量子位 2024-01-04
Sora
- blog:
  - 春节大礼包！OpenAI首个视频生成模型发布，60秒高清大作，网友已叹服 | 机器之心 2024-02-16
  - 我在模拟世界！OpenAI刚刚公布Sora技术细节：是数据驱动物理引擎 | 机器之能 2024-02-16
  - OpenAI文生视频方案Sora技术浅析：兼看知识图谱与多模态的融合工作 | 老刘说NLP 2024-02-16
  - 真·降维打击，Sora与Runway、Pika的对比来了，震撼效果背后是物理引擎模拟现实世界 | 机器之心 2024-02-17
  - 后Sora时代，CV从业者如何选择模型？卷积还是ViT，监督学习还是CLIP范式 | 机器之心 2024-02-18
  - Sora背后团队：应届博士带队，00后入列，还专门招了艺术生 | 量子位 2024-02-18
  - 复刻Sora有多难？一张图带你读懂Sora的技术路径 | 魔搭ModelScope社区 2024-02-17
  - Sora为何出自OpenAI？一线员工作息时间线揭秘：我们疯狂地卷 | 机器之心 2024-02-21
  - Sora物理悖谬的几何解释 | 集智俱乐部 2024-02-22
  - 北大发起复现Sora，框架已搭！袁粒田永鸿领衔，AnimateDiff大神响应 | 量子位 2024-03-03
  - 被误解的「中文版Sora」背后，字节跳动有哪些技术？ | 机器之心 2024-03-12
  - 没等来OpenAI，等来了Open-Sora全面开源 | 机器之心 2024-03-18
  - 微软新作「Mora」，复原了Sora | 夕小瑶科技说 2024-03-22
  - Sora之后，OpenAI Lilian Weng亲自撰文教你从头设计视频生成扩散模型 | 机器之心 2024-04-22
Latte
- blog:
  - 详解Latte：去年底上线的全球首个开源文生视频DiT | 机器之心 2024-03-27
Flag-DiT
- blog:
  - DiT架构大一统：一个框架集成图像、视频、音频和3D生成，可编辑、能试玩 | 机器之心 2024-05-12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fun.md

Fun.md

Fun

Arbeits

Art & Design

Biology

Book

City Life & Transport

Computer Language

Culture & History

Economy & Finance

Language

Laws & Regulations

Literature & Writing

Math & Computer

Medical

Music & Instrument & Voice

Recommender system

Reinforcement Learning

Vision

Files

Fun.md

Latest commit

History

Fun.md

File metadata and controls

Fun

Arbeits

Art & Design

Biology

Book

City Life & Transport

Computer Language

Culture & History

Economy & Finance

Language

Laws & Regulations

Literature & Writing

Math & Computer

Medical

Music & Instrument & Voice

Recommender system

Reinforcement Learning

Vision