Summary of recent AI information articles (for personal use)

1. AIGC

1.1 AIGC

In addition to the ordinary text-generating image function, you can also input images to generate similar images, input images to generate text, input text to generate similar text, image semantic decoupling editing, input images and text to generate videos, edit image content based on latent space, etc. . Future versions will also support more modes such as voice, music, video and 3D. According to the paper, it has been proven that VD and its underlying framework have the following advantages:
- All subtasks can be processed with competitive high quality.
- Support new extensions and applications, such as the separation of graphic style and semantics, image-text dual guidance generation, etc.
- These experiments and applications provide richer semantic insights into the generated output.

1.2 ChatGPT series

1.3 Google Series

1.4 CV

1.4.1 Stable Diffusion
1.4.2 Smart CV

1.5 Code generation

  • AI code generator CodeWhisperer: Autumn leaves introduction video , official website registration
  • codeinterpreter-api : Open source implementation of the ChatGPT code interpreter
  • CodeGeeX2-6B : CodeGeeX2 is the second generation model of the multi-language code generation model CodeGeeX, which is implemented by injecting code based on the ChatGLM2 architecture.
  • Copilot Chat: VSCode’s latest plug-in, GitHub Copilot’s new feature—Copilot ChatAI chat assistant is coming!

1.6 Domestic

  • "ChatGPT, Academic Professional Edition of the Chinese Academy of Sciences" : The project is aimed at the daily scientific research work of the Chinese Academy of Sciences. It has customized a set of practical functions based on ChatGPT to optimize academic research and develop daily workflow. The built-in tools include but are not limited to the following: one-click polishing of academic papers, grammatical error search; fast translation between Chinese and English; one-click code explanation; shortcut key customization; high-level experimental modular design; project source code self-analysis ; Intelligently read papers and generate abstracts.

  • Llama2-Chinese : An advanced technical community focusing on the optimization and upper-layer construction of the Llama model in Chinese, including Chinese corpus, model deployment, fine-tuning, etc.

  • Colossal-AI : The world's largest and most active large model development tool and community, providing out-of-the-box LLaMA2 training, fine-tuning, and inference solutions from 8 to 512 cards, accelerating 70 billion parameter training by 195%, and providing one-stop Cloud platform solutions greatly reduce the cost of large model development and implementation.

1.7 Google plug-in

  • "ChatGPT plug-in recommended, efficiency increased by 100 times!"

  • Immersive translation : The most useful translation plug-in at present, it can perform one-click translation on various web pages while retaining the original English text, making it convenient for you to compare Chinese and English. Even the English subtitles in the video can be translated.

    • Export bilingual e-books with one click and support real-time bilingual translation of PDF, subtitles, TXT and other files.
    • Innovative mouse-over translation - Just hover your mouse over any paragraph on any web page and the corresponding translation will immediately appear below the paragraph.
    • In-depth customization and optimization of mainstream websites - Optimize mainstream websites such as Google, Twitter, Reddit, YouTube, Bloomberg, and Wall Street Journal to make searching, social networking, and obtaining information smoother and more efficient.
      Insert image description here
  • WebChatGPT : Make ChatGPT have Internet access function and integrate various prompts. Note: Please turn off this function when you do not need to search for items online, otherwise all your prompts will be used for searching items online.
    Insert image description here

  • ChatGPT File Uploader : After installation, the chatgpt page will have an additionalsubmit filebutton for uploading files. Similar functions include ChatGPT File Uploader Extended .

  • ChatGPT to Markdown : After installation, a small M icon will appear in the upper right corner of the chatgpt conversation page. After clicking, a window will pop up, which contains the markdown format of the conversation content and is displayed in real time. In other words, the newly added conversation content will be displayed in the window immediately, which is very convenient and very useful for those who often ask questions and write blogs.

  • MaxAI.me : Use ChatGPT AI on any web page. You can select a piece of text on any web page to operate, and then easily select the required operation (translate, summarize, continue, explain, run...), and supports various mainstream AI models. Similar functions include Monica (AI co-pilot) .
    Insert image description here
    Insert image description here

  • Web2Markdown : Converts the content of the current page to Markdown format. This conversion preserves the text, titles, links, images, and other elements of the web page, allowing you to save and share your website content in a concise and readable way.

Insert image description here

As shown in the picture above, it is fixed in the browser plug-in bar after installation. The above is my blog on csdn. Click the Web2Markdown plug-in icon, and the markdown content window will pop up in a few seconds. Just copy it with one click.

  • YouTube Summary with ChatGPT & Claude : In the AI ​​​​era, the fastest way to watch videos is not 2x speed, 3x speed, or 10x speed, but to let AI watch it for you. After installation, a plug-in icon will appear on the YouTube website. After clicking ittranscript&summary, a pop-up window will generate the entire video content in seconds, and you can choose your own language. At the time point of each paragraph, the video will automatically jump to the corresponding position after clicking. Click the chatgpt icon to send the content to chatgpt, and the button on the far right can also copy the entire video text.
    Insert image description here

  • Mr.-Ranedeer-AI-Tutor : The author created chatgpt into a professional AI tutor by writing a prompt word of 7800 tokens to guide you in course study. To me, it is the same as the function of a plug-in. Just choose to continue this conversationthrough the author's shared link , then enter /language Simplified Chinese and switch to Chinese mode first. /config sets the tutor type, /plan sets the course, and /start starts learning. The following is the detailed configuration
    Insert image description here
    Insert image description here

2. Deep learning

2.1 NLP

"RoPE may be Resnet in the LLM era" : Resnet solves the problem of gradient disappearance after the convolution model becomes deeper, making the depth model shine. RoPE (positional encoding) similarly solves the problem of context inability to associate when the LLM context is too long.

2.2 CV

2.3 LLM

2.4 Neural Network

3. Machine Learning

4. Competition

5. Tools

Guess you like

Origin blog.csdn.net/qq_56591814/article/details/128194422