LangBot is an open-source, multimodal chatbot platform designed for seamless integration with various instant messaging platforms like QQ, WeChat, Feishu, and Discord. It supports multiple large language models (LLMs) such as ChatGPT, DeepSeek, and Gemini, enabling text, voice, and image interactions. LangBot includes built-in features like access control, rate limiting, and sensitive word filtering to ensure stability and security. It also offers a plugin system and a web management panel for customization and easy bot management.
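As a rough illustration of what per-user rate limiting and sensitive word filtering can look like in a chatbot pipeline, here is a minimal Python sketch. It is not LangBot's actual implementation: the names `SlidingWindowLimiter` and `mask_sensitive`, the sliding-window strategy, and the asterisk masking are all assumptions chosen for the example.

```python
import time
from collections import defaultdict, deque


class SlidingWindowLimiter:
    """Allow at most `limit` messages per `window` seconds for each user."""

    def __init__(self, limit, window, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock  # injectable so the limiter can be tested
        self.history = defaultdict(deque)  # user id -> recent timestamps

    def allow(self, user_id):
        now = self.clock()
        stamps = self.history[user_id]
        # Drop timestamps that have fallen out of the window.
        while stamps and now - stamps[0] >= self.window:
            stamps.popleft()
        if len(stamps) >= self.limit:
            return False
        stamps.append(now)
        return True


def mask_sensitive(text, banned_words):
    """Replace each banned word with asterisks (case-sensitive, illustrative)."""
    for word in banned_words:
        text = text.replace(word, "*" * len(word))
    return text
```

A bot would typically call `allow(user_id)` before forwarding a message to the model and run `mask_sensitive` on the text going in or coming out; a production filter would also need case folding and word-boundary handling.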
AI co-scientist is a multi-agent AI system developed by Google, designed to function as a virtual research assistant. It helps researchers with scientific tasks such as research topic selection, literature review, and experimental design. Built on Gemini 2.0, the system uses specialized agents for generation, reflection, ranking, and evolution that iteratively propose, critique, and refine ideas, mirroring the steps of the scientific research process. It can understand research objectives, generate novel hypotheses and research plans, and improve its reasoning by scaling "test-time compute." AI co-scientist has shown promising results in areas such as drug repurposing, target discovery, and antibiotic resistance mechanisms, suggesting its potential to accelerate scientific discovery.
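The generation, reflection, ranking, and evolution cycle can be pictured as a simple iterative search. The Python sketch below is a toy model under stated assumptions, not Google's system: `co_scientist_loop`, `review`, and `evolve` are hypothetical names, and plain callables stand in for the LLM-backed agents.

```python
def co_scientist_loop(seed_hypotheses, review, evolve, rounds=3, keep=2):
    """Toy generate -> reflect -> rank -> evolve loop.

    review(h) returns a numeric score (stand-in for a reflection agent);
    evolve(h) returns a modified hypothesis (stand-in for an evolution agent).
    """
    pool = list(seed_hypotheses)
    for _ in range(rounds):
        # Reflection + ranking: score every hypothesis, best first.
        ranked = sorted(pool, key=review, reverse=True)
        survivors = ranked[:keep]
        # Evolution: refine the survivors and add the variants to the pool.
        pool = survivors + [evolve(h) for h in survivors]
    return sorted(pool, key=review, reverse=True)
```

Spending more rounds on the same objective is one simple way to picture trading extra inference-time computation for better candidates.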
We0 is an open-source AI code editor tailored for developers and product managers. It runs and debugs code directly in the browser using a WebContainer environment. Its high-fidelity design restoration is reported to reproduce design drafts with up to 90% accuracy. We0 supports frameworks and languages such as Vue, React, Next.js, Python, and Java, facilitating rapid AI application development and deployment. It also integrates with WeChat Mini Program Developer Tools, offering seamless debugging for mini-program developers. We0 is available for both Windows and Mac and can also run in web containers, giving it flexibility across different scenarios.
SWE-agent is an open-source AI software engineering agent developed by researchers in Princeton University's NLP group. It uses large language models (e.g., GPT-4) to automatically resolve issues in GitHub repositories. The system interacts with codebases through an Agent-Computer Interface (ACI), which lets it browse, edit, test, and execute code. On the SWE-bench test set, SWE-agent reported accuracy comparable to the closed-source AI programmer Devin, resolving issues in an average of 93 seconds and achieving state-of-the-art (SOTA) performance at the time of its release.
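To make the idea of an Agent-Computer Interface more concrete, the Python sketch below exposes a few narrow, text-returning commands over an in-memory file set. It is purely illustrative: `ToyACI` and its `view`, `search`, and `edit` methods are hypothetical and do not reproduce SWE-agent's real command set.

```python
class ToyACI:
    """A tiny in-memory stand-in for an agent-computer interface."""

    def __init__(self, files):
        # Store each file as a list of lines so edits can target line numbers.
        self.files = {name: text.splitlines() for name, text in files.items()}

    def view(self, name, start=1, count=5):
        """Return a numbered window of lines so the model sees limited context."""
        lines = self.files[name][start - 1:start - 1 + count]
        return "\n".join(f"{start + i}: {line}" for i, line in enumerate(lines))

    def search(self, term):
        """Return (file, line number) pairs whose line contains `term`."""
        return [
            (name, i)
            for name, lines in sorted(self.files.items())
            for i, line in enumerate(lines, start=1)
            if term in line
        ]

    def edit(self, name, line_no, new_text):
        """Replace one line and report the result back to the agent."""
        self.files[name][line_no - 1] = new_text
        return f"edited {name}:{line_no}"
```

The design point is that each command returns short, structured feedback that a language model can read and act on in its next step, rather than raw shell output.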
BrushNet is an image inpainting model developed by Tencent's ARC Lab and researchers from the University of Hong Kong. It uses a dual-branch architecture to decompose and process masked areas in images, ensuring high-quality restoration while preserving the original image's coherence. The model excels in handling various image types and styles, offering pixel-level precision and compatibility with pre-trained diffusion models.
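Inpainting pipelines commonly keep the unmasked pixels untouched by compositing the generated result back onto the original image with the mask. The Python sketch below shows that generic blending step on nested lists of pixel values. It is an assumption-laden illustration of the general technique, not BrushNet's own code, and `composite` is a hypothetical helper name.

```python
def composite(original, generated, mask):
    """Blend per pixel: mask 1.0 takes the generated value, 0.0 keeps the original."""
    return [
        [m * g + (1.0 - m) * o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]
```

Fractional mask values, for example from a blurred mask edge, produce a soft transition between restored and original regions.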
FlashFace is an AI portrait tool developed by Alibaba and the University of Hong Kong (HKU). It generates personalized, high-fidelity portrait images from user-provided facial images and text prompts, offering strong identity retention, instant personalization, and diverse results. It supports changing a person's age and gender, and even transforming virtual characters into realistic human photos. FlashFace can also convert real photos into different artistic styles or blend multiple character traits to create new images. It is suitable for personalized photo creation, virtual character design, and movie and game character design.
Gemini Live is an intelligent voice assistant developed by Google, designed to enhance user interaction through natural language understanding and multimodal recognition. It supports voice, image, and video interactions, enabling users to automate daily tasks with ease. With 10 voice options, deep integration with Google's native applications, and plans for iOS expansion and additional language support, Gemini Live offers a seamless and intelligent user experience.
ChatPDF is a free AI conversational PDF reading tool that leverages a large language model (LLM) to parse and understand PDF content. Users can upload PDF files and interact with the AI to ask questions, receive answers, and gain deeper insights into the document. The tool supports automatic question extraction, multilingual communication, and provides features like document summarization, content comparison, and cross-platform use. It is designed to simplify information retrieval and enhance reading efficiency for a wide range of users, including researchers, students, and professionals.
Janitor AI is a platform that enables users to create and explore AI virtual characters for free. It offers tools for personalizing characters and categorizing them based on popularity, gender, animation, and more. The platform integrates with social media platforms like YouTube, Twitter, TikTok, Reddit, and Discord, enhancing user interaction and content sharing.
SpicyChat is an AI application for role-playing chat, enabling users to interact with 150,000 chatbots and create their own personalized virtual characters. It offers deep, personalized conversations and emotional experiences, with an emphasis on privacy protection. The platform provides a safe, non-judgmental environment for users to freely explore desires and fantasies. It supports multiple languages, AI voice responses, and image generation, and is aimed at adult entertainment.
Smartcat is an all-in-one AI translation platform that combines AI translation, computer-assisted translation (CAT) tools, and a translation management system (TMS). It supports 280+ languages and 50+ file formats, making it ideal for businesses looking to streamline their translation processes. The platform features an integrated marketplace that connects clients with translation experts globally, ensuring efficient and accurate translations. Additionally, Smartcat offers project management tools and automated workflows to accelerate content globalization.
DressPlay is an AI-based virtual try-on application that enables users to try on various outfits by uploading photos or videos. Using advanced AI algorithms, it analyzes body shape and posture to seamlessly fit clothing images onto the user, creating realistic try-on effects. The app supports both static images and video outfit changes, making it ideal for social media creators, e-commerce platforms, and fashion enthusiasts. DressPlay enhances the shopping experience by allowing users to preview outfits virtually, while also helping merchants improve sales efficiency and customer satisfaction.
STORM AI is an open-source AI writing tool developed by Stanford University that transforms a topic into a detailed article or research paper in seconds. It leverages large language models (LLMs) to simulate expert conversations, generate multi-angle questions, and produce in-depth, well-researched content. STORM AI is ideal for tasks requiring extensive research and citations, as it automatically gathers materials, creates outlines, and compiles complete articles. Users can access the tool for free via the STORM AI website or deploy it locally using an API key for automated writing assistance.
LlamaCoder is an open-source AI tool that leverages the Llama 3.1 405B model to rapidly generate full-stack applications. It integrates components like Sandpack, Next.js, Tailwind, and Helicone to support code sandboxing, application routing, styling, and observability analysis. LlamaCoder allows users to generate components based on requests, making it suitable for building various applications such as calculators, quiz apps, games, and e-commerce product catalogs. It also supports data analysis and PDF analysis, offering local installation and usage guides, making it a powerful tool for developers to efficiently build applications.
AniEraser is a cross-platform AI tool developed by Wondershare, designed to remove watermarks, objects, and unwanted text from images and videos. It preserves the clarity and quality of the original files, making it ideal for content creators, marketers, and personal use. The tool is available on desktop (Windows & Mac), mobile (iOS & Android), and web platforms, offering a user-friendly interface and advanced features like batch processing, high-resolution support, and customizable brush tools.
EasySlide is an AI-driven intelligent slide generation tool that helps users create professional presentations quickly and efficiently. By leveraging natural language processing, EasySlide automatically generates slides based on user-input themes, offering a variety of templates, content optimization, and multimedia support. It supports real-time collaboration, multi-language content generation, and exporting presentations in multiple formats. Its intuitive design and powerful features make it ideal for students, professionals, and educators looking to enhance their presentation quality and efficiency.
GPT-SoVITS is an open-source voice cloning project developed by Huaer Buku, a Bilibili content creator (UP host) and founder of the RVC voice changer. It combines a GPT (Generative Pre-trained Transformer) model with SoVITS (SoftVC VITS, a voice conversion model) to enable high-quality voice cloning and text-to-speech (TTS) conversion using minimal sample data. The tool is ideal for scenarios requiring rapid generation of specific voices, allowing users to mimic a target speaker's voice, including emotion, timbre, and speed, from only a short reference clip (zero-shot) or a small amount of training audio (few-shot).
Roop is an open-source AI video face swap tool that enables users to replace faces in videos using just one image, eliminating the need for complex datasets or training processes. It is designed for users with technical skills, offering a command-line interface for customization and support for CPU/GPU acceleration to enhance processing speed. Roop is ideal for entertainment, film production, education, and artistic creation.
insMind is a professional AI editing tool for product images that simplifies the image editing process with an intuitive interface. It supports a wide range of design needs, including social media content creation, and offers features like background removal, object erasure, and new background generation. With batch processing and various creative filters, insMind improves work efficiency and creative expression.
Cascade is an advanced AI feature integrated into the Windsurf programming tool, designed to enhance developer productivity. It offers two modes: edit mode for direct code modifications and chat mode for interactive coding assistance. Cascade synchronizes with developers' operations in real-time, automatically captures code change contexts, executes terminal commands, installs dependencies, and suggests solutions to optimize the development process. Its context-aware engine provides deep understanding of code repositories, enabling iterative reasoning and multi-file editing.