Rakib Ansari

Achievement driven professional with 2 years of experience,

skilled in Generative AI, statistical modeling, machine learning, Deep learning, python and data visualization.

  • Continue
  • Rakib
    Data Scientist

    Strong problem-solving abilities and a passion for driving data-driven innovation. Effective communicator with a proven track record of delivering impactful solutions. Lifelong learner committed to making a meaningful impact.

  • Blogs
  • Anomaly-Gpt

    Large Language Models (LLMs) have taken the field of Natural Language Processing by storm, and their potential goes beyond just that. Recently, they have been extended to include visual processing capabilities by aligning visual features with text features. In this report, it explore how LLMs can be used to address the challenges associated with Industrial Anomaly Detection and introduce, AnomalyGPT - a novel approach to Industrial Anomaly Detection (IAD).

    Animate-Diff

    In the era of augmented and virtual reality, text-to-image models are becoming more advanced and personalized. However, they are still static. To solve this, researchers at AnimateDiff have developed a framework to animate personalized text-to-image models, adding motion to their generated images

    Audio-LDM2

    If you're interested in generating high-quality audio that spans across various types such as speech, music, and sound effects, keep reading! In this blog, they explored AudioLDM2, a new framework for audio generation that uses the same learning method for all audio types

    Chat-Dev

    CHAT DEV is a virtual chat-powered company that utilizes large language models for comprehensive software solutions. The diverse team of agents collaborates to streamline software development, providing quick and affordable solutions

    Chat-Home

    In a world of rapidly growing open-source language models, the field of home renovation has been relatively unexplored. To address this gap, presenting ChatHome, a language model designed specifically for the intricate field of home renovation.

    CoDeF

    While image processing has seen significant advancements, video processing has not progressed at the same rate. In the CoDeF paper, researchers propose a new video representation that combines a canonical content field with a temporal deformation field, allowing for the reconstruction of high-quality videos without the need for training.

    Disc-MedLLM

    Telemedicine has revolutionized healthcare services, broadening access to professionals, reducing medical costs, and allowing remote consultations. The proposed DISC-MedLLM leverages Large Language Models (LLMs) to provide accurate and truthful medical responses in end-to-end conversational healthcare services. The model surpasses existing medical LLMs in both single-turn and multi-turn consultation scenarios.

    Ecom-GPT

    The use of natural language processing (NLP) and deep learning (DL) has revolutionized many industries, including E-commerce. However, the complex structure of E-commerce data requires a language model tailored specifically for this domain. presenting EcomGPT, an instruction-following large language model (LLM) built using the EcomInstruct dataset, which combines atomic tasks and expert-written instruction schemas to enhance the generalization capability of LLMs on E-commerce tasks.

    Face-Chain

    FaceChain is an open-source framework that generates personalized portraits with only a few input images. It utilizes customized image-generation models and a suite of face-related perceptual understanding models to create truthful, high-quality portraits while retaining individual identity.

    Factool

    In this paper, the author presents FACTOOL - a task and domain-agnostic framework for detecting factual errors in text generated by large language models (LLMs). Despite the high quality of the text generated by LLMs, there is a high chance of inaccuracies or deviations from the truth. The current literature does not adequately address the factuality detection and verification needs of writing tasks that users commonly engage with when interacting with the generative models.

    Meta-GPT

    As artificial intelligence continues to advance, new possibilities for improving human workflows arise. Multi-agent systems that use Large Language Models (LLMs) offer great potential for enhancing human workflows, but existing systems often oversimplify real-world applications. In this report, lets discuss MetaGPT, a framework that combines efficient human workflows with LLMs to create multi-agent systems capable of solving complex real-world challenges.

    QwenVL

    Qwen-VL is a groundbreaking breakthrough in AI technology that bridges the gap between text and images. These models represent a revolutionary development in natural language processing and computer vision, bringing these two previously separate fields of research together.

    Scaleup-GAN-TTIS

    The success of text-to-image synthesis has raised questions about scaling up GANs to benefit from large datasets. GigaGAN, a new GAN architecture that can synthesize high-resolution images in 3.66 seconds.

    Speech-Tokenizer

    Speech language models are invaluable tools used in natural language processing today. However, current models utilize speech representations that are not specifically designed for speech language modeling. Enter SpeechTokenizer - a unified speech tokenizer that assesses speech tokens based on their strong alignment with text and effective preservation of speech information, paving the way for a Unified Speech Language Model (USLM).

    Weather-Bench

    Weather forecasting has come a long way, and so has the evaluation of the models used to predict it. WeatherBench 2 is an open-source framework that helps evaluate data-driven weather models based on industry standards and best practices.

    Wizard-Math

    Large-scale language models have transformed natural language processing tasks, but complex multi-step quantitative reasoning remains a major challenge. Developed a new method named Reinforcement Learning from Evol-Instruct Feedback (RLEIF), which enhances mathematical reasoning abilities in Llama-2. In this report, it present WizardMath, which outperforms all other open-source LLMs substantially

    AgentBench

    The world of artificial intelligence has evolved a lot, and large language models (LLMs) have become a remarkable development in various domains. To test the prowess of these models, we introduce to you, AgentBench, a groundbreaking multidimensional evolutionary benchmark that tests its mettle in not one, not two, but eight unique and challenging tasks.

    Audio-Craft

    AudioCraft is revolutionizing the world of generative AI, particularly in music generation. Its models - MusicGen, AudioGen and EnCodec - aim to simplify the process while offering high-quality, consistent and versatile audio output. With AudioCraft, producing new music and sound effects from raw signals has never been easier, making it accessible to various users, from musicians to game developers and small business owners.

    DB-Gpt

    DB-GPT is an experimental open-source project that revolutionizes how we engage with databases, ensuring 100% secure and confidential data. With localized GPT-3 models, DB-GPT provides a paradigm shift in data security and privacy.

    Embed-chain

    Embedchain simplifies the process of collecting and working with data by breaking it down into manageable parts and storing it in a database. With Embedchain, you can easily build chatbots and language models from any dataset, including YouTube videos, PDFs, and websites. Learn more below

    God-mode

    GodMode helps you choose the best language model for your needs. With a wide variety of providers to choose from, you can be sure to find the one that fits your use case. Here's a guide to help you get started!

    Gpt-Pilot

    In a constantly evolving technological landscape, the boundaries of what is possible in app development are being pushed by the GPT Pilot research project, which harnesses the power of GPT-4 to create production-ready applications. So, can artificial intelligence write up to 95% of the code for an app and leave the remaining 5% for human developers? Lets explore the key components of this groundbreaking project.

    Inst-Inpaint

    Image editing just got easier with the groundbreaking technology of image inpainting. Say goodbye to tedious masking and hello to a whole new world of seamless photo editing.

    LlamaGPT

    Meet LlamaGPT, the innovative self-hosted, offline, and private conversational assistant. With LlamaGPT, you can enjoy confidential and secure conversations without worrying about data privacy. This chatbot is like having a personal assistant right on your device!

    Metaphor

    Are you looking for a way to connect your LLM to the internet? Look no further. The Metaphor API allows you to search in natural language and get relevant results with our neural search model. And with the /contents endpoint, you can summarize the results in cleaned HTML content for your users. Plus, individual developers can get a free API key for up to 1000 requests per month

    Open-Copilot

    OpenCopilot is your own AI co-pilot that can connect to the tools your product uses behind the scenes. With advanced language models and a smart decision-making system, it can help you get the job done with ease.

    Open-Interpreter

    Get ready to unlock a groundbreaking way of making your code work with Open Interpreter. This innovative open-source tool allows language models to run code right on your own computer, making coding in different languages like Python and Javascript super easy. Experience a new way of running your code locally with a simple terminal command '$ interpreter' after installation. It's as simple as using a ChatGPT-like interface!

    TextGen-webUI

    A Gradio web UI for Large Language Models. Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.

    Prompt2-Model

    Prompt2Model is a tool that uses simple language instructions (like the ones you give to ChatGPT) to create a small, specific modelthat's easy to use and set up.