Eden Yavin’s Post

Applied Researcher @ Accenture Cyber Labs | M.Sc. AI and Autonomous systems @ BGU

I agree - searching deeply for ways to get more out of an existing dataset is better than searching broadly for new data to add to it. The method seems interesting because, intuitively, we humans do the same thing in other domains, not just NLP. The difficult part is building the skill-dependency graph in domains other than NLP. Interesting and worth monitoring!

Yoel Zeldes

Research Engineer (NLP) @ Google DeepMind | Content Creator @ One Shot Learning

We all know that data is one of the most important ingredients in building a powerful LLM. But did you know that once you have the data, you can use it for training far more effectively than by randomly sampling from it?

In "Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models" the authors propose a neat framework for curriculum learning: instead of randomly sampling your training data, identify which skills can be learned from the data and which dependencies exist between those skills, so you can train on prerequisite skills before moving on to more advanced ones. Intuitively, this is what humans do, and I'm glad to see a practical approach that translates it into the LLM domain.

Why does it matter? Because gathering more relevant, high-quality data is becoming increasingly challenging, as several research papers have pointed out. Using the data you already have more efficiently is therefore one way to keep improving.

The paper's weakness is that it doesn't align with recent research on how to evaluate LLMs meaningfully. Most of the experiments measure perplexity, but what really matters is evaluating the models' actual generations on real use cases. I hope we'll see extensions of the paper in these directions in the near future. 🤞
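To make the "prerequisite skills first" idea concrete, here is a minimal sketch of ordering training batches by a skill-dependency graph. This is only an illustration, not the paper's actual algorithm (Skill-it's own method goes further, e.g., adaptively adjusting the mixture of skills during training); the toy graph, skill names, and helper functions below are assumptions made up for the example.

```python
import random

# Toy skill dependency graph: each skill lists its prerequisite skills.
# (Illustrative only -- in Skill-it such a graph is defined over skills
# identified in the training data.)
skill_graph = {
    "spanish_qa": ["spanish", "english_qa"],
    "english_qa": [],
    "spanish": [],
}

def topological_order(graph):
    """Order skills so every prerequisite comes before the skills that need it."""
    order, visited = [], set()

    def visit(skill):
        if skill in visited:
            return
        visited.add(skill)
        for prereq in graph.get(skill, []):
            visit(prereq)
        order.append(skill)

    for skill in graph:
        visit(skill)
    return order

def curriculum_batches(data_by_skill, graph, batch_size=8):
    """Yield training batches that walk the skills in prerequisite order,
    instead of sampling uniformly from the whole dataset."""
    for skill in topological_order(graph):
        examples = list(data_by_skill.get(skill, []))
        random.shuffle(examples)
        for i in range(0, len(examples), batch_size):
            yield skill, examples[i:i + batch_size]

# Usage with dummy data: each skill maps to its training examples.
data_by_skill = {
    "english_qa": [f"en_qa_{i}" for i in range(10)],
    "spanish":    [f"es_{i}" for i in range(10)],
    "spanish_qa": [f"es_qa_{i}" for i in range(10)],
}
for skill, batch in curriculum_batches(data_by_skill, skill_graph, batch_size=4):
    print(skill, batch)
```

The point of the sketch is simply that the sampler consults the graph rather than the raw data distribution: once you know which skills depend on which, the order (or mixture weights) of training data falls out of the graph instead of being uniform.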
