freeword-new.blogspot.com: November 2024

Show HN: Minimal, customizable new tab for Chrome/Firefox
13 by georg-stone | 6 comments on Hacker News.
Hello HN! Flowtide is a project I have been working on for about 2 months now. It is a customizable new tab page for Firefox or Chrome. By default, it is configured to have a minimal amount of features, but it can be configured to include a clock, to-do list, or even soundscapes. Install: https://flowtide.app/ GitHub: https://ift.tt/QJx0X7K

Sunday, November 24, 2024

New top story on Hacker News: Petnames: A humane approach to secure, decentralized naming

Petnames: A humane approach to secure, decentralized naming
14 by todsacerdoti | 0 comments on Hacker News.

Saturday, November 23, 2024

New top story on Hacker News: Linux CoC Announces Decision Wrt Kent Overstreet (Bcachefs)

Linux CoC Announces Decision Wrt Kent Overstreet (Bcachefs)
42 by pantalaimon | 36 comments on Hacker News.

Friday, November 22, 2024

New top story on Hacker News: Rendering "modern" Winamp skins in the browser

Rendering "modern" Winamp skins in the browser
9 by mariuz | 0 comments on Hacker News.

New top story on Hacker News: Private School Labeler on Bluesky

Private School Labeler on Bluesky
19 by tosh | 1 comments on Hacker News.

Thursday, November 21, 2024

New top story on Hacker News: Handling cookies is a minefield

Handling cookies is a minefield
61 by todsacerdoti | 13 comments on Hacker News.

New top story on Hacker News: Pg_karnak: Transactional schema migration across tenant databases

Pg_karnak: Transactional schema migration across tenant databases
16 by gwen-shapira | 2 comments on Hacker News.

Wednesday, November 20, 2024

New top story on Hacker News: The Northeast is becoming fire country

The Northeast is becoming fire country
8 by gregorymichael | 1 comments on Hacker News.

New top story on Hacker News: AMD crafts custom EPYC CPU with HBM3 for Azure: 88 Zen 4 cores and 450GB of HBM3

AMD crafts custom EPYC CPU with HBM3 for Azure: 88 Zen 4 cores and 450GB of HBM3
18 by rbanffy | 1 comments on Hacker News.

Urban voters disappointed in Maharashtra, least number of voters came out from Mumbai, Pune and Thane, see the figures

Maharashtra assembly elections 2024

New Delhi: Voting for the first and last phase of Maharashtra assembly elections took place today. The figures that have come out after the voting was completed at 6 pm are not very encouraging. According to the Election Commission, low voter participation was seen in cities like Mumbai, Pune and Thane. By 5 pm, 58.22 percent voting took place in Maharashtra and 67.59 percent in Jharkhand. At the same time, in Jharkhand, 67.04 percent voting took place on these assembly seats in 2019.

Tuesday, November 19, 2024

New top story on Hacker News: Rumble CEO Chris Pavlovski Expresses Interest in Buying Google Chrome

Rumble CEO Chris Pavlovski Expresses Interest in Buying Google Chrome
12 by agensaequivocum | 3 comments on Hacker News.

New top story on Hacker News: The Analog Thing: Analog Computing for the Future

The Analog Thing: Analog Computing for the Future
8 by cgeier | 1 comments on Hacker News.

Monday, November 18, 2024

New top story on Hacker News: Unreal 5.5 is a big deal [video]

Unreal 5.5 is a big deal [video]
24 by RobinHirst11 | 10 comments on Hacker News.

New top story on Hacker News: Show HN: FastGraphRAG – Better RAG using good old PageRank

Show HN: FastGraphRAG – Better RAG using good old PageRank
22 by liukidar | 5 comments on Hacker News.
Hey there HN! We’re Antonio, Luca, and Yuhang, and we’re excited to introduce Fast GraphRAG, an open-source RAG approach that leverages knowledge graphs and the 25 years old PageRank for better information retrieval and reasoning. Building a good RAG pipeline these days takes a lot of manual optimizations. Most engineers intuitively start from naive RAG: throw everything in a vector database and hope that semantic search is powerful enough. This can work for use cases where accuracy isn’t too important and hallucinations are tolerable, but it doesn’t work for more difficult queries that involve multi-hop reasoning or more advanced domain understanding. Also, it’s impossible to debug it. To address these limitations, many engineers find themselves adding extra layers like agent-based preprocessing, custom embeddings, reranking mechanisms, and hybrid search strategies. Much like the early days of machine learning when we manually crafted feature vectors to squeeze out marginal gains, building an effective RAG system often becomes an exercise in crafting engineering “hacks.” Earlier this year, Microsoft seeded the idea of using Knowledge Graphs for RAG and published GraphRAG - i.e. RAG with Knowledge Graphs. We believe that there is an incredible potential in this idea, but existing implementations are naive in the way they create and explore the graph. That’s why we developed Fast GraphRAG with a new algorithmic approach using good old PageRank. There are two main challenges when building a reliable RAG system: (1) Data Noise: Real-world data is often messy. Customer support tickets, chat logs, and other conversational data can include a lot of irrelevant information. If you push noisy data into a vector database, you’re likely to get noisy results. (2) Domain Specialization: For complex use cases, a RAG system must understand the domain-specific context. This requires creating representations that capture not just the words but the deeper relationships and structures within the data. Our solution builds on these insights by incorporating knowledge graphs into the RAG pipeline. Knowledge graphs store entities and their relationships, and can help structure data in a way that enables more accurate and context-aware information retrieval. 12 years ago Google announced the knowledge graph we all know about [1]. It was a pioneering move. Now we have LLMs, meaning that people can finally do RAG on their own data with tools that can be as powerful as Google’s original idea. Before we built this, Antonio was at Amazon, while Luca and Yuhang were finishing their PhDs at Oxford. We had been thinking about this problem for years and we always loved the parallel between pagerank and the human memory [2]. We believe that searching for memories is incredibly similar to searching the web. Here’s how it works: - Entity and Relationship Extraction: Fast GraphRAG uses LLMs to extract entities and their relationships from your data and stores them in a graph format [3]. - Query Processing: When you make a query, Fast GraphRAG starts by finding the most relevant entities using vector search, then runs a personalized PageRank algorithm to determine the most important “memories” or pieces of information related to the query [4]. - Incremental Updates: Unlike other graph-based RAG systems, Fast GraphRAG natively supports incremental data insertions. This means you can continuously add new data without reprocessing the entire graph. - Faster: These design choices make our algorithm faster and more affordable to run than other graph-based RAG systems because we eliminate the need for communities and clustering. Suppose you’re analyzing a book and want to focus on character interactions, locations, and significant events: from fast_graphrag import GraphRAG DOMAIN = "Analyze this story and identify the characters. Focus on how they interact with each other, the locations they explore, and their relationships." EXAMPLE_QUERIES = [ "What is the significance of Christmas Eve in A Christmas Carol?", "How does the setting of Victorian London contribute to the story's themes?", "Describe the chain of events that leads to Scrooge's transformation.", "How does Dickens use the different spirits (Past, Present, and Future) to guide Scrooge?", "Why does Dickens choose to divide the story into \"staves\" rather than chapters?" ] ENTITY_TYPES = ["Character", "Animal", "Place", "Object", "Activity", "Event"] grag = GraphRAG( working_dir="./book_example", domain=DOMAIN, example_queries="\n".join(EXAMPLE_QUERIES), entity_types=ENTITY_TYPES ) with open("./book.txt") as f: grag.insert(f.read()) print(grag.query("Who is Scrooge?").response) This code creates a domain-specific knowledge graph based on your data, example queries, and specified entity types. Then you can query it in plain English while it automatically handles all the data fetching, entity extractions, co-reference resolutions, memory elections, etc. When you add new data, locking and checkpointing is handled for you as well. This is the kind of infrastructure that GenAI apps need to handle large-scale real-world data. Our goal is to give you this infrastructure so that you can focus on what’s important: building great apps for your users without having to care about manually engineering a retrieval pipeline. In the managed service, we also have a suite of UI tools for you to explore and debug your knowledge graph. We have a free hosted solution with up to 100 monthly requests. When you’re ready to grow, we have paid plans that scale with you. And of course you can self host our open-source engine. Give us a spin today at https://circlemind.co and see our code at https://ift.tt/lXzjWo8 We’d love feedback :) [1] https://ift.tt/Ow8FjoM... [2] Griffiths, T. L., Steyvers, M., & Firl, A. (2007). Google and the Mind: Predicting Fluency with PageRank. Psychological Science, 18(12), 1069–1076. https://ift.tt/OZ0R9fb [3] Similarly to Microsoft’s GraphRAG: https://ift.tt/W6YFs4a [4] Similarly to OSU’s HippoRAG: https://ift.tt/numkr9D https://ift.tt/a0C84ek

New top story on Hacker News: Illegally logged wood from Cambodia likely ending up in U.S. homes

Illegally logged wood from Cambodia likely ending up in U.S. homes
33 by PaulHoule | 7 comments on Hacker News.

New top story on Hacker News: MailCatcher runs a super simple SMTP server

MailCatcher runs a super simple SMTP server
3 by mooreds | 0 comments on Hacker News.

Sunday, November 17, 2024

New top story on Hacker News: Japan ranked 31st in digital competitiveness, 92nd for English skills

Japan ranked 31st in digital competitiveness, 92nd for English skills
10 by mikhael | 5 comments on Hacker News.

New top story on Hacker News: AlphaProof's Greatest Hits: the coolest bits from the IMO 2024 solutions

AlphaProof's Greatest Hits: the coolest bits from the IMO 2024 solutions
8 by rishicomplex | 0 comments on Hacker News.

New top story on Hacker News: Good Software Development Habits

Good Software Development Habits
20 by mmphosis | 6 comments on Hacker News.

New top story on Hacker News: A Taxonomy of AgentOps

A Taxonomy of AgentOps
5 by omarsar | 0 comments on Hacker News.

Saturday, November 16, 2024

New top story on Hacker News: I tried every top email marketing tool

I tried every top email marketing tool
35 by steve-benjamins | 28 comments on Hacker News.

Friday, November 15, 2024

New top story on Hacker News: Bill Atkinson Has Pancreatic Cancer

Bill Atkinson Has Pancreatic Cancer
4 by mpweiher | 1 comments on Hacker News.

New top story on Hacker News: I Used Star Methodology to Crack Amazon Leadership Principles Interview

I Used Star Methodology to Crack Amazon Leadership Principles Interview
4 by arihantparsoya | 0 comments on Hacker News.

New top story on Hacker News: 100-MW solar farm just broke ground in Wisconsin

100-MW solar farm just broke ground in Wisconsin
17 by belter | 17 comments on Hacker News.

New top story on Hacker News: Pex: A tool for generating .pex (Python EXecutable) files, lock files and venvs

Pex: A tool for generating .pex (Python EXecutable) files, lock files and venvs
3 by eamag | 0 comments on Hacker News.

Thursday, November 14, 2024

New top story on Hacker News: Something weird is happening with LLMs and Chess

Something weird is happening with LLMs and Chess
18 by gregorymichael | 5 comments on Hacker News.

New top story on Hacker News: ℘

℘
30 by IdealeZahlen | 0 comments on Hacker News.

New top story on Hacker News: Origin private file system – MDN

Origin private file system – MDN
8 by thunderbong | 1 comments on Hacker News.

New top story on Hacker News: The Corpus of United States State Statutes–Design, Construction and Use

The Corpus of United States State Statutes–Design, Construction and Use
7 by cwmoore | 0 comments on Hacker News.

Wednesday, November 13, 2024

New top story on Hacker News: Steve Jobs, NeXTSTEP, and early object-oriented programming (2016)

Steve Jobs, NeXTSTEP, and early object-oriented programming (2016)
30 by wmlive | 6 comments on Hacker News.

New top story on Hacker News: Show HN: Konga Beat – A custom track editor for Donkey Konga 2 and 3

Show HN: Konga Beat – A custom track editor for Donkey Konga 2 and 3
31 by CIARobotFish | 7 comments on Hacker News.
Howdy HN! For those who don't know, back in the early 2000s, Nintendo and Namco developed a series of music rhythm games for the GameCube featuring Donkey Kong called Donkey Konga: https://ift.tt/RhCuPST The Donkey Konga games borrowed heavily from Taiko no Tatsujin (another music rhythm game by Namco). However, instead of taiko drums, the player would use DK Bongos to jam along with music from different eras and genres. Long story short, I figured out how to add custom tracks to some of the Donkey Konga games (Donkey Konga 2 and 3) but found the entire process cumbersome, so I decided to make a dedicated editor. It was a lot of fun to make, and I hope others get some enjoyment out of it too!

New top story on Hacker News: Covering the Supreme Court and the Judiciary in (Mostly) Plain English

Covering the Supreme Court and the Judiciary in (Mostly) Plain English
26 by everybodyknows | 8 comments on Hacker News.

Tuesday, November 12, 2024

New top story on Hacker News: Large Language Models in National Security Applications

Large Language Models in National Security Applications
34 by bindidwodtj | 9 comments on Hacker News.

New top story on Hacker News: The Future of Programming: Copilots vs. Agents (Part I)

The Future of Programming: Copilots vs. Agents (Part I)
3 by thunderbong | 0 comments on Hacker News.

New top story on Hacker News: Avremu: An 8-Bit AVR Microcontroller Simulator Written in LaTeX

Avremu: An 8-Bit AVR Microcontroller Simulator Written in LaTeX
9 by nurple | 3 comments on Hacker News.

Monday, November 11, 2024

New top story on Hacker News: Brian Kernighan Reflects on Unix: A History and a Memoir [video]

Brian Kernighan Reflects on Unix: A History and a Memoir [video]
8 by zdw | 0 comments on Hacker News.

Sunday, November 10, 2024

New top story on Hacker News: We All Scream for Ice Cream

We All Scream for Ice Cream
3 by mooreds | 1 comments on Hacker News.

Saturday, November 9, 2024

New top story on Hacker News: IronCalc – Open-Source Spreadsheet Engine

IronCalc – Open-Source Spreadsheet Engine
10 by kaathewise | 4 comments on Hacker News.

New top story on Hacker News: OpenCoder: Open-Source LLM for Coding

OpenCoder: Open-Source LLM for Coding
38 by pil0u | 6 comments on Hacker News.

Friday, November 8, 2024

New top story on Hacker News: The case of a program that crashed on its first instruction

The case of a program that crashed on its first instruction
6 by zdimension | 0 comments on Hacker News.

New top story on Hacker News: Mitochondria Are Alive

Mitochondria Are Alive
32 by mailyk | 6 comments on Hacker News.

New top story on Hacker News: Pirating "The Pirate Bay" TV Series Is Ironically Difficult

Pirating "The Pirate Bay" TV Series Is Ironically Difficult
20 by HieronymusBosch | 5 comments on Hacker News.

New top story on Hacker News: Genetic repair via CRISPR can inadvertently introduce other defects

Genetic repair via CRISPR can inadvertently introduce other defects
25 by amichail | 7 comments on Hacker News.

Thursday, November 7, 2024

New top story on Hacker News: Mushroom Color Atlas

Mushroom Color Atlas
36 by gaws | 1 comments on Hacker News.

New top story on Hacker News: Show HN: BemiDB – Postgres read replica optimized for analytics

Show HN: BemiDB – Postgres read replica optimized for analytics
25 by exAspArk | 13 comments on Hacker News.

Wednesday, November 6, 2024

New top story on Hacker News: Starship's Sixth Flight Test

Starship's Sixth Flight Test
60 by hnburnsy | 27 comments on Hacker News.

New top story on Hacker News: Launch HN: Midship (YC S24) – Turn unstructured documents into usable data

Launch HN: Midship (YC S24) – Turn unstructured documents into usable data
6 by maxmaio | 1 comments on Hacker News.
Hey HN, we are Max, Kieran, and Aahel from Midship ( https://midship.ai ). Midship makes it easy to extract data from unstructured documents like pdfs and images. Here’s a video showing it in action: https://ift.tt/W4wFRue?... , and a demo playground (no signup required!) to test it out: https://ift.tt/QRsAd1b We started 5 months ago initially trying to make an AI natural language workflow builder that would be a simpler alternative to Zapier or Make.com. However, most of our users seemed to be much more interested in the basic (and not very good) document extraction feature we had. Seeing how people were spending hours a day manually extracting data from pdfs inspired us to build what has become Midship! The problem is that despite all our progress in software, huge amounts of business data still lives in PDFs and images. Sure, you can OCR them, but getting clean, structured data out is still painful. Most existing tools just give you a blob of markdown - leaving you to figure out which parts matter and how they relate. We've found that combining OCR with language models lets us do something more useful: extract specific fields and tables that users actually care about. The LLMs help correct OCR mistakes and understand context (like knowing that "Inv#" and "Invoice Number" mean the same thing). We have two main kinds of users today, non-technical users that extract data via our web app and developers who use our extraction api. We were initially focused on the first one as they seemed like an underserved part of the market, but we’ve received a lot of interest from developers who face the same issues. For pricing, we currently charge a monthly Saas fee per seat for the web app and a volume based pricing for the API. We’re really excited to share what we’ve built so far and look forward to any feedback from the community!