Commit Graph

58 Commits

Author SHA1 Message Date
Nielson Janné
ce88140ee9 Remove unused var 2025-04-10 15:35:26 +02:00
Nielson Janné
58f9d1f6b5 Remove superfluous import 2025-04-10 15:35:15 +02:00
ab78fdc0c7 Merge pull request 'CondRetGenLangGraph() returns web and pdf sources by levering the stream defintion' (#19) from cond-sources into main
Reviewed-on: AI_team/generic-RAG-demo#19
Reviewed-by: nielsonj <nielson.janne@sogeti.com>
2025-04-09 19:39:58 +02:00
Ruben Lucas
df2afd73cb 🎨 Reset retrieved sources 2025-04-09 16:30:39 +02:00
Ruben Lucas
67ba306d3e 🐛 If check for pdf source because try doesn't fail 2025-04-09 16:06:15 +02:00
Ruben Lucas
ab1235bd28 Create single source aggregation definition 2025-04-09 16:03:35 +02:00
3295bb8992 Find and add found sources 2025-04-09 15:23:54 +02:00
6ad6ac4a34 Merge pull request 'Change RetGenLangGraph to use streaming instead of invoking on the LLM' (#17) from reg_gen_stream into main
Reviewed-on: AI_team/generic-RAG-demo#17
2025-04-09 11:21:13 +02:00
Nielson Janné
9baa7b0ef6 Fix a small bug in report web page source 2025-04-09 11:17:09 +02:00
Nielson Janné
9bb9f0ea22 Make RetGenGrap async for smoother user experience 2025-04-09 11:16:55 +02:00
Nielson Janné
db3d1cfa20 Change RetGenLangGraph to use streaming instead of invoking on the LLM 2025-04-08 23:37:04 +02:00
Nielson Janné
5d86ad6961 Add logging warning when no page number is found 2025-03-28 18:56:01 +01:00
Nielson Janné
f3429a15b1 small refacter in ret_gen.py
- improve readability
- reduce chance for potential mistake in future
2025-03-28 18:19:55 +01:00
Nielson Janné
cd14c8add2 Fix pdf source retrieval information 2025-03-28 18:11:41 +01:00
Nielson Janné
d1e9b3d8cf pretty up source printing in chainlit 2025-03-28 18:11:41 +01:00
Nielson Janné
47c1c1cd6e Add extra logging regarding pdf/web soure parsing 2025-03-28 18:11:41 +01:00
0fe4a628d7 🐛 Check CHAINLIT_STARTERS with default value 2025-03-28 13:47:45 +01:00
cfcaf84f2d 🐛 Make project setup more restrictive Add black settings 2025-03-28 13:47:08 +01:00
Nielson Janné
d37012f780 Make the --pdf-data argument optional 2025-03-26 11:08:54 +01:00
Nielson Janné
506857deeb Factor out processing of responses in functions 2025-03-26 10:34:34 +01:00
Nielson Janné
5258127ae1 Update readme 2025-03-24 11:57:34 +01:00
Nielson Janné
6cf26dabce Fix a argparse bug 2025-03-17 17:48:39 +01:00
Nielson Janné
bb1bf558f7 Add memory in the RetGenGraph 2025-03-17 17:48:30 +01:00
Nielson Janné
ee0c731faf Add support for both lang Graphs 2025-03-17 17:40:54 +01:00
Nielson Janné
f25770e3ce Add a Conditional Retrieve/Generator LangGraph 2025-03-17 16:51:15 +01:00
Nielson Janné
3965ce0fb2 Fixes small bug in RetGenLangGraph 2025-03-17 14:46:16 +01:00
Nielson Janné
2ba3eadec4 Add some doc-strings 2025-03-17 14:25:39 +01:00
Nielson Janné
3fa0e31521 Refactor out Retrieval/Generator LangGraph 2025-03-17 14:15:50 +01:00
Nielson Janné
3412dea813 Bugfix not passing unstructured_pdf argument along 2025-03-17 14:15:15 +01:00
Nielson Janné
f46887b19d Add Ruff as dev dependency 2025-03-17 12:47:36 +01:00
Nielson Janné
674220f442 Extend readme documentation 2025-03-17 12:47:22 +01:00
Nielson Janné
67d681fcc4 Fix --backend flag and description 2025-03-17 12:47:09 +01:00
Nielson Janné
b9d2f348a2 Add option to configure the us of (un)structred pdfs 2025-03-17 12:46:52 +01:00
Nielson Janné
e79d7b9867 Adds a structured pdf parser 2025-03-17 11:57:10 +01:00
Nielson Janné
cfbfe5f609 Add pdf/web source and viewer to chainlit 2025-03-15 15:35:29 +01:00
Nielson Janné
37aa171924 Add filetype key for web chunk 2025-03-15 15:35:09 +01:00
Nielson Janné
865c2c3b8b Add chainlit file cache directory to gitignore 2025-03-15 15:34:52 +01:00
Nielson Janné
1168432278 Clarify the current google backend is the google vertext backend 2025-03-15 14:21:04 +01:00
Nielson Janné
f020257fbe Update readme with .env information 2025-03-15 14:14:58 +01:00
Nielson Janné
0d7d5cf39f Remove dependency on huggingface and use Ollama 2025-03-15 14:12:24 +01:00
Nielson Janné
abc4ac947c Add some newlines for read ability 2025-03-15 13:42:28 +01:00
Nielson Janné
976ccd155f Fix a small bug using local LLM's 2025-03-15 13:42:06 +01:00
Nielson Janné
450a00e6fe Add logging info when skipping file or website 2025-03-15 13:41:56 +01:00
Nielson Janné
a6e9c95a6d Add .chroma directory to gitignore 2025-03-14 23:50:54 +01:00
Nielson Janné
e99d26ed96 Apply RUFF linting 2025-03-14 23:21:51 +01:00
Nielson Janné
b07eca8f9b Only process files and websites if not already in Chroma DB. 2025-03-14 23:20:34 +01:00
Nielson Janné
1f75264e96 Add ruff linting configuration 2025-03-14 22:18:05 +01:00
Nielson Janné
06dbfed5a0 Implement persistent chroma DB 2025-03-12 22:05:02 +01:00
Nielson Janné
860cfc3438 Resolve bug in process_local_files and clean up variabel names 2025-03-12 22:04:40 +01:00
Nielson Janné
e259808322 Adds information to the readme 2025-03-12 14:47:34 +01:00