PDF To Data - Python help!!

newcomer

New Member
Joined
Oct 3, 2023
Messages
3
Reaction score
0
Credits
32
Hi. I'm making a medical chatbot called prodoc. I use just basic codes for that. I need to insert many medical PDFs and get data from those. So, once users ask a question, it will provide the answer. I'm a beginner for python & programming. Can anyone guide me, please? (I can't afford much).

I searched and found some things called longchain, ... But, I don't know well

If somebody can help me, a Big blessing!

1696344260512.png
 


I know of some python libraries hat will extract text from a PDF file.

PyPDF2 and pdfplumber both will extract text from a PDF file.
But then, how do you know what specific text to extract? I'm not a professional programmer.
But, when I have done this, I extract 'ALL' the text, and then use other tools to parse and extract
the part I need.

With AI type querying, the python ( or whatever language ) has to be smart enough to know
what text to look for.
 
I know of some python libraries hat will extract text from a PDF file.

PyPDF2 and pdfplumber both will extract text from a PDF file.
But then, how do you know what specific text to extract? I'm not a professional programmer.
But, when I have done this, I extract 'ALL' the text, and then use other tools to parse and extract
the part I need.

With AI type querying, the python ( or whatever language ) has to be smart enough to know
what text to look for.
Yeah. That is the thing I need. Will it be possible if we use Langchan API Or any AI API? (I'm searching for free)
 

Members online


Top