I have thought about the legalese Q&A and translation tool for a bit already.
The best way to have such tools be reliable using an llm without much hallucinations is via the use of embeddings of large quantities of legal documents and only ask the model to look for the ones that may answer the user question, a bit like with privateGPT. Also, always refer to the source and show it to the user.
Maybe even ask the user additional questions like : “is your question about finance, family, rights…etc ?” to decrease the error rate even further.
Then I’d have the llm put all the law articles and court cases it found in a list and use langchain to make it ask itself in each one? Is it really related to the user question ? Is it the right category ?.." to try and remove the most false positive possible .
Now that we’d have a cleaned list, ask the llm to combine what he got and transforme the legalese into understandable language. In this step the fine tuning ( honestly, i don’t know how yet) using legal documents could greatly help the model to understand the legalese better.
Could be a great business idea or generally helpful for anyone you release it to.
If you think it could be a good business idea then feel free to make it a reality. My main goal however is to allow people to know their rights. The government is quick to remind us of our duties but unless we seek to know our rights ourselves, they’ll be trampled by anyone and everyone. I’d like to Imagine a few kiosks scattered around town to help the citizens.
I have thought about the legalese Q&A and translation tool for a bit already.
The best way to have such tools be reliable using an llm without much hallucinations is via the use of embeddings of large quantities of legal documents and only ask the model to look for the ones that may answer the user question, a bit like with privateGPT. Also, always refer to the source and show it to the user.
Maybe even ask the user additional questions like : “is your question about finance, family, rights…etc ?” to decrease the error rate even further.
Then I’d have the llm put all the law articles and court cases it found in a list and use langchain to make it ask itself in each one? Is it really related to the user question ? Is it the right category ?.." to try and remove the most false positive possible .
Now that we’d have a cleaned list, ask the llm to combine what he got and transforme the legalese into understandable language. In this step the fine tuning ( honestly, i don’t know how yet) using legal documents could greatly help the model to understand the legalese better.
If you think it could be a good business idea then feel free to make it a reality. My main goal however is to allow people to know their rights. The government is quick to remind us of our duties but unless we seek to know our rights ourselves, they’ll be trampled by anyone and everyone. I’d like to Imagine a few kiosks scattered around town to help the citizens.