You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jan 6, 2025. It is now read-only.
I have a PDF that I wish to extract the table from. The package worked perfectly on most of the pdfs on which I used it before. But this time, I'm getting gibberish in English instead of Hindi Text.
Note that Dependencies are properly installed and that wouldn't be the issue here. This is what I'm doing: pdf = "./Pradhanjee.pdf" table = camelot.read_pdf(pdf, pages="all",flavor='lattice') df = [] for i in range(len(table)): df.append(table[i].df) new_df = pd.DataFrame() for i in range(len(df)): new_df = pd.concat([new_df, df[i]], axis=0) new_df.to_excel(f"{title}.xlsx", index=False) new_df
I'm not sure why this is happening. Any help would be appreciated :')
pdf = "./Pradhanjee.pdf"table = camelot.read_pdf(pdf, pages="all",flavor='lattice')df = []for i in range(len(table)):df.append(table[i].df)new_df = pd.DataFrame()for i in range(len(df)):new_df = pd.concat([new_df, df[i]], axis=0)new_df.to_excel(f"{title}.xlsx", index=False)new_dfI'm not sure why this is happening. Any help would be appreciated :')