Commit

Merge pull request #3110 from BirkbeckCTP/3109-bugfix
#3109: Full-text indexing now works as expected for documents with a <body> tag.
mauromsl authored Sep 1, 2022
2 parents 959d7dd + d085f09 commit 62a667a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/core/files.py
@@ -930,7 +930,7 @@ def html_to_text(file_path):
         soup = BeautifulSoup(f.read(), "html.parser")
         body = soup.find("body")
         if body:
-            body.text
+            text = body.text
         else:
             text = soup.text
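
The one-line change is easy to miss, so here is a minimal sketch of why the old branch failed. It assumes that `text` is read after the `if`/`else` (the hunk does not show that part of `html_to_text`). The helper `fake_soup` and both function names are hypothetical stand-ins invented for illustration; they are not part of Janeway or of the BeautifulSoup API.

```python
from types import SimpleNamespace


def extract_text_before(soup):
    """Pre-fix logic: the <body> branch evaluates body.text and discards it."""
    body = soup.find("body")
    if body:
        body.text  # bare expression statement: `text` is never bound here
    else:
        text = soup.text
    return text  # assumption: the hunk does not show how `text` is used


def extract_text_after(soup):
    """Post-fix logic: both branches bind `text`."""
    body = soup.find("body")
    if body:
        text = body.text
    else:
        text = soup.text
    return text


def fake_soup(text, body_text=None):
    """Hypothetical stand-in for a parsed document, exposing only .text and .find()."""
    body = SimpleNamespace(text=body_text) if body_text is not None else None
    return SimpleNamespace(
        text=text,
        find=lambda name: body if name == "body" else None,
    )


with_body = fake_soup("Title Hello", body_text="Hello")
without_body = fake_soup("Hello")

print(extract_text_after(with_body))     # Hello
print(extract_text_after(without_body))  # Hello

try:
    extract_text_before(with_body)
except UnboundLocalError as exc:
    # The pre-fix code only failed for documents that do have a <body>.
    print("pre-fix failure:", exc)
```

Under that assumption, documents without a `<body>` tag were unaffected, because the `else` branch already assigned `text`. Only documents with a `<body>` hit the unbound variable, which matches the commit description.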
