Page 3 of 3 First 123
  1. #21
    Quote Originally Posted by LordEntrails View Post
    That's the problem with text recognition, it has flaws.

    If you want lots of monsters, see Maasq's conversion of the Monster a Day / 1d6 Adventures. It has over 400 NPCs in it; https://www.fantasygrounds.com/forum...ter-Compendium
    As always, thanks so much for the replies! I appreciate the links as well I will have to cross reference what's already been done by these folks, I might have already done some unnesseray labor, but eh.

    Shout-out to Minty23185Fresh and Zacchaeus as well!
    Last edited by mostcallmetim; April 26th, 2019 at 00:20.

  2. #22
    LordEntrails's Avatar
    Join Date
    May 2015
    Location
    GMT -7
    Posts
    8,269
    Blog Entries
    9
    If you want some of the information on why text recognition (from OneNote or other apps) often has problems, you can look up "ligature" and see that in many fonts, some character pairs are actually placed in the file as a single 'special' character. Plus, even though the recognition software tries to determine the font style used, their are a nearly unlimited number of fonts and variations that look exceedingly similar. Plus the issue of image quality.

    Unfortunately, unless the PDF is always saved with all text and fonts included, often times you are going to get errors. And since many people use PDF to protect their files so they can not easily be "copied" to other sources, they don't want to use those features.

    Current Projects: Ultimate Undermountain (NYDUM)
    Community Contributions: Gemstones, 5E Quick Ref Decal, Adventure Module Creation, Dungeon Trinkets
    DMsGuild Content: Balance Disturbed (Adventure), Dungeon Room Descriptions
    FG Product Reviews: Virtual Scribe Reviews

  3. #23

    Join Date
    May 2016
    Location
    Jacksonville, FL
    Posts
    1,734
    Blog Entries
    7
    Quote Originally Posted by LordEntrails View Post
    Unfortunately, unless the PDF is always saved with all text and fonts included, often times you are going to get errors.
    Most of the time when I fully extract all data from PDFs, the full font isn't even included—they're 'subfonts' with only the characters present in the text. Must be an InDesign function?

    If the poster above is working with older material now in PDF form (such as AD&D 1E and 2E) that's because WotC requested people send in their best quality scans a few years back because the original manuscripts were lost, so they'll naturally be images rather than proper text and you'll be limited to the OCR capabilities of whatever software you use to attempt to figure out what the text is. Text styles, custom characters, ligatures, and many other factors (not the least of which is the image quality itself) all contribute to OCR fallibility.

  4. #24
    LordEntrails's Avatar
    Join Date
    May 2015
    Location
    GMT -7
    Posts
    8,269
    Blog Entries
    9
    Quote Originally Posted by Talyn View Post
    Most of the time when I fully extract all data from PDFs, the full font isn't even included—they're 'subfonts' with only the characters present in the text. Must be an InDesign function?
    Don't know. But I do know most PDF print drivers have an option to embedd all fonts. So at least in some cases it depends upon what options are selected when the PDF is created.

    And, as you point out, sometimes even the publishers don't have much option as to how the PDF is created and we (the end users) have no say in what features the PDF includes.

    Current Projects: Ultimate Undermountain (NYDUM)
    Community Contributions: Gemstones, 5E Quick Ref Decal, Adventure Module Creation, Dungeon Trinkets
    DMsGuild Content: Balance Disturbed (Adventure), Dungeon Room Descriptions
    FG Product Reviews: Virtual Scribe Reviews

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

Log in

Log in