Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
retrieve_pdf_metadata [2017/11/19 10:31] – bwiernik | retrieve_pdf_metadata [2022/03/05 16:58] (current) – dstillman | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Retrieve PDF Metadata ====== | ====== Retrieve PDF Metadata ====== | ||
+ | Users new to Zotero may find the prospect of importing all their data somewhat daunting. Many researchers already have a large collection of PDFs that they' | ||
- | [[/support/ | + | To use this feature, simply drag your existing PDFs into your Zotero library or use the "Store Copy of File" or "Link to File" options from the add new item menu (green plus sign). By default, Zotero will automatically retrieve metadata for each PDF, create an appropriate parent item, and rename the associated file based on the metadata. (You can disable these automatic functions in the [[preferences/general|General pane]] of Zotero preferences.) |
- | Users new to Zotero may find the prospect of importing all their data somewhat daunting. Zotero can import bibliographic data in a wide variety of formats, but what of PDFs? Many researchers find themselves managing a massive collection of PDFs, possibly with another program designed only for that purpose or through their own methods. Zotero makes it a breeze to import these PDFs, which takes much of the pain out of switching. | + | {{https:// |
- | Begin by dragging your existing PDFs into your Zotero library or use the "Store Copy of File" option from the add new item menu (green plus sign). Once they appear in the middle column, select the ones for which you wish to retrieve metadata. Right-click on them and select " | + | If Zotero can find a match for the PDF, it will create a full Zotero item with the available |
- | The Retrieve Metadata feature uses several method to locate item metadata. First, it will look for a Digital Object Identifier (DOI) or ISBN number and look these up in their respective registries. If no DOI or ISBN is found, Zotero will query the Google Scholar database for matches to the item text. If Retrieve Metadata | + | If you're not happy with the metadata |
- | //While this feature can greatly facilitate importing large existing libraries of PDFs, it **is not** the best way to add items to your library in general. | + | Zotero should retrieve high-quality metadata for most academic PDFs. While it can sometimes extract basic information (title, author) from other documents, you shouldn' |
+ | |||
+ | **Note: | ||
+ | |||
+ | ==== How It Works ==== | ||
+ | |||
+ | The Retrieve Metadata feature uses a Zotero web service to find item metadata. The Zotero client sends the first few pages of text from the PDF to the web service, which uses a variety of extraction algorithms and known metadata from Crossref, paired with DOI and ISBN lookups, |