Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
retrieve_pdf_metadata [2019/10/30 15:28]
dstillman
retrieve_pdf_metadata [2020/05/22 16:32]
dstillman
Line 1: Line 1:
 ====== Retrieve PDF Metadata ====== ====== Retrieve PDF Metadata ======
  
-Users new to Zotero may find the prospect of importing all their data somewhat daunting. Many researchers already have a large collection of PDFs that they'​ve previously organized manually ​or using another program. Zotero makes it easy to import these PDFs and retrieve full bibliographic metadata (for searching, citing, indexing, and organizing),​ taking much of the pain out of switching. ​+Users new to Zotero may find the prospect of importing all their data somewhat daunting. Many researchers already have a large collection of PDFs that they'​ve previously organized manually. Zotero makes it easy to import these PDFs and retrieve full bibliographic metadata (for searching, citing, indexing, and organizing),​ taking much of the pain out of switching. ​
  
-Follow these steps to use the Retrieve Metadata for PDFs function: +To use this feature, simply drag your existing PDFs into your Zotero library or use the "Store Copy of File" or "Link to File" options from the add new item menu (green plus sign). By default, Zotero will automatically retrieve metadata for each PDF, create an appropriate parent item, and rename the associated ​file based on the metadata(You can disable these automatic functions in the [[preferences/​general|General pane]] of Zotero preferences.)
-  - Drag your existing PDFs into your Zotero library or use the "Store Copy of File" or "Link to File" options from the add new item menu (green plus sign). +
-  - By default, Zotero will [[/​blog/​zotero-5-0-36/​|automatically retrieve metadata for PDFs]] ​and rename the associated ​filesIf you prefer, you can disable these automatic functions in the [[preferences/​general|General pane]] of Zotero preferences.+
  
-If you have disabled automatic metadata retrieval or have other PDF files already in your library, you can also retrieve metadata manually: +{{https://www.zotero.org/​static/​images/​blog/​5.0.36/auto-recognize@2x.gif?744}}
-  - In the middle pane, select the PDFs you want to retrieve metadata for. +
-  - Right-click on the PDFs and select "​Retrieve Metadata for PDFs". +
-  - If Zotero can find a match for the item, it will create a full Zotero item with the available data and attach the PDF. +
-  - If Zotero can't find metadata for an item, you can right-click on it and choose "​Create Parent Item" to manually enter the item metadata.+
  
-With this featurethere should be no major hurdles to switching to Zotero and taking full advantage of all its powerful searchindexingorganizing, and citation features.+If Zotero can find a match for the PDFit will create a full Zotero ​item with the available data and attach the PDF. If it can't, it will leave the PDF as a standalone attachment, allowing you to add a parent item [[adding_items_to_zotero|from the web or by identifier]] and drag the PDF onto it. If all else failsyou can right-click on the PDFchoose Create Parent Item, and manually enter metadata for the item.
  
-The Retrieve Metadata feature uses a Zotero ​web service to find item metadata. ​The Zotero client sends the first few pages of a PDF to the web servicewhich uses a variety of extraction algorithms and known metadata ​from Crossrefpaired with DOI and ISBN lookups, to build parent item for the PDF. The Zotero lookup service ​doesn’t ​require a Zotero account and doesnt log any data about the content or results of searches.+Zotero ​should retrieve high-quality ​metadata ​for most academic PDFsWhile it can sometimes extract basic information (titleauthor) ​from other documentsyou shouldn'​t expect that — anything can be distributed as a PDF, but that doesn’t ​mean thereany standard metadata available for it.
  
-Note: While this feature can greatly facilitate importing large existing libraries of PDFs, it **is not** the best way to add items to your library in general. Items can be imported faster by using the Zotero Connector plugin in your browser from publisher websites or most scholarly databases. This saves several steps versus downloading the PDF manually and adding it to Zotero. The item metadata will also often be higher quality. See [[Adding Items to Zotero]] for more info.+**Note:** While this feature can greatly facilitate importing large existing libraries of PDFs, it **is not** the best way to add items to your library in general. Items can be imported faster by using the Zotero Connector plugin in your browser from publisher websites or most scholarly databases. This saves several steps versus downloading the PDF manually and adding it to Zotero. The item metadata will also often be higher quality. See [[Adding Items to Zotero]] for more info
 + 
 +==== How It Works ==== 
 + 
 +The Retrieve Metadata feature uses a Zotero web service to find item metadata. The Zotero client sends the first few pages of text from the PDF to the web service, which uses a variety of extraction algorithms and known metadata from Crossref, paired with DOI and ISBN lookups, to build a parent item for the PDF. The Zotero lookup service doesn’t require a Zotero account and doesn’t log any data about the content or results of searches.
retrieve_pdf_metadata.txt · Last modified: 2020/05/22 16:32 by dstillman