Opened 10 years ago

Closed 10 years ago

#396 closed defect (fixed)

Accents are lost when scraping multiple items

Reported by: dstillman Owned by: simon
Priority: major Milestone: 1.0 Beta 3
Component: ingester Version: 1.0
Keywords: Cc:

Description

Steps to reproduce:

  1. Go to http://libraries.colorado.edu/search/aderrida/aderrida/1,2,161,B/exact&FF=aderrida+jacques&49,160
  1. Click the folder icon in the URL bar.
  1. Select two items and save them.

Accents are lost and the text becomes jumbled, e.g. "L'Écriture Et La Différence" becomes "L'�Ecriture Et La Diff�ere".

Note, however, that scraping a single item works correctly both from the item page and from the multiple item interface.

Change History (2)

comment:1 Changed 10 years ago by stakats

For single records in InnoPAC, we parse the text MARC record. For multiple records, we are pulling MARC binaries. Looks like we're experiencing some kind of encoding issue with the binaries.

comment:2 Changed 10 years ago by simon

  • Resolution set to fixed
  • Status changed from new to closed

(In [888]) closes #396, accents are lost when scraping multiple items (with InnoPAC)

Note: See TracTickets for help on using tickets.