Ongoing Indexing Delays

ConversazioniTalk about LibraryThing

Iscriviti a LibraryThing per pubblicare un messaggio.

Ongoing Indexing Delays

1timspalding
Modificato: Set 10, 3:58 pm

Things are much better. See Message 29.

Over the weekend LibraryThing was up and down, and experienced other issues. We are now solidly up, but continue to have problems with system we use for indexing changes to members' catalogs.

As things stand, additions, deletions and edits work. You'll see the books in Add Books and your catalog. Features like clicking on tags and so forth work. But the indexing system for our catalog-search feature is delayed. We are reindexing members' catalogs regularly, but the reindexing is user-by-user, so expect delays.

Our plan is restart regular indexing Tuesday morning (September 10).

Thank you for your patience.

2lilithcat
Set 9, 3:40 pm

And thank you for keeping us informed!

3norabelle414
Set 9, 3:41 pm

Seconded! I appreciate the update.

4emsoahsd
Set 9, 3:42 pm

Thanks for update. 😉

5CENlibrary
Set 9, 4:01 pm

Thaks for the information

6Bookmarque
Set 9, 4:02 pm

Yes, thanks for the update. It was a bit wonky at times, not the usual quick response so I knew something was up. Luckily I haven't hit any used book sales lately so have no piles of things to enter. Hm...that's a bummer in a way.

7pgmcc
Set 9, 4:03 pm

>1 timspalding:
Thank you for the update.

8Mauritiana
Set 9, 4:40 pm

>1 timspalding: I had noticed many months ago that some of my French books were not indexed, and could not be retrieved through searching, and nobody at LT could explain why. I hope this re-indexing will fix things.

9RWVLibraries
Set 9, 4:43 pm

OMG. thank you so much - I was really overwrought...

I will take a nap. and try my serious load-ups tomorrow.

Pat M

10Garcia-Gonzalez
Set 9, 5:05 pm

Thanks for the update Tim, perhaps this is a good time for you to correct the number of reviews on my book? (It should be 5 reviews.)

Nelson

11quintanar
Set 9, 5:24 pm

>1 timspalding: Gracias Por la informacion, y su magnifico servicio

122wonderY
Set 9, 7:44 pm

13NicheNicoal
Set 9, 8:35 pm

I'm so glad to hear this was a noted issue, I thought something was wrong with how I was typing/seeing lmao

14SRB5729
Set 9, 9:25 pm

Just sending some appreciation for the folks who make all this happen.

15CharlesTheB
Set 9, 9:35 pm

Thanks for the update. I thought I was doing something wrong.

16karenb
Set 9, 10:04 pm

Thanks so much for announcing what's happening -- and making sure that everyone sees it.

17rmharris
Set 9, 10:35 pm

Thanks for keeping us informed. I guess this problem is also affecting the App as well - I've been unable to add books that way all day

18TsarinaTyna
Set 9, 10:43 pm

>1 timspalding: thanks for all you do so well!

19jamesjarrett00
Set 9, 11:29 pm

Thanks for the heads up!

20AndreasJ
Set 9, 11:58 pm

I had some trouble accessing the site on Sunday, but assuming it was simply because I was in a place with poor 4G coverage (some other sites were also hard to reach, but LT seemed to be worst affected).

21mmodine
Set 10, 1:26 am

Noticed this as well. Thanks for your hard work to fix it!

22MissWatson
Set 10, 4:48 am

Thanks for letting us know what's going on.

23arborschool
Set 10, 10:37 am

Whew! I thought it was me. Thanks for the update!

24Twinmountain
Set 10, 11:01 am

>1 timspalding: Thanks for the update. I was looking at the website and there's no apparent support contact or status updates.

25nazcasaitaliana
Set 10, 11:44 am

Thank you!

26JohnWCuluris
Set 10, 11:45 am

It still won't let me sign in on the phone app. i never signed out on the PC so I don't if there is a problem there too.

27chuck_ralston
Set 10, 11:55 am

>1 timspalding: Tim, thanks for the LT situation report. I usually start my day on the computer opening Library Thing (along with coffee). Today I started the day with the VA's "My Healthy Vet" -- NOT a user-friendly experience. Thank you for staying on top of LT.

28jasbro
Set 10, 2:35 pm

>17 rmharris: The app still isn't working for me. Are you able to use it? If so, what sort of device and operating system do you use?

29timspalding
Set 10, 3:26 pm

Okay, here's an update:

1. Site should be up and operating since the morning.
2. App should be too
3. New adds, edits and deletes have been going to the index since this morning.
4. Adds, edits and deletes made during the downtime are still processing. It's going member by member, so you either have the updates or you don't. It should finish in the next 12h. We'll post an update when it happens.

>28 jasbro:

Can you describe how it's not working?

>26 JohnWCuluris:

Still true?

30ocrhdlg
Set 10, 3:33 pm

>29 timspalding: Just out of interest, when you have time, can you tell us in general terms what caused the problem?

31conceptDawg
Set 10, 3:56 pm

>30 ocrhdlg: Our queuing system (that normally handles changes to catalog data and the search indexes immediately) crashed and caused the queues to be corrupted. We had to rebuild the queues and then requeue the items that didn't get processed while the queues were down over the weekend.

32timspalding
Modificato: Set 10, 4:00 pm

>30 ocrhdlg: Yeah. It was a serious of interlocked problems. This is my summary. I'm not the sysadmin, so I may have gotten some details wrong.

1. We've had some problems with our queueing system (RabbitMQ). These servers went down over the weekend. They weren't perfect before that, but they were under repair and expected to last the weekend. We believe that bot traffic sent them over the edge (see #2). We fixed this by turning off index updates until developers could fix the system on Monday. The system was fixed yesterday, so new updates started going through. Unfortunately, because the queue was down, we had no record of all the edits we missed. So we've had to reindex everyone who did basically anything on the site during the downtime. That's many thousands of users, doing ALL their books. That's taking a while to eat down.

2. We've been having more and more problems with bad, bot traffic. We've always had a lot of bot traffic. Some was well-behaved (Google), some so-so (sometimes even large companies would ignore robots.txt, our robots directive, or scrape us too fast) and some was bad. But, apart from the DDoS attack we had a year ago, the bad traffic was mostly manageable.

Since LLMs took off, internet traffic has seen huge jumps, as everyone and their brother wants to scrape the whole web to build their AI system. (Some coverage here.) We've seen huge increases, both from established players and mysterious organizations. Even the good players are remarkably cavalier about obeying robots.txt. (There's a good news article about this). The bad ones are… bad.

Anyway, over the weekend we started to get tons of new traffic. Much was from Singapore IPs. (Singapore has been a common culprit for us. I think there are some big data centers there.) It caused us to go down a number of times. It was the worst kind of traffic--incessant and using many, many IPs to avoid being blocked! We solved the traffic problem with some new "rules" on Cloudflare and our servers.

3. Unfortunately, the rules we put in place were faulty--they always worked at first, and then went south when an error happened. So we've be up and happy, the developer or sysadmin would leave, and then the problem would pop up again. The mechanics of it looked like the problem was bad bot traffic, but actually it was the SOLUTION to the bad bot traffic. The problem was a timeout set backwards--a high number not a low. This morning ccatalfo figured out the setting was backwards, and reversed it. Everything started working again perfectly.

Anyway, that's what I got for you, as a non-sysadmin.

33conceptDawg
Set 10, 4:17 pm

To give a little context of scale here: in the last 24 hours we've had over 10 million requests that have been managed by some of our bot rules. These aren't our total served, these are just the ones that we've managed by some rules that we have set up. Even more are getting through the cracks, some are being managed by CloudFlare before they get to our rules, and of course we also have legitimate traffic.

The numbers have been steadily increasing since the AI bots have come on the scene in the last year.

34reconditereader
Set 10, 4:47 pm

I really feel for all the hard work your team consistently puts in to manage this situation! Thanks for doing it.

35ocrhdlg
Set 11, 3:47 am

>34 reconditereader: Yes, I agree. Many thanks to Tim and his colleagues for their trouble-shooting and these helpful explanations.

36MissWatson
Set 11, 5:20 am

Thank you for explaining and for all your hard work. I had no idea what is going on behind the scenes.

37RWVLibraries
Set 11, 7:56 am

That was in "interesting" tale of the "interstices of the interweb".

We take a lot for granted.

Meanwhile I have 3 books sitting in a queue waiting to be uploaded because I won't begin the REAL uploading until those three are in the list.

What is the status of the system? is it still indexing?

I made the error of starting this add collection process over the weekend and it was painful and of course crazy-making.

Thanks

38RWVLibraries
Set 11, 11:17 am

In addition, the books that were added by hand (all 800) ... are not randomly appearing on the main screen but showing up alphabetically. As many of them are by the same author (hey this is a place with old people!) it does not reflect the wonderful variety we do have.

Is that is feature as of this weekend's event???

39jasbro
Set 11, 11:40 am

>29 timspalding: Can you describe how it's not working? No, not really at the moment. And having added 13 books to our library over the weekend (not trying the app), it may be a while before I have a quantity to try it again. (Hope not, but never know!) As I recall the last time I tried the app (still on a dinosaur iPhone 8 Plus, but at least it's iOS 16.7.10), it let me in, went all the way to the barcode scanner, "beeped" in my first item - and then it just froze. Going back a ways (maybe in August? July? before then?), earlier this week I found two twice-entered records from an earlier attempt with the app; as I recall, it was freezing up then too, and not realizing those works had actually made it to our library, I tried, tried again. If any other specifics would help, please let me know; thanks.

>That sure puts li'l ol' me with my obsessions & predilections in stark perspective!

40AnnieMod
Set 11, 12:31 pm

>39 jasbro: The iPhone 8's freezing after the beep after a scan is kinda known issue - we had been stuck with it for years now. :)

41conceptDawg
Modificato: Set 11, 1:25 pm

The indexing is still going. Unfortunately a couple of the indexers died overnight and we had to restart them this morning. But progress continues. We will update as soon as it finishes but you might find your catalog done well before that. Searching for your most recently added book (or, more specifically, books added Sunday-Monday) and finding it in the results is a good indicator that the indexer has completed your catalog.

42conceptDawg
Set 11, 1:26 pm

The iPhone 8 has a hardware issue (that ONLY presents on iPhone 8, no other variation of the iPhone). We've been trying to work around that issue with various permutations but haven't been able to solve it.

43bnsmith
Set 11, 2:04 pm

>42 conceptDawg: oh interesting. I have noticed an issue with iphone 8 and camera view for ISBN bar codes. It worked really well at first, now just drags (for some time)

44paradoxosalpha
Modificato: Set 11, 11:21 pm

>41 conceptDawg: Unfortunately a couple of the indexers died overnight ...

Since an "indexer" has in the past been a type of human worker in publishing, this phrase was highly alarming at first glance. I presume, though, that these were machines that needed to be "restarted."

45garypederson
Set 11, 5:11 pm

>1 timspalding: i just joined and added one book to my library, “dead men do tell tales”. I want to read it but can’t find a button to do so. Help. Gary pederson

46AnnieMod
Set 11, 6:18 pm

>45 garypederson: You cannot. LT is a cataloging site, not a library site. You cannot read books here.

LT contains links to places where you can search for the book and eventually find it (for free or more often to buy it) but the site itself does not contain any book. For these links, look at the Quick links menu on the work page.

47jasbro
Modificato: Set 11, 10:32 pm

>42 conceptDawg: Does that include the 8 Plus? (I’m guessing “yes,” but it seems worth asking.)

>44 paradoxosalpha: My thought exactly!

48GraceCollection
Modificato: Set 11, 11:13 pm

Thank you for all the work you do!

49GuildfordInstitute
Set 12, 4:36 am

>41 conceptDawg: Could you say whether or not MARC imports are affected by this problem? I've been trying over the last few days to import several hundred works (mostly without ISBNs) but nothing has been processed AFAIK. Also, is there any way to view a list of background jobs and their status, so we could check?

50RWVLibraries
Set 13, 11:12 am

I cannot upload only 4 books w universal import.

51conceptDawg
Set 15, 12:43 am

>49 GuildfordInstitute: We've had some recurring problems with our queuing system this week. I'll let "the other Chris" comment on this when he comes in on Monday. He's our resident import-queue guru—and also happens to be our MARC guru also, so bonus.

52ryasef
Set 16, 1:12 am

Queueing system might still be funky. I got about 100 of about 400 imported and then it stalled.

53ccatalfo
Set 16, 8:54 am

>49 GuildfordInstitute: the imports are really a separate system and so wouldn't have been affected by these queue issues. That said, the import system has its own set of ongoing "infelicities" (for example no way to see the status of current or past ones) which we need to address.

>52 ryasef: Sorry about this: yes, i can see on our side that only about a quarter have finished importing. File looks fine so need to figure out what's holding them up.

54ryasef
Set 16, 1:24 pm

>53 ccatalfo: If it's helpful, it looks like all the Overcat ones worked but anything that wasn't found in Overcat (which was Option 1) is now stalled.

55ryasef
Set 17, 11:20 am

>53 ccatalfo: Can confirm the stall continues; I tried manually uploading 4 of them again and I think they are also stalled? For what it's worth, I did change my username between the big upload a day or two ago and now.

56ccatalfo
Set 19, 7:24 am

>55 ryasef: Thanks for the updates - we've restarted some indexing processes as of yesterday afternoon and that has kicked things into a higher, more normal, gear.

Could you try importing once again - it looks as if you aborted the last few (out of frustration, I bet)?