Ublog recommender to increase old blog visibility #17315

michael1241 · 2025-04-08T12:11:43Z

Works in combination with https://github.com/michael1241/lila-blogrecommender

Process:

blogrecommender ingests ublog_post collection from mongodb
blogrecommender builds neo4j graph database of blog posts and users who liked the posts
blogrecommender generates projection / calculation of blog similarity based on how many overlapping users liked posts
lila queries blogrecommender via http with a blog ID and gets up to 20 similar blogs by ID returned
lila displays recommended blogs at bottom of blog post in carousel
blogrecommender watches mongo replica changestream for insertions or edits to ublog_post collection and adds or updates the graph database accordingly (removal of likes is not updated)
blogrecommender updates graph projection hourly (for accuracy) and full database refresh weekly (to account for blog deletions/unliking/GDPR) (timeframe can be edited with whatever you think is best, full ingestion takes about 10 minutes, reprojection takes about 1 minute)

Still to do: set up on lichess server.

blog-recommender sample code for michael

…recommender

michael1241 · 2025-04-08T12:14:33Z

I'm not sure if the changestream will hit the secondary mongo server too hard if it's looking at every single view (since view count is updated)

schlawg · 2025-04-08T16:01:13Z

i know that neo4j has a quick turnaround for these requests even on prod data, but we'll need another cache in lila for this.

ornicar · 2025-04-09T09:28:58Z

This adds 2 new technologies to the prod stack: python and neo4j.

It also adds an HTTP server for lila to hit, and a connection to the prod mongodb replicaset. And it will require baby-sitting when it gets out of sync with the db, which will happen over the years for a variety of reasons.

These are serious infra commitments to carefully consider.

Also I see it lists "mongodb (8.0.4)" as a dependency, which we don't have in prod.

michael1241 · 2025-04-09T10:48:18Z

This adds 2 new technologies to the prod stack: python and neo4j.

True. I tried doing the same type of query directly in mongo and it was very slow / heavy. A graph database seems the correct technology for this service (unless we wanted to go for some ML approach which is even more complexity). Python is already present in Kaladin and Irwin so I didn't think that in itself was an issue. Neo4j could definitely be a concern, although the way I was thinking about it, Lila could fallback to showing other blogs of the user as it does currently, in case of issues with this new service. Ultimately it is or should be a non critical service.

It also adds an HTTP server for lila to hit, and a connection to the prod mongodb replicaset. And it will require baby-sitting when it gets out of sync with the db, which will happen over the years for a variety of reasons.

These are serious infra commitments to carefully consider.

The weekly refresh will avoid getting out of sync. But yes the additional network and database load is my main concern / question.

Also I see it lists "mongodb (8.0.4)" as a dependency, which we don't have in prod.

I think it should also work with current mongodb I just listed the version I used.

Ultimately if we don't go ahead with integrating this, I understand. I think it's worth trying due to being non critical - provided it doesn't disrupt / overload http or mongo. There are definitely efficiencies to be made there as well if necessary - more filtered changestream view, and batch responses for http.

```js db.ublog_post.updateMany({similar:{$exists:false}},{$set:{similar:[]}}) ```

ornicar · 2025-04-09T15:27:10Z

the UI is broken, there should be a card grid there

ornicar · 2025-04-09T15:29:47Z

we're not counting vertical pixels on that page anyway. Also the file was only loaded for logged in users. The UI was broken for anons

ornicar · 2025-04-09T17:54:26Z

ui/lib/css/layout/_page-menu.scss

    height: 100%;
-
-    // overflow: hidden; /* fixes crazy text overflow on Fx */
+    min-width: 0;


schlawg and others added 6 commits April 5, 2025 07:19

blog-recommender

12cb12f

Merge pull request #17303 from schlawg/ublog-recommender

ea30bd2

blog-recommender sample code for michael

Merge remote-tracking branch 'upstream/master' into ublog-recommender

800be1d

blog recommendation carousel

a2e5f0b

Merge remote-tracking branch 'upstream/ublog-recommender' into ublog-…

9f46750

…recommender

fix lint

ee76c8d

michael1241 requested a review from ornicar April 8, 2025 12:12

Merge branch 'master' into ublog-recommender

6f559f4

ornicar added 9 commits April 9, 2025 15:52

full and incremental computations of similar ublog posts

988e488

document mongodb scripts

eac442e

Merge branch 'master' into ublog-recommender

c08b32a

pnpm format

6fef73a

rename scripts

74f6223

remove references to ublog recommender external service - MIGRATION

f91e95b

```js db.ublog_post.updateMany({similar:{$exists:false}},{$set:{similar:[]}}) ```

compute 6 similar posts

eba6a4b

no need to fetch 4 posts here, also prevent recommendation duplicates

6bd10a6

move code out of app/ controller

4ae40ae

ornicar marked this pull request as ready for review April 9, 2025 15:26

fix other posts card grid UI

596068d

ornicar added 4 commits April 9, 2025 19:47

remove superfluous imports and dependencies

f47edd1

remove unused translation key

182ac3a

remove unused css dep

ef211e0

remove broken ublog page carousel

76ce16a

we're not counting vertical pixels on that page anyway. Also the file was only loaded for logged in users. The UI was broken for anons

ornicar approved these changes Apr 9, 2025

View reviewed changes

ui/lib/css/layout/_page-menu.scss

height: 100%;

// overflow: hidden; /* fixes crazy text overflow on Fx */

min-width: 0;

Copy link

Collaborator

ornicar Apr 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤔

better similar blog previews

828fb0d

ornicar merged commit fa11ba8 into master Apr 9, 2025
8 checks passed

schlawg deleted the ublog-recommender branch April 22, 2025 16:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Ublog recommender to increase old blog visibility #17315

Ublog recommender to increase old blog visibility #17315

Uh oh!

michael1241 commented Apr 8, 2025

Uh oh!

michael1241 commented Apr 8, 2025

Uh oh!

schlawg commented Apr 8, 2025

Uh oh!

ornicar commented Apr 9, 2025 •

edited

Loading

Uh oh!

michael1241 commented Apr 9, 2025

Uh oh!

ornicar commented Apr 9, 2025

Uh oh!

ornicar commented Apr 9, 2025

Uh oh!

ornicar Apr 9, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Ublog recommender to increase old blog visibility #17315

Ublog recommender to increase old blog visibility #17315

Uh oh!

Conversation

michael1241 commented Apr 8, 2025

Uh oh!

michael1241 commented Apr 8, 2025

Uh oh!

schlawg commented Apr 8, 2025

Uh oh!

ornicar commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

michael1241 commented Apr 9, 2025

Uh oh!

ornicar commented Apr 9, 2025

Uh oh!

ornicar commented Apr 9, 2025

Uh oh!

ornicar Apr 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ornicar commented Apr 9, 2025 •

edited

Loading