Suggestions wanted for archival of data

Hi all,
As we’ve seen with the core Oppia platform that we’ve used for some implementations, the size of the database can grow significantly over time, almost entirely due to the activity tracker log table.

Although this isn’t necessarily a problem on a technical level, I wonder if user tracker data doing back 3/4/5 years or more really needs to be kept, or should it be archived in some way?

A couple of thoughts I had about this:

  1. archive tracker data that’s older than X years
  2. archive tracker data for users who haven’t accessed the content in X years

To me, the second approach seems most appropriate, but we need to think about what ‘X’ should be. Some of this also depends on what type of analysis might need to be done in the future on this historical data.

Any suggestions/feedback welcome…
Cheers,
Alex