Skip to content

Conversation

@LVerneyEC
Copy link
Contributor

Hi,

This adds the possibility to archive raw content serve over HTTP (plaintext or markdown). This would typically cover archiving of content served through https://raw.githubusercontent.com.

Best,

@MattiSG
Copy link
Member

MattiSG commented May 19, 2025

Hi @LVerneyEC!

Thank you for this suggestion 🙂
Can you please provide a few examples of contractual documents that currently cannot be tracked and that would become trackable with this changeset?

Thank you!

@LVerneyEC
Copy link
Contributor Author

Not directly a contractual document, but an example would be https://github.com/xai-org/grok-prompts/blob/main/ask_grok_summarizer.j2. Would be way easier to track through the raw github endpoint :)

@MattiSG
Copy link
Member

MattiSG commented Jun 1, 2025

Thanks for clarifying the intention @LVerneyEC and for this suggestion!

Considering that adding a feature to the engine means guaranteeing its maintenance, we want to make sure that every additional feature aligns with Open Terms Archive’s Design Principles, that there are clear use cases associated with each of them, and that software quality is ensured 🙂

In the current case, we do appreciate the upcoming relevance of prompts, and would add software licenses as potential cases as well. We will request the following elements before proceeding with merging:

  • At least five examples of contractual documents that are provided to end users in the newly supported format. This could be obtained by a simple online search (ensuring that the TXT files are indeed the ones that are shown to the end users, in accordance with principle 3) for existing terms types.
    • If no such cases currently exist, seeing the provided example, new terms types such as “Source Code License” or “System Prompt” could be added, that would then support listing examples.
  • Automated tests are added for each new supported format, to ensure the stability of the feature over time. You can liaise with @Ndpnt for test design.

For the provided example (grok prompts stored on GitHub), while the topic is definitely exciting, the relevance of Open Terms Archive for tracking seems a bit remote, as one could simply clone the repository for history preservation, and directly subscribe to RSS to be notified of changes to that specific file 🙂

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants