[DISCUSS] Prototype a Base64Encoder by kinkie · Pull Request #2452 · squid-cache/squid

kinkie · 2026-06-27T08:08:45Z

Stemming from the converstaions in PR #2447 which focuses on
addressing a problem as narrowly as possible , here
is what I would envision a base64 encoder to look like.

It's a prototype, vibe-coded with lots of hand-holding on my side,
so please don't focus too much on the code itself, it comes with
a comprehensive unit test which is meant to showcase the API.

Does anyone see anything obviously wrong with the approach or is
it worth investing into by reviewing, polishing, writing a decoder and using it?

yadij · 2026-06-27T13:13:40Z

The only issue I see is that Squid base64 encoding is only used on buffers. So we would end up adding a stream just for the purpose of using the encoder.

rousskov

I had already flagged this API direction as problematic before this PR was posted. Will find time to detail that claim.

rousskov

IMO, the serious problems identified in this incomplete review are enough to warrant switching to a different approach to base64-encoding. There are other, mostly secondary problems in this PR that this review does not flag. Most will probably disappear on the adjusted path.

I can suggest starting with an API like this:

namespace AnyP {

/// A base64-encoded version of the given input.
SBuf Base64Encode(const SBuf &);

} // namespace AnyP

rousskov · 2026-06-29T18:43:17Z

+ * This class inherits from std::ostream to provide a familiar streaming
+ * interface, similar to SBufStream.
+ */
+class Base64Encoder : public std::ostream


The proposed inheritance is an API mistake for several reasons, including:

std::ostream is meant for writing bytes to various destinations (as defined by derived classes). A base64 encoder should not know or care about the destination of the encoded bytes. I should be able to use the same encoder for writing into a file, a socket, or an SBuf. Changing high-level information into written bytes is the domain of formatting I/O functions (e.g., STL-provided << operators) and I/O manipulators, not std::ostream kids.

Base64 encoding requires a special action to finalize the result. std::ostream does not have a caller-convenient, safe, and efficient way for triggering that action. Neither class destructor nor buf() are it, for various reasons.

Formatted and locale-specific output (e.g., various STL-provided << operators and manipulators) is not fully compatible with many (all?) known base64 use cases in Squid code, where callers must base64-encode values while following strict syntax rules. An extra space character or the wrong number representation is likely to violate the protocol requirements but is not going to be caught by the compiler. In this context, we probably want to highlight value conversions rather than hide them behind various convenience APIs.

rousskov · 2026-06-29T18:45:15Z

+Base64Encoder::Base64StreamBuf::sync()
+{
+    encoder_.encodePending();
+    encoder_.finalize();


This sync() implementation essentially violates STL streambuf::sync() API. STL code that we do not control should be able to call this method multiple times, as needed, without being worried that the very first call effectively places (or should place) the stream into a "no more updates" state, a state that Base64Encoder enters when its finalize() method is called.

rousskov · 2026-06-29T19:04:59Z

+
+    /// Create a Base64Encoder with optional maximum encoded output size limit
+    /// \param maxEncodedSize maximum encoded output size (default: noLimit)
+    explicit Base64Encoder(size_t maxEncodedSize = noLimit);


Burdening this SBuf-based class with explicitly maintaining output size restrictions is probably wrong. The underlying code should not care. Modern users should not care either. Legacy users, if any, would be better off with checking when converting from SBuf into their own storage.

kinkie added 2 commits June 27, 2026 09:50

Prototype a Base64Encoder

aaebdbd

fix memcpy

89cb57e

kinkie force-pushed the base64encoder branch from 4877512 to 89cb57e Compare June 27, 2026 08:56

yadij added the M-ignored-by-merge-bots https://github.com/measurement-factory/anubis/blob/master/README.md#pull-request-labels label Jun 27, 2026

rousskov requested changes Jun 28, 2026

View reviewed changes

rousskov requested changes Jun 29, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DISCUSS] Prototype a Base64Encoder#2452

[DISCUSS] Prototype a Base64Encoder#2452
kinkie wants to merge 2 commits into
squid-cache:masterfrom
kinkie:base64encoder

kinkie commented Jun 27, 2026

Uh oh!

yadij commented Jun 27, 2026

Uh oh!

rousskov left a comment

Uh oh!

rousskov left a comment

Uh oh!

rousskov Jun 29, 2026 •

edited

Loading

Uh oh!

rousskov Jun 29, 2026

Uh oh!

rousskov Jun 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

kinkie commented Jun 27, 2026

Uh oh!

yadij commented Jun 27, 2026

Uh oh!

rousskov left a comment

Choose a reason for hiding this comment

Uh oh!

rousskov left a comment

Choose a reason for hiding this comment

Uh oh!

rousskov Jun 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rousskov Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

rousskov Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rousskov Jun 29, 2026 •

edited

Loading