Pre-SIP: A Syntax for Collection Literals

I already did that, kind of! A library solution is mostly adequate.

The strategy I took was to define a slice as an opaque type over a Long (yes, this means that you can’t stride, but those cases are comparatively rare) and have one type be “collection relative”, where if you want to be relative to the end you use End. Then I decorated Array with a ton of methods that take these.

It’s quite nice! I rarely miss Python slices any longer. It’s not quite perfect, because I have to avoid clashing with existing names.

But all over my code now I have things like

val ys = xs.select(1 to End-1)
ys.edit(3 to 10): (y, i) =>
  if y*i > 10 then -y else 2*y
ys(End-5 to End-2) = 0
ys(_ < -2) = -2
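
To make the approach concrete, here is a minimal, self-contained sketch of how End-relative indices could be encoded in an opaque type over a Long. The encoding (a high bit marking “counted from the end”) and all names here are my own illustration, not the library described above:

```scala
object EndRelative:
  // An index is a Long; one high bit marks "relative to the end".
  opaque type Ix = Long
  private val EndBit = 1L << 62

  object End:
    // End - k denotes "k elements before the end".
    def -(k: Int): Ix = EndBit | k.toLong

  extension (i: Int)
    // A plain absolute index.
    def ix: Ix = i.toLong

  extension (idx: Ix)
    // Turn an index into a concrete offset for a collection of this length.
    def resolve(length: Int): Int =
      if (idx & EndBit) != 0 then length - (idx & ~EndBit).toInt
      else idx.toInt
```

With this, `(End - 2).resolve(10)` gives 8, and slice methods on Array could accept pairs of such Ix values.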

Anyway, my conclusion from this is that you don’t really need the language to do anything for you. Regular syntax is enough unless you actually want to just copy the exact Python syntax so you don’t need to relearn it. Otherwise, I find 3 to End a lot clearer than 3:.

(If one wanted to intercept the usual 1 to 5 by 2 syntax of ranges via a macro, one could also do that. I find it still a bit too much work to make everything a macro, so I did without.)

Runnable version: Scastie - An interactive playground for Scala.

8 Likes

Having in-line XML in Scala made my keystone Scala project possible. Having in-line JSON would be amazing. I’ve worked on many projects that needed short snippets of these standard external formats for communicating with other systems. Maybe if this feature were packaged up as part of full support for JSON in Scala, it would be more attractive. Just checking json"{...}" format at compile time would be valuable.

However, something that “kinda acts like JSON” but isn’t “paste in JSON” doesn’t provide that value, and has none of the appeal. A non-standard, internal format doesn’t fit this common use case as well as json"".

In contrast, Seq(a,b,Seq(c,d)) structures have worked really well when I’m teaching. The better students hit command-B on Seq and discover apply() methods. (“What the heck is unapply()?” - it’s been that good.) Square brackets offer far less for them to find, especially if some unseen implicit holds everything together. Adding a 1991 Python feature, I think, takes teaching in the wrong direction.
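
For readers following along, the symmetry those students discover looks like this, using only the standard library:

```scala
// Seq(1, 2, Seq(3, 4)) is sugar for Seq.apply(1, 2, Seq(3, 4))...
val xs = Seq(1, 2, Seq(3, 4))

// ...and pattern matching runs the dual extractor, Seq.unapplySeq:
val described = xs match
  case Seq(a, b, inner: Seq[?]) => s"$a, $b, then a nested Seq of ${inner.length}"
  case _                        => "no match"
// described == "1, 2, then a nested Seq of 2"
```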

The other big use case discussed is build systems. Bleep uses a .yaml file for input - and simple Scala trait implementations for its plugins. The kids have no problem switching between the two. Bleep is a joy to use for projects that fit in its nascent ecosystem. Maybe an external .yaml file for lists of common things is a better general path forward for the build system use case.

3 Likes

It depends on what you mean by checking. Just checking syntax is not enough. I don’t know JSON very well but in XML you have schemas that you can validate against. In Scala this would be done by type checking against a type ascription:

val x: MyCaseClassOrCollectionThereof = json"{...}"

I’m not sure how this could be done concretely, without substantial macro machinery, and even then, do we have expected type information in macros? Again, I don’t know enough of macros to tell.

Well, they came across to me as a voice of reason. I believe we have evolved as a species to solve problems through diverse personality types. So even if you are disposed to see the negative over the positive, it’s always good to have at least one person with that disposition in a decision-making group.

So I would say please keep up the snarkiness, even though I don’t doubt that down the road it will be one of my bright ideas that you’re throwing cold water on.

As someone outside the main contributors circle, it seems like Scala has gone from one extreme to the other. For years it seemed virtually impossible to get anything changed. Now everything has to change, and by last week too. In the past it seemed that everything was sacrificed on the altar of “simplicity” and orthogonality; no special case could ever be considered. Now it seems like every special case under the sun must be catered to.

It’s good that we realised that people don’t want simple languages. I never believed that the attacks on Scala’s complicatedness were made in good faith. But it’s as if we’ve done an about-turn and are now deliberately trying to make the language more complicated.

Code and Data are different. Code needs a higher level of verbosity than Data, because Code can be anything, whereas Data can be succinct: the user of the data knows what it means and knows the types of the Data being parsed.

5 Likes

(This is all a bit of a tangent from Collection Literals at this point.)

We demonstrated “just checking the format” for XML was very valuable for projects with lots of little, boring snippets of XML. Those are more rare in 2025, out-competed in the ecosystem by smaller, still boring snippets of json.

We have a healthy ecosystem of libraries for processing json. I agree that handling json really should be in the province of those libraries, even at compile time.

I think tasks to support that work would add more value than the proposal here might add.

I am mildly curious whether there has been any more convincing (or unconvincing) around the previous suggestion to develop special-syntax string templates and interpolation as a way to “embed” literal syntax. To recapitulate some points in its favour:

  1. it follows the precedent of JSON literals in circe or SQL literals in doobie (to name but a few);
  2. it ring-fences (in a triple-quote ring) new syntax literals from the rest of the language, so the new syntax can be as feature-rich as desired without fear of harm to the old;
  3. it is library based, even standard-library based, which makes it easier to iterate, experiment, refine, or deprecate features or designs. Indeed, someone could write and publish one such template for each proposal and release them “next week”.

Edit: In short: why just use the same syntax that simpler languages do, rather than use a feature (typed string interpolation) that puts Scala ahead of Haskell or Java?

1 Like

I’m still not sure if string interpolation can undergo target typing. @odersky?

It can.
In the end, it’s just a “normal” method

implicit class Foo(sc: StringContext):
  def n[T:Numeric](args: Any*) = summon[Numeric[T]].zero

val i: Int = n""
val bi: BigInt = n""
1 Like

Interesting. So a dummy implementation of the present topic could be:

trait ExpressibleAsCollectionLiteral[+Coll]:
  type Elem
  inline def fromLiteral(inline xs: Elem*): Coll

given [T] => ExpressibleAsCollectionLiteral[Seq[T]]:
  type Elem = T
  inline def fromLiteral(inline xs: T*) = xs

extension (sc: StringContext)
  inline def coll[Coll : ExpressibleAsCollectionLiteral as e](inline args: e.Elem*) = e.fromLiteral(args*)

coll"[${1}, ${2}, ${3}]": Seq[Int] // ArraySeq(1, 2, 3)
1 Like

But, it does not work for case class literals. Suppose we have:

case class Person(name: String, age: Int)

To construct it, we would have to use something like [name=$name, age=$age] but, as per the string interpolation mechanism, which is based on varargs, the type of the parameters would have to be Any - that is, the LUB - thus losing type information.
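
The loss is easy to demonstrate with a toy interpolator (rec and its names are hypothetical, for illustration only):

```scala
case class Person(name: String, age: Int)

extension (sc: StringContext)
  // A varargs interpolator: the compiler widens all arguments to Any here.
  def rec(args: Any*): Vector[Any] = args.toVector

val name = "Martin"
val age  = 42
val fields = rec"[name=$name, age=$age]"

// fields is a Vector[Any]; the only way back to a Person is via runtime casts:
val p = Person(fields(0).asInstanceOf[String], fields(1).asInstanceOf[Int])
```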

So Martin’s data values proposal is still relevant.

I agree; the point isn’t to make conversion easier, but to reduce the confusion and frustration experienced by experienced developers coming from other languages. Converting working Python or other code to Scala exposes newcomers to various confusing and non-orthogonal syntax.

Not per se: you can define your interpolator however you like.
Want name and age? works!

implicit class Foo(sc: StringContext):
  def person(name: String, age: Int) = Person(name, age) // omitting checks for the string parts in sc for brevity

person"${"Martin"}${42}"

Interesting, but we’d need to be able to abstract over any case class.
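
One possible route, sketched here as an assumption rather than a worked-out design: scala.deriving.Mirror can already construct any case class from a typed tuple of its fields, which is exactly the type information the varargs encoding loses.

```scala
import scala.deriving.Mirror

case class Person(name: String, age: Int)

// Build any case class P from a tuple matching its field types exactly.
def fromFields[P <: Product](using m: Mirror.ProductOf[P])(fields: m.MirroredElemTypes): P =
  m.fromProduct(fields)

val martin: Person = fromFields[Person](("Martin", 42))
// fromFields[Person](("Martin", "42")) would not compile: field types are preserved.
```

The remaining work would be parsing the string parts into field positions at compile time; the construction side, at least, abstracts over any case class.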

Use-cases certainly drive language development, but I also strongly believe that language features attract use-cases.

One of the things Python did well was to provide a simple, concise, boilerplate-free language for talking about (numerical) data, which the community as a whole has fully embraced and could rely upon for external libraries as well, bringing simplicity and clarity to code. Chief amongst those language constructs are the list syntax and the slicing notation.

Considering only the list syntax, take the following pandas example:

df = pd.DataFrame(
    {
        "Name": [
            "Braund, Mr. Owen Harris",
            "Allen, Mr. William Henry",
            "Bonnell, Miss. Elizabeth",
        ],
        "Age": [22, 35, 58],
        "Sex": ["male", "male", "female"],
    }
)

And how it’d look in current Scala (using tabula):

val ds = Dataset(
    Map(
        "Name" -> Seq(
            "Braund, Mr. Owen Harris",
            "Allen, Mr. William Henry",
            "Bonnell, Miss. Elizabeth",
        ),
        "Age" -> Seq(22, 35, 58),
        "Sex" -> Seq("male", "male", "female"),
    )
)

Besides making things shorter and clearer, I also wanted to emphasise that a syntax for collection literals would enable focusing on intent and structure, ignoring the mechanism through which the data that you declare is passed.

If you’re not versed in the DS world, after seeing this short snippet you might doubt the significance of the impact the new syntax would bring. After all, is it worthwhile to bring in this addition to improve 4-5 lines of code? But this is not about beautifying a few lines of code here and there. Creating numpy arrays, selecting pandas columns, indexing pytorch tensors, … In the Data Science world your codebase is littered with such code, and as such it’s hard to make a case for Scala given the verbosity, even for someone like me who loves the language.

I truly believe this is one among several low-hanging fruits that Scala can pick to improve the language and make it more attractive for a wider audience.

Can we help drive this forward? I’d be keen to try it out in 3.8!

2 Likes

If Tabula wants to make you type extra stuff, because the culture in Scala is to type less stuff than Java but more stuff than Python, that’s its prerogative. However, there is no reason why you can’t get from that to something that is at least as compact as R.

val ds = Dataset(
  "Name" -> c(
    "Braund, Mr. Owen Harris",
    "Allen, Mr. William Henry",
    "Bonnell, Miss. Elizabeth",
  ),
  "Age" -> c(22, 35, 58),
  "Sex" -> c("male", "male", "female")
)

is a trivial amount of code away:

def c[A](elements: A*) = elements.toSeq
extension (dsc: Dataset.type)
  def apply(kvs: (String, Seq[?])*) = Dataset(kvs.toMap)
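
A self-contained version of that sketch, with a stub Dataset standing in for tabula’s (whose actual API differs) and a plain object instead of the extension, to show that the shorthand typechecks:

```scala
// Stub: the real tabula Dataset wraps more than a Map.
case class Dataset(columns: Map[String, Seq[Any]])

// R-style `c`: collect varargs into a Seq.
def c[A](elements: A*): Seq[A] = elements.toSeq

// Stand-in for an `extension (dsc: Dataset.type)` forwarder.
object Ds:
  def apply(kvs: (String, Seq[Any])*): Dataset = Dataset(kvs.toMap)

val ds = Ds(
  "Age" -> c(22, 35, 58),
  "Sex" -> c("male", "male", "female")
)
```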

With a lot more effort, one could make it work with tuples. With more effort yet, one could make it work with named tuples.

val ds = DataSet((
  name = (
    "Braund, Mr. Owen Harris",
    "Allen, Mr. William Henry",
    "Bonnell, Miss. Elizabeth",
  ),
  age = (22, 35, 58),
  sex = ("male", "male", "female")
))

We can do this already. Someone just has to want to enough.

Language features do attract use cases. But they can also clash with the existing feel. And that’s where the current proposal is iffy. It’s not that the feature itself is bad; I have hardly seen anyone saying, “I love extra boilerplate!” Rather, it’s that there don’t seem to be good choices that don’t induce some sort of pretty fierce clash between how the language feels and the new feature.

And, furthermore, we don’t seem to have a clear consensus on how to minimize the clash.

For example, I think that the correct way to interpret [22, 35, 58] is as a type, because [...] is always a type in Scala, but which can be reified into the corresponding collection. This would go for [xs.length, 2 + 7, foo(foo(foo(1)))], too; but that would require an extension to the type system to allow code literals to have a corresponding type, and for a code literal type to be treated as an inlineable piece of code. I haven’t worked through all the details, but I think this is a sound and self-consistent, albeit rather ambitious way to make the [...] syntax make sense within Scala. It’s a huge amount of work. Other people have other ideas, with their own arguments for and against. But overall, the biggest barrier is the friction, because (...) already means something, [...] already means something, and {...} already means something.

There are various things that don’t mean anything, but they also don’t mean anything in other languages, so the familiarity is zero. For instance, this is totally compatible, but I’m not sure anyone would go for this:

val ds = Dataset $
  "Name" -> $
    "Braund, Mr. Owen Harris"
    "Allen, Mr. William Henry"
    "Bonnell, Miss. Elizabeth"
  "Age" -> $ 22; 35; 58
  "Sex" -> $ "male"; "male"; "female"

Here $ would be a multi-expression token, which opens a block where the value from every statement is returned, with the whole thing typed as a tuple or array depending on expected type.

No language I know of has anything like this. It’s arguably even cleaner than Python. But it looks super weird.

10 Likes

I hear you when you say that there are alternative, less disruptive, ways to achieve compactness. However, I tried to highlight above (maybe unsuccessfully) that the new syntax truly delivers on its full potential only when we consider its value proposition as a whole:

  1. Shorter / boilerplate-free
  2. Clearer / more focused, by virtue of being shorter and using a familiar syntax
  3. More intentful, by delegating the choice of collection / mechanism to the definition

As such:

You achieve shortness, but you fall short on clarity and intentfulness. This can’t be solved at the library level: even if we abandoned our intentfulness goals and dismissed the need for language changes, the clarity gains would only come from a standard that all users and libraries can rely upon.
But then couldn’t we just implement c in the standard library? As has already been stated in the SIP, I think it’s fair to say that it’d still fall short on clarity / familiarity.

Although having spent a significant amount of time abroad helps me soften / erase the perceived friction, I can totally sympathise with this argument. Nevertheless, we wouldn’t be transgressing by extending the meaning of a syntactic element. Examples abound in other languages, from the concurrent dedication of {} to arrays and other, unrelated semantics, to the successful use of [] for both types and collections, as mentioned in the proposal.
And it’s not just other languages; to take a single example, how many uses do we have for parentheses?

val a: (Int, Int) = (3 * (identity(1) + 2), 3)

As such, having multiple semantics for a single syntactic element is not only pretty common and well accepted but also probably unavoidable, especially as languages grow and improve.

Probably, then, the main issue is that culturally, typing is a big deal in Scala and, historically, type parameters have enjoyed the privilege of being the sole owner of the square brackets.

So how do we proceed? Given the multiple potential benefits such a feature would have, its ease of implementation and its overall harmlessness (especially with regard to existing code), I think it would be, at the very least, a missed opportunity to dismiss it. And given the cultural / psychological nature of the friction, we may have to just try it out, see it in our code, get familiar with it and, based on that, decide whether to definitively adopt it.

1 Like

This is a good point, but I would counter that it’s actually rather annoying that () has multiple uses.

For instance, it makes it impossible to cleanly specify a one-element or zero-element tuple, despite these nominally being distinct things in the language that are not identical to an ordinary item or the unique Unit value (which is also spelled ()). And there’s the ongoing frustration regarding argument lists and tuples, which is mostly resolved in Scala 3, but largely by restricting what would otherwise be a completely okay thing to do.
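
Concretely, the overloading means parentheses alone cannot spell the small tuples:

```scala
val notATuple: Int = (1)              // (1) is just the Int 1
val one: Tuple1[Int] = Tuple1(1)      // a 1-tuple needs its constructor spelled out
val unit: Unit = ()                   // () is the Unit value...
val empty: EmptyTuple = EmptyTuple    // ...so the empty tuple needs a name too
```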

So, yes, I agree that [...] would be picked up pretty easily, and after some frustration people who picked it up easily would stop trying to index into things with [i]. And people would unlearn that [...] automatically means type and slow down a bit and figure out which it is.

What I don’t think is so clear is that this makes the language better overall. We already group things with (...), and the “magically [...] will be what is needed” aspect is hard to get right without magic causing mystery and mystery causing code with higher levels of technical debt.

I don’t have a very strong opinion either way. Mostly I am convinced of the extra complexity, but unconvinced of the need given the features Scala already has to make such things easy if one wants.

This is the part that I think is wrong. I think it is slightly harmful to all Scala code everywhere because of the extra intricacies of what syntax means. So to my eye, the feature had better be well worth it, because the cost is individually small but paid by everyone for all time, which together is a pretty big cost.

7 Likes

This is of course hard to assess. If Scala were the first language to introduce brackets for both collections and type parameters I would weigh this issue much higher. But it isn’t, there’s ample precedent in other languages where we do not see a mental clash.

I believe part of the problem to make progress here is choice fatigue. There are so many possibilities to invent a new syntax for collection literals! I did not want to monopolize this thread further so have held back with postings. But after very careful consideration of all points raised in the many posts here I must say that the observations of @datalin are spot on. It should be brackets or nothing, let’s restrict it to this binary choice.

6 Likes

The only language I know of that had both from the beginning is Nim. Python added type hints with square brackets, but the usage is far less widespread: you use them in optional type hints, and functions aren’t parameterized by type. The capacity for types and array-values to interleave is far less than in Scala because Python is very light on types.

Nim is a pretty good example. I don’t think it’s “ample precedent”, however.

I’m still not entirely convinced, though. Scala has types do a lot of heavy lifting. That type parameters are visually highly distinct I think helps.

Python restricts type hints to variable declarations, arguments, and return types. Nim does also. Scala can ascribe type hints anywhere to anything, and embed type parameters wherever needed in expressions.

The amount of context you need to gather to understand what is going on is an important consideration. Right now, if you see [Foo] you know it’s a type parameter of type Foo. But after the proposal, [Foo] might also be a length-one collection containing the Foo companion object. In a language that didn’t intentionally conflate the names of types and values for practically every concrete class, that wouldn’t be an issue. It is difficult to imagine that this isn’t going to raise the cognitive overhead. Maybe not by much, given that you don’t have to look much farther to discover whether it’s likely a type argument or in term position, but it’s hard to see how the overhead could be nothing.

So I don’t want to overstate the difficulty. I think it’s very surmountable. But I also don’t want to understate the difficulty, and therefore I think the feature needs to be pretty compelling.

To me the potential advantages are

(1) Works for length 0 and 1. Because () and (x) have their own meanings, parens don’t generalize to collections of length less than two.

(2) Slightly cleaner due to sharp edges and not needing one extra character.

(3) More visually distinct warning that magic might be hidden here, if we enable magic. Any character we used to dispatch a builder could be chosen by someone else to mean something else.

(4) Possible to use for multi-expression blocks, which parens can’t be:

val x = [
  [1, 2, 3]
  [4, 5, 6]
]

is syntactically available. It might be inadvisable because

val y = [
  if foo() then
    bar()
  else
    baz()
  quux()
]

seems pretty incomprehensible, even if the compiler would parse it cleanly and unambiguously as a two-element collection with the first element of either bar or baz, and a second element of quux (because those are the two expressions in the block).

Honestly, the most appealing thing to me on that list is handling the 0- and 1-element cases well. That’s a constant source of friction. The rest seems either of dubious value (multi-expression), an unexciting degree of improvement on something we could already do at the library level (val a: Vector[Int] = [1, 2] vs. val a: Vector[Int] = &(1, 2), as per JD557’s example), or a poor integration with the usual way to do this (e.g. can I write [Short][1, 2, 3] to make sure the element type is Short? I certainly could make &[Short](1, 2, 3) work!).

4 Likes

I don’t remember where we landed on the “map literal” syntax, but for the empty map you could follow what Swift does (its empty dictionary literal is [:]) and use the arrow alone: [->] for the empty map, with syntax like ['a' -> 23] for non-empty ones.

2 Likes