Proposal: Potentially Useful openrc Feature

krinn · Watchman Joined: 02 May 2003 Posts: 7470

i would disallow myself b and d format.
if you use ":" only "cmd:_" or "cmd:$" are valid (no space between command and ":"), it will even ease parser as you need to look at next char after a command, is it "_" or ":" or "$"

i don't know if "cmd:_args" or "cmd:args" should be both valid. I suppose it could if you still parse only one char after command, you can assume "start: --args" as cmd:<start> and args:<_--args>, i don't think any program will dislike a param starting with a space.

John R. Graham · Posted: Tue Feb 07, 2017 8:24 pm Post subject:

The shell argc / argv construction mechanism does most of the parsing for us, throwing out leading and trailing whitespace. The only question (with the "krinn approach", anyway) is whether we recognize ':' as a separate argv value or only when appended to a valid command.

The construct "cmd:arg" as laid out by Mr. krinn himself, looks like an argument, or else an undefined command, not a command with an argument.

- John
_________________
I can confirm that I have received between 0 and 499 National Security Letters.

steveL · Posted: Tue Feb 07, 2017 8:54 pm Post subject:

Yeah, you start to get into lexing vs standard option parsing.

Really the pair of you are confirming my feeling against colons (which are not that rare ime) as a special introducer.

Any halfway experienced C coder is used to handling '--', as it's required by POSIX (which doesn't mandate longopts.)

John R. Graham · Posted: Tue Feb 07, 2017 9:25 pm Post subject:

Part of my objection to it is that your proposed use of '--' doesn't always feel like a "special introducer" to me, a term which I take to mean something along the lines of, "Things after this point are different." Am I close?

Its first occurrence is used to signal that commands take arguments. Wouldn't it be more natural, more consistent with *nix conventions, if this is the right approach, to use a regular option, for instance -a and --args, to indicate that commands on this invocation take arguments? Its second occurrence is used to signal the end of the current argument list—and possible start of a new command and optional associated arguments, so it's being used as a separator here, not as a special introducer, if I understand correctly, and '--' is never used as a mere separator with any other *nix tool that I know of. Anyway, that's what makes it look odd to me.

- John
_________________
I can confirm that I have received between 0 and 499 National Security Letters.

steveL · Posted: Tue Feb 07, 2017 11:02 pm Post subject:

Looking back it appears I missed an earlier post, my bad.

John R. Graham · Posted: Tue Feb 07, 2017 11:15 pm Post subject:

krinn · Watchman Joined: 02 May 2003 Posts: 7470

John R. Graham · Posted: Wed Feb 08, 2017 1:13 am Post subject:

krinn · Watchman Joined: 02 May 2003 Posts: 7470

I have rethink about it, the trouble is keeping "multi-command" in one line and passing args to them.
While openrc accept multi-command, it would just make everything easy to accept only one command with args.
Two running mode, multi-command no args or one command with args ; you remain compatible, you can use whatever you like as separator, you only loose ability to run multi-command in one line if your command have args.
And the parsing will be a piece of cake: have -- ? take what is prior -- as command, take what is after -- as args ; if no -- it's multi-command parsing, just as of today.

John R. Graham · Posted: Wed Feb 08, 2017 1:22 pm Post subject:

If I'm interpreting the syntax guidelines properly, at least it's compliant usage, but don't give up just yet. It seems to me that

steveL · Posted: Thu Feb 09, 2017 12:04 am Post subject:

steveL · Posted: Thu Feb 09, 2017 12:31 am Post subject:

John R. Graham · Posted: Thu Feb 09, 2017 1:35 am Post subject:

steveL · Posted: Thu Feb 09, 2017 1:44 am Post subject:

steveL · Posted: Thu Feb 09, 2017 2:05 am Post subject:

steveL · Posted: Thu Feb 09, 2017 2:14 am Post subject:

John R. Graham · Posted: Thu Feb 09, 2017 2:19 am Post subject:

I think we're back in order now.

Hu · Moderator Joined: 06 Mar 2007 Posts: 21633

As a quick aside, when I suggested semicolon, I had not looked at (and still have not looked at) the internals you two are discussing patching. I picked it because, as steveL recapped, any delimiter that gets dropped into the middle of an existing language needs to be very unlikely to be otherwise valid in the language. I did not know at the time that the data is processed in a way that makes some characters more complicated than others. In light of that quirk, semicolon may not be a good choice.

mv · Watchman Joined: 20 Apr 2005 Posts: 6747

John R. Graham · Posted: Thu Feb 09, 2017 2:34 pm Post subject:

Yeah, sorry. We're discussing the relative trivialities of syntax and it's gotten long. From earlier in the thread:

John R. Graham · Posted: Mon Feb 27, 2017 12:05 am Post subject:

I have completed a proof of concept implementation of the modified syntax. As soon as Infra gets my overlay set up I'll make this available via Layman, but for the time being if you want to take a look, the work is available on a fork of openrc on GitHub. I'm working on three syntax variations:

The krinn syntax is on branch script-cli-args.
The steveL syntax is on branch double-dash.
The single command per invocation syntax, introduced by -a, is on branch dash-a.

All three implementations preserve backwards compatibility in all respects. Interestingly, the patch files for the two "delimiter" approaches against current stable openrc differ in length by only 12 bytes. #3 is a bit longer because more changes have to be made to pass the -a option from rc-service to openrc-run.

Obviously there's a major increase in complexity involved in parsing the colon!

Pulling my tongue resolutely out of my cheek, all three are minor changes with the major effort expended becoming familiar with the code base.

They're all three basically feature complete, man pages and everything, but I'm still working on a few additional enhancements:

Provide a mechanism for init scripts to advertise that they can use arguments, erroring out (or at least warning) on those that can't if arguments are present. Just so you know, arguments are completely harmless to init scripts that don't use them, but it seems the right thing to do. (Such a mechanism should require no modification to init scripts that do not take arguments.)
Enhance the built-in usage description, which is somewhat incomplete.
For completeness, implement an escaping mechanism for the unlikely event that required arguments collide with the new syntax. Not required for #3 above, which is a point in its favor.

Comments, scathing critiques, accolades, or anything in between all welcome.

- John
_________________
I can confirm that I have received between 0 and 499 National Security Letters.

John R. Graham · Posted: Wed Mar 08, 2017 6:58 pm Post subject:

In the furtherance of (a) and (b) above, I'm expanding an existing mechanism that openrc-run uses to collect information on the init script. The init script is already sourced and the description variable is echoed and then collected. The following is a generic expansion of that (very) short shell script so that it can be easily expanded to collect arbitrary information:

steveL · Posted: Tue Mar 21, 2017 11:36 pm Post subject:

mv · Watchman Joined: 20 Apr 2005 Posts: 6747

John R. Graham · Posted: Wed Mar 22, 2017 8:14 pm Post subject:

Is true not portable in some cases? It is mentioned in the POSIX Shell spec.

- John
_________________
I can confirm that I have received between 0 and 499 National Security Letters.