TODO

   1 When priorities are embedded without a name, the name of the current machine is
   2 used. Perhaps a unique name for each instance of the current machine should be
   3 used instead. This idea could work well if applied to user-defined embeddings.
   4
   5 Verbose embeddings %eof a  to   $eof(a).
   6 User defined embeddings <-name(a1,a2,...).
   7 User defined operators expr1 <name> expr2.
   8
   9 Doesn't make make sense for [] to be the lambda (zero-length) machine. This
  10 should be the empty set (the empty machine). But then would it be invalid in a
  11 regular expression?
  12
  13 Add an option (probably -L) for turning off line numbers for debugging the
  14 generated state machine.
  15
  16 Expressions of the form: expr1 <: expr2 . expr3
  17 sometimes don't behave as expected. If expr2 contains the empty string then
  18 it's possible for the machine to escape to expr3. This is somewhat unexpected.
  19 The empty string does count as moving through the right machine so there should
  20 not be an exception for this case.
  21 Using the stronger implementation: ( expr1 $1 %0 . expr2 ) will solve the problem.
  22
  23 Also, there is potential for nondeterminism to persist via the empty string
  24 with the :> and :>> operators. Should also guard against this using leaving
  25 priorities.
  26
  27 The |> guarded operator and the <| guarded operator need to be added.
  28
  29 The fixed size stack is a problem for manual recursion of unlimited depth.
  30
  31 An option to turn off the removal of duplicate actions might be useful for
  32 analyzing unintentional nondeterminism.
  33
  34 Might be a good idea to add in some protection against using up all of a
  35 system's memory. This protection could then be removed by people when someone
  36 is sure they want to use a lot of memory.
  37
  38 fbreak should advance the current char. Depreciate fbreak and add
  39     fctl_break;
  40     fctl_return <expr>;
  41     fctl_goto <label>;
  42
  43 Should the fbreak functions execute the to-state actions of the target state?
  44
  45 It should be possible to import/export definitions.
  46
  47 If a scanner can be optimized into a pure state machine, maybe permit it to be
  48 referenced as a machine definition. Alternately: inline scanners with an
  49 explicit exit pattern.
  50
  51 The split codegen needs a profiler connected to a graph partitioning algorithm.
  52
  53 Die a graceful death when rlcodegen -F receives large alphabets.
  54
  55 It's not currently possible to have more than one machine in a single function
  56 because of label conflicts. Labels should have a unique prefix.
  57
  58 Emit a warning when a subtraction has no effect.
  59
  60 Emit a warning when unnamed priorities are used in longest match machines.
  61 These priorities may unexpectedly interact across longest-match items. Changing
  62 the language such that unwated interaction cannot happen would require naming
  63 longest-match items.
  64
  65 Testing facilities: Quick easy way to query which strings are accepted.
  66 Enumerate all accepted strings. From Nicholas Maxwell Lester.
  67
  68 Add more examples, add more tests and write more documentation.
  69
  70 A debugger would be nice. Ragel could emit a special debug version that
  71 prompted for debug commands that allowed the user to step through the machine
  72 and get details about where they are in their RL.
  73
  74 A quick and easy alternative would be a trace code generation option. This
  75 would open a trace file and list all the active machines at each step of the
  76 input.
  77
  78 Frontend should allow the redefinition of fsm section delimiters.
  79
  80 Do more to obscure ragel's private variables. Just a leading underscore is not
  81 enough. Maybe something more like __ri__.
  82
  83 Some talk about capturing data:
  84
  85 Separate tokstart/tokend from the backtracking.  One var for preservation,
  86 called preserve.  Write delcarations;  produces the necessary variables used by
  87 ragel.  Move pattern start pattern end concepts into the general?  The
  88 variables which may need to influence the preserve is dependent on the state.
  89 States have a concept of which variables are in use.  Can be used for length
  90 restrictions.  If there is an exit pattern, it is the explicit way out,
  91 otherwise the start state and all final states are a way out.