[Home] [Downloads] [Search] [Help/forum]

Register forum user name Search FAQ

Gammon Forum

[Folder]  Entire forum
-> [Folder]  MUSHclient
. -> [Folder]  Lua
. . -> [Subject]  Converting lua patterns to PCRE regex

Converting lua patterns to PCRE regex

It is now over 60 days since the last post. This thread is closed.     [Refresh] Refresh page

Posted by Fiendish   USA  (2,525 posts)  [Biography] bio   Global Moderator
Date Sun 21 Jun 2020 09:37 PM (UTC)
Using rex.gsub from http://mushclient.com/forum/?id=13348&reply=6#reply6 (because it's fun), I think this code converts Lua string matching patterns into their PCRE pattern equivalents.

The only thing explicitly not handled at the moment is Lua's frontier pattern.

rex.gsub = function(str, re, rep)
   local output = ""
   local as_func = (type(rep) == "function")
   local startfrom = 1
   local s, e, t = re:match(str, startfrom)
   while s ~= nil do
      local filled_rep
      if as_func then
         local substr = str:sub(s,e)
         if (#t > 0) then
            filled_rep = rep(unpack(t)) or substr
            filled_rep = rep(substr) or substr
         filled_rep = rep:gsub("%%(%d+)", 
               local i = tonumber(index)*2
               return t[i-1] or ""
      output = output..str:sub(startfrom, s-1)..filled_rep
      startfrom = e+1
      s, e, t = re:match(str, startfrom)
   return output..str:sub(startfrom)

function character_classes(m)
   -- PCRE doesn't allow nested classes, so just remove the separation between internal ones
   m = m:sub(2,-2):gsub("%[",""):gsub("]","")
   -- temporarily bury minus signs inside the class because we need to do something special with the rest in a second
   m = m:gsub("-", "\0")
   return "["..m.."]"

function balance_pattern(a, b)
   special_chars = {
      ["."]=true, ["^"]=true, ["$"]=true, ["*"]=true, ["+"]=true, ["-"]=true,
      ["?"]=true, ["("]=true, [")"]=true, ["["]=true, ["]"]=true, ["{"]=true,
      ["}"]=true, ["\\"]=true, ["|"]=true
   if special_chars[a] then
      a = "\\"..a
   if special_chars[b] then
      b = "\\"..b
   return a.."(?:[^"..a..b.."]*(?R)?)*+"..b

local class_pattern = balance_pattern("[", "]")

function lua_to_regex(match_str)
   if rex.new("([^%]|^)%f"):exec(match_str) then
      -- we can't handle the frontier pattern yet
      return nil
   meta = {
      {"(["..[[\{\}\|]].."])", [[\%1]]}, -- escape non-Lua special characters
      {"%%", "\0"}, -- temporarily bury literal %
      {"%b%?(.)%?(.)", balance_pattern}, -- Lua's balanced string operator
      {"%a", "[a-zA-Z]"},
      {"%c", "[\x00-\x1F\x7F]"},
      {"%d", "[0-9]"},
      {"%l", "[a-z]"},
      {"%p", "[!\"\\#$%&'()*+,\\-./:;<=>?@\\[\\\\\\]^_{|}~]"},
      {"%s", "[ \\t\\r\\n\\v\\f]"},
      {"%u", "[A-Z]"},
      {"%w", "[a-zA-Z0-9]"},
      {"%x", "[A-Fa-f0-9]"},
      {[[%\\]], [[\]]},
      {"%(.)", [[\%1]]},
      {"\\0", "%"}, -- resurrect literal %
      {class_pattern, character_classes}, -- PCRE doesn't allow nested classes
      {"\\-", "*?"}, -- Lua minus modifier is nongreedy *
      {"\\0", "-"} -- resurrect minus signs from inside classes
   for _, v in ipairs(meta) do
      match_str = rex.gsub(match_str, rex.new(v[1]), v[2])
   return match_str

[Go to top] top

The dates and times for posts above are shown in Universal Co-ordinated Time (UTC).

To show them in your local time you can join the forum, and then set the 'time correction' field in your profile to the number of hours difference between your location and UTC time.


It is now over 60 days since the last post. This thread is closed.     [Refresh] Refresh page

Go to topic:           Search the forum

[Go to top] top

Quick links: MUSHclient. MUSHclient help. Forum shortcuts. Posting templates. Lua modules. Lua documentation.

Information and images on this site are licensed under the Creative Commons Attribution 3.0 Australia License unless stated otherwise.


Written by Nick Gammon - 5K   profile for Nick Gammon on Stack Exchange, a network of free, community-driven Q&A sites   Marriage equality

Comments to: Gammon Software support
[RH click to get RSS URL] Forum RSS feed ( https://gammon.com.au/rss/forum.xml )

[Best viewed with any browser - 2K]    [Hosted at HostDash]