mpc/README.md

Micro Parser Combinators
========================

Version 0.9.0


About
-----

_mpc_ is a lightweight and powerful Parser Combinator library for C.

Using _mpc_ might be of interest to you if you are...

* Building a new programming language
* Building a new data format
* Parsing an existing programming language
* Parsing an existing data format
* Embedding a Domain Specific Language
* Implementing [Greenspun's Tenth Rule](http://en.wikipedia.org/wiki/Greenspun%27s_tenth_rule)


Features
--------

* Type-Generic
* Predictive, Recursive Descent
* Easy to Integrate (One Source File in ANSI C)
* Automatic Error Message Generation
* Regular Expression Parser Generator
* Language/Grammar Parser Generator


Alternatives
------------

The current main alternative for a C based parser combinator library is a branch of [Cesium3](https://github.com/wbhart/Cesium3/tree/combinators).

_mpc_ provides a number of features that this project does not offer, and also overcomes a number of potential downsides:

* _mpc_ Works for Generic Types
* _mpc_ Doesn't rely on Boehm-Demers-Weiser Garbage Collection
* _mpc_ Doesn't use `setjmp` and `longjmp` for errors
* _mpc_ Doesn't pollute the namespace


Quickstart
==========

Here is how one would use _mpc_ to create a parser for a basic mathematical expression language.

```c
mpc_parser_t *Expr  = mpc_new("expression");
mpc_parser_t *Prod  = mpc_new("product");
mpc_parser_t *Value = mpc_new("value");
mpc_parser_t *Maths = mpc_new("maths");

mpca_lang(MPCA_LANG_DEFAULT,
  " expression : <product> (('+' | '-') <product>)*; "
  " product    : <value>   (('*' | '/')   <value>)*; "
  " value      : /[0-9]+/ | '(' <expression> ')';    "
  " maths      : /^/ <expression> /$/;               ",
  Expr, Prod, Value, Maths, NULL);

mpc_result_t r;

if (mpc_parse("input", input, Maths, &r)) {
  mpc_ast_print(r.output);
  mpc_ast_delete(r.output);
} else {
  mpc_err_print(r.error);
  mpc_err_delete(r.error);
}

mpc_cleanup(4, Expr, Prod, Value, Maths);
```

If you were to set `input` to the string `(4 * 2 * 11 + 2) - 5`, the printed output would look like this.

```
>
  regex
  expression|>
    value|>
      char:1:1 '('
      expression|>
        product|>
          value|regex:1:2 '4'
          char:1:4 '*'
          value|regex:1:6 '2'
          char:1:8 '*'
          value|regex:1:10 '11'
        char:1:13 '+'
        product|value|regex:1:15 '2'
      char:1:16 ')'
    char:1:18 '-'
    product|value|regex:1:20 '5'
  regex
```

Getting Started
===============

Introduction
------------

Parser Combinators are structures that encode how to parse particular languages. They can be combined using intuitive operators to create new parsers of increasing complexity. Using these operators detailed grammars and languages can be parsed and processed in a quick, efficient, and easy way.

解析器组合器是对如何解析特定语言进行编码的结构。它们可以使用直观的运算符组合在一起，以创建越来越复杂的新解析器。使用这些运算符，可以快速、高效、简单地解析和处理详细的语法和语言。

The trick behind Parser Combinators is the observation that by structuring the library in a particular way, one can make building parser combinators look like writing a grammar itself. Therefore instead of describing _how to parse a language_, a user must only specify _the language itself_, and the library will work out how to parse it ... as if by magic!

Parser Combinators背后的诀窍是观察到，通过以特定的方式构建库，可以使构建解析器组合子看起来像编写语法本身。因此，用户只需指定语言本身，而不是描述如何解析语言，库将计算出如何解析它。。。仿佛施了魔法！

_mpc_ can be used in this mode, or, as shown in the above example, you can specify the grammar directly as a string or in a file.

_mpc可以在这种模式下使用，或者，如上例所示，您可以直接将语法指定为字符串或文件。

Basic Parsers
-------------

### String Parsers

All the following functions construct new basic parsers of the type `mpc_parser_t *`. All of those parsers return a newly allocated `char *` with the character(s) they manage to match. If unsuccessful they will return an error. They have the following functionality.

以下所有函数都构造了类型为“mpc_parser_t*”的新基本解析器。所有这些解析器都返回一个新分配的“char*”，其中包含它们设法匹配的字符。如果失败，他们将返回错误。它们具有以下功能。

* * *

```c
mpc_parser_t *mpc_any(void);
```

Matches any individual character

匹配任何单个字符

* * *

```c
mpc_parser_t *mpc_char(char c);
```

Matches a single given character `c`

匹配单个给定字符`c`

* * *

```c
mpc_parser_t *mpc_range(char s, char e);
```

Matches any single given character in the range `s` to `e` (inclusive)

匹配范围`s`到`e`（含）内的任何单个给定字符

* * *

```c
mpc_parser_t *mpc_oneof(const char *s);
```

Matches any single given character in the string  `s`

匹配字符串中的任何单个给定字符`

* * *

```c
mpc_parser_t *mpc_noneof(const char *s);
```

Matches any single given character not in the string `s`

匹配不在字符串中的任何单个给定字符`

* * *

```c
mpc_parser_t *mpc_satisfy(int(*f)(char));
```

Matches any single given character satisfying function `f`

匹配满足函数`f`的任何单个给定字符

* * *

```c
mpc_parser_t *mpc_string(const char *s);
```

Matches exactly the string `s`

完全匹配字符串`s`


### Other Parsers

Several other functions exist that construct parsers with some other special functionality.

还有其他几个函数可以构造具有其他特殊功能的解析器。

* * *

```c
mpc_parser_t *mpc_pass(void);
```

Consumes no input, always successful, returns `NULL`

不消耗输入，总是成功，返回`NULL`

* * *

```c
mpc_parser_t *mpc_fail(const char *m);
mpc_parser_t *mpc_failf(const char *fmt, ...);
```

Consumes no input, always fails with message `m` or formatted string `fmt`.

不消耗任何输入，总是以消息`m`或格式化字符串`fmt`失败。

* * *

```c
mpc_parser_t *mpc_lift(mpc_ctor_t f);
```

Consumes no input, always successful, returns the result of function `f`

不消耗输入，总是成功，返回函数`f`的结果

* * *

```c
mpc_parser_t *mpc_lift_val(mpc_val_t *x);
```

Consumes no input, always successful, returns `x`

不消耗输入，总是成功，返回`x`

* * *

```c
mpc_parser_t *mpc_state(void);
```

Consumes no input, always successful, returns a copy of the parser state as a `mpc_state_t *`. This state is newly allocated and so needs to be released with `free` when finished with.

不消耗任何输入，总是成功的，返回解析器状态的副本作为`mpc_state_t*`。此状态是新分配的，因此在完成时需要用“free”释放。

* * *

```c
mpc_parser_t *mpc_anchor(int(*f)(char,char));
```

Consumes no input. Successful when function `f` returns true. Always returns `NULL`.

不消耗任何输入。当函数`f`返回true时成功。始终返回`NULL`。

Function `f` is a _anchor_ function. It takes as input the last character parsed, and the next character in the input, and returns success or failure. This function can be set by the user to ensure some condition is met. For example to test that the input is at a boundary between words and non-words.

函数`f`是一个_anchor_函数。它将解析的最后一个字符和输入中的下一个字符作为输入，并返回成功或失败。此功能可由用户设置，以确保满足某些条件。例如，测试输入是否位于单词和非单词之间的边界。

At the start of the input the first argument is set to `'\0'`. At the end of the input the second argument is set to `'\0'`.

在输入开始时，第一个参数设置为`'\0'`。在输入的末尾，第二个参数被设置为`'\0'`。


Parsing
-------

Once you've build a parser, you can run it on some input using one of the following functions. These functions return `1` on success and `0` on failure. They output either the result, or an error to a `mpc_result_t` variable. This type is defined as follows.

构建解析器后，您可以使用以下函数之一在某些输入上运行它。这些函数成功时返回`1`，失败时返回`0`。它们将结果或错误输出到`mpc_result_t`变量。这种类型定义如下。

```c
typedef union {
  mpc_err_t *error;
  mpc_val_t *output;
} mpc_result_t;
```

where `mpc_val_t *` is synonymous with `void *` and simply represents some pointer to data - the exact type of which is dependant on the parser.

其中`mpc_val_t*`与`void*`同义，只是表示指向数据的某个指针，其确切类型取决于解析器。


* * *

```c
int mpc_parse(const char *filename, const char *string, mpc_parser_t *p, mpc_result_t *r);
```

Run a parser on some string.

对某个字符串运行解析器。

* * *

```c
int mpc_parse_file(const char *filename, FILE *file, mpc_parser_t *p, mpc_result_t *r);
```

Run a parser on some file.

对某个文件运行解析器。

* * *

```c
int mpc_parse_pipe(const char *filename, FILE *pipe, mpc_parser_t *p, mpc_result_t *r);
```

Run a parser on some pipe (such as `stdin`).

在某个管道（如`stdin`）上运行解析器。

* * *

```c
int mpc_parse_contents(const char *filename, mpc_parser_t *p, mpc_result_t *r);
```

Run a parser on the contents of some file.

对某个文件的内容运行解析器。


Combinators
-----------

Combinators are functions that take one or more parsers and return a new parser of some given functionality.

组合器是接受一个或多个解析器并返回某个给定功能的新解析器的函数。

These combinators work independently of exactly what data type the parser(s) supplied as input return. In languages such as Haskell ensuring you don't input one type of data into a parser requiring a different type is done by the compiler. But in C we don't have that luxury. So it is at the discretion of the programmer to ensure that he or she deals correctly with the outputs of different parser types.

这些组合子的工作与解析器作为输入返回提供的数据类型完全无关。在Haskell等语言中，编译器会确保不将一种类型的数据输入到需要不同类型的解析器中。但在C中，我们没有这种奢侈。因此，程序员可以自行决定是否正确处理不同解析器类型的输出。

A second annoyance in C is that of manual memory management. Some parsers might get half-way and then fail. This means they need to clean up any partial result that has been collected in the parse. In Haskell this is handled by the Garbage Collector, but in C these combinators will need to take _destructor_ functions as input, which say how clean up any partial data that has been collected.

C中的第二个烦恼是手动内存管理。一些解析器可能会中途失败。这意味着他们需要清理在解析中收集到的任何部分结果。在Haskell中，这是由垃圾收集器处理的，但在C中，这些组合子需要将_destructor_函数作为输入，该函数表示如何清理已收集的任何部分数据。

Here are the main combinators and how to use then.

以下是主要的组合子及其使用方法。

* * *

```c
mpc_parser_t *mpc_expect(mpc_parser_t *a, const char *e);
mpc_parser_t *mpc_expectf(mpc_parser_t *a, const char *fmt, ...);
```

Returns a parser that runs `a`, and on success returns the result of `a`, while on failure reports that `e` was expected.

返回一个运行`a`的解析器，成功时返回`a`结果，失败时返回预期的`e`结果。

* * *

```c
mpc_parser_t *mpc_apply(mpc_parser_t *a, mpc_apply_t f);
mpc_parser_t *mpc_apply_to(mpc_parser_t *a, mpc_apply_to_t f, void *x);
```

Returns a parser that applies function `f` (optionality taking extra input `x`) to the result of parser `a`.

返回一个解析器，该解析器将函数`f`（可选性接受额外的输入`x`）应用于解析器`a`的结果。

* * *

```c
mpc_parser_t *mpc_check(mpc_parser_t *a, mpc_dtor_t da, mpc_check_t f, const char *e);
mpc_parser_t *mpc_check_with(mpc_parser_t *a, mpc_dtor_t da, mpc_check_with_t f, void *x, const char *e);
mpc_parser_t *mpc_checkf(mpc_parser_t *a, mpc_dtor_t da, mpc_check_t f, const char *fmt, ...);
mpc_parser_t *mpc_check_withf(mpc_parser_t *a, mpc_dtor_t da, mpc_check_with_t f, void *x, const char *fmt, ...);
```

Returns a parser that applies function `f` (optionally taking extra input `x`) to the result of parser `a`. If `f` returns non-zero, then the parser succeeds and returns the value of `a` (possibly modified by `f`). If `f` returns zero, then the parser fails with message `e`, and the result of `a` is destroyed with the destructor `da`.

返回一个解析器，该解析器将函数`f`（可选地接受额外的输入`x`）应用于解析器`a`的结果。如果`f`返回非零，则解析器成功并返回`a`的值（可能被`f`修改）。如果`f`返回零，那么解析器将失败，并返回消息`e`，`a`的结果将被析构函数`da`销毁。

* * *

```c
mpc_parser_t *mpc_not(mpc_parser_t *a, mpc_dtor_t da);
mpc_parser_t *mpc_not_lift(mpc_parser_t *a, mpc_dtor_t da, mpc_ctor_t lf);
```

Returns a parser with the following behaviour. If parser `a` succeeds, then it fails and consumes no input. If parser `a` fails, then it succeeds, consumes no input and returns `NULL` (or the result of lift function `lf`). Destructor `da` is used to destroy the result of `a` on success.

返回具有以下行为的解析器。如果解析器`a`成功，则它失败并且不消耗任何输入。如果解析器`a`失败，则它成功，不消耗任何输入并返回`NULL`（或提升函数`lf`的结果）。析构函数`da`用于析构`a`对成功的结果。

* * *

```c
mpc_parser_t *mpc_maybe(mpc_parser_t *a);
mpc_parser_t *mpc_maybe_lift(mpc_parser_t *a, mpc_ctor_t lf);
```

Returns a parser that runs `a`. If `a` is successful then it returns the result of `a`. If `a` is unsuccessful then it succeeds, but returns `NULL` (or the result of `lf`).

返回一个运行`a`的解析器。如果`a`成功，则返回`a`的结果。如果`a`不成功，则成功，但返回`NULL`（或`lf`的结果）。

* * *

```c
mpc_parser_t *mpc_many(mpc_fold_t f, mpc_parser_t *a);
```

Runs `a` zero or more times until it fails. Results are combined using fold function `f`. See the _Function Types_ section for more details.

运行`a`零次或多次，直到失败。使用折叠函数`f`组合结果。有关更多详细信息，请参阅_Function Types_部分。

* * *

```c
mpc_parser_t *mpc_many1(mpc_fold_t f, mpc_parser_t *a);
```

Runs `a` one or more times until it fails. Results are combined with fold function `f`.

运行`a`一次或多次，直到失败。结果与折叠函数`f`相结合。

* * *

```c
mpc_parser_t *mpc_sepby1(mpc_fold_t f, mpc_parser_t *sep, mpc_parser_t *a);
```

Runs `a` one or more times, separated by `sep`. Results are combined with fold function `f`.

运行`a`一次或多次，用`sep`分隔。结果与折叠函数`f`相结合。

* * *

```c
mpc_parser_t *mpc_count(int n, mpc_fold_t f, mpc_parser_t *a, mpc_dtor_t da);
```

Runs `a` exactly `n` times. If this fails, any partial results are destructed with `da`. If successful results of `a` are combined using fold function `f`.

运行`a`正好`n`次。如果失败，任何部分结果都将用`da`销毁。如果成功使用折叠函数`f`组合`a`。

* * *

```c
mpc_parser_t *mpc_or(int n, ...);
```

Attempts to run `n` parsers in sequence, returning the first one that succeeds. If all fail, returns an error.

尝试按顺序运行`n`个解析器，返回第一个成功的解析器。如果全部失败，则返回错误。

* * *

```c
mpc_parser_t *mpc_and(int n, mpc_fold_t f, ...);
```

Attempts to run `n` parsers in sequence, returning the fold of the results using fold function `f`. First parsers must be specified, followed by destructors for each parser, excluding the final parser. These are used in case of partial success. For example: `mpc_and(3, mpcf_strfold, mpc_char('a'), mpc_char('b'), mpc_char('c'), free, free);` would attempt to match `'a'` followed by `'b'` followed by `'c'`, and if successful would concatenate them using `mpcf_strfold`. Otherwise would use `free` on the partial results.

尝试按顺序运行`n`个解析器，使用fold函数`f`返回结果的倍数。必须指定第一个解析器，然后为每个解析器指定析构函数，不包括最后一个解析器。这些用于部分成功的情况。例如：`mpc_and（3，mpcf_strfold，mpc_char（'a'），mpc_char（'b'），mpc_char（'c'），free，free）；`将尝试匹配`'a'`、`'b'`和`'c'`，如果成功，将使用`mpcf_strfold`将它们连接起来。否则，将在部分结果上使用`free`。

* * *

```c
mpc_parser_t *mpc_predictive(mpc_parser_t *a);
```

Returns a parser that runs `a` with backtracking disabled. This means if `a` consumes more than one character, it will not be reverted, even on failure. Turning backtracking off has good performance benefits for grammars which are `LL(1)`. These are grammars where the first character completely determines the parse result - such as the decision of parsing either a C identifier, number, or string literal. This option should not be used for non `LL(1)` grammars or it will produce incorrect results or crash the parser.

返回一个在禁用回溯的情况下运行`a`的解析器。这意味着，如果`a`消耗了多个字符，即使失败，它也不会被还原。对于`LL（1）`语法，关闭回溯具有良好的性能优势。这些语法中，第一个字符完全决定了解析结果，例如解析C标识符、数字或字符串文字的决定。此选项不应用于非`LL（1）`语法，否则会产生不正确的结果或使解析器崩溃。

Another way to think of `mpc_predictive` is that it can be applied to a parser (for a performance improvement) if either successfully parsing the first character will result in a completely successful parse, or all of the referenced sub-parsers are also `LL(1)`.

另一种理解`mpc_cpredictive`的方法是，如果成功解析第一个字符将导致完全成功的解析，或者所有引用的子解析器都是`LL（1）`，则可以将其应用于解析器（以提高性能）。


Function Types
--------------

The combinator functions take a number of special function types as function pointers. Here is a short explanation of those types are how they are expected to behave. It is important that these behave correctly otherwise it is easy to introduce memory leaks or crashes into the system.

组合函数接受许多特殊的函数类型作为函数指针。以下是对这些类型的简要解释，即它们应该如何表现。重要的是，这些行为必须正确，否则很容易在系统中引入内存泄漏或崩溃。

* * *

```c
typedef void(*mpc_dtor_t)(mpc_val_t*);
```

Given some pointer to a data value it will ensure the memory it points to is freed correctly.

给定一个指向数据值的指针，它将确保它指向的内存被正确释放。

* * *

```c
typedef mpc_val_t*(*mpc_ctor_t)(void);
```

Returns some data value when called. It can be used to create _empty_ versions of data types when certain combinators have no known default value to return. For example it may be used to return a newly allocated empty string.

调用时返回一些数据值。当某些组合子没有已知的默认值可返回时，它可用于创建数据类型的_empty_版本。例如，它可用于返回新分配的空字符串。

* * *

```c
typedef mpc_val_t*(*mpc_apply_t)(mpc_val_t*);
typedef mpc_val_t*(*mpc_apply_to_t)(mpc_val_t*,void*);
```

This takes in some pointer to data and outputs some new or modified pointer to data, ensuring to free the input data if it is no longer used. The `apply_to` variation takes in an extra pointer to some data such as global state.

这会接收一些指向数据的指针，并输出一些新的或修改过的指向数据的指示器，确保在不再使用输入数据时释放它。`apply_to`变量引入了一个指向全局状态等数据的额外指针。

* * *

```c
typedef int(*mpc_check_t)(mpc_val_t**);
typedef int(*mpc_check_with_t)(mpc_val_t**,void*);
```

This takes in some pointer to data and outputs 0 if parsing should stop with an error. Additionally, this may change or free the input data. The `check_with` variation takes in an extra pointer to some data such as global state.

这会接收一些指向数据的指针，如果解析因错误而停止，则输出0。此外，这可能会更改或释放输入数据。`check_with`变量引入了一个指向某些数据（如全局状态）的额外指针。

* * *

```c
typedef mpc_val_t*(*mpc_fold_t)(int,mpc_val_t**);
```

This takes a list of pointers to data values and must return some combined or folded version of these data values. It must ensure to free any input data that is no longer used once the combination has taken place.

这需要一个指向数据值的指针列表，并且必须返回这些数据值的组合或折叠版本。它必须确保在组合发生后释放不再使用的任何输入数据。


Case Study - Identifier
=======================

Combinator Method
-----------------

Using the above combinators we can create a parser that matches a C identifier.

使用上述组合子，我们可以创建一个与C标识符匹配的解析器。

When using the combinators we need to supply a function that says how to combine two `char *`.

当使用组合子时，我们需要提供一个函数，说明如何组合两个`char*`。

For this we build a fold function that will concatenate zero or more strings together. For this sake of this tutorial we will write it by hand, but this (as well as many other useful fold functions), are actually included in _mpc_ under the `mpcf_*` namespace, such as `mpcf_strfold`.

为此，我们构建了一个fold函数，将零个或多个字符串连接在一起。为了本教程的目的，我们将手工编写它，但这个（以及许多其他有用的折叠函数）实际上包含在“mpcf_*”命名空间下的_mpc_中，例如“mpcf_strfold”。

```c
mpc_val_t *strfold(int n, mpc_val_t **xs) {
  char *x = calloc(1, 1);
  int i;
  for (i = 0; i < n; i++) {
    x = realloc(x, strlen(x) + strlen(xs[i]) + 1);
    strcat(x, xs[i]);
    free(xs[i]);
  }
  return x;
}
```

We can use this to specify a C identifier, making use of some combinators to say how the basic parsers are combined.

我们可以用它来指定一个C标识符，利用一些组合子来说明基本解析器是如何组合的。

```c
mpc_parser_t *alpha = mpc_or(2, mpc_range('a', 'z'), mpc_range('A', 'Z'));
mpc_parser_t *digit = mpc_range('0', '9');
mpc_parser_t *underscore = mpc_char('_');

mpc_parser_t *ident = mpc_and(2, strfold,
  mpc_or(2, alpha, underscore),
  mpc_many(strfold, mpc_or(3, alpha, digit, underscore)),
  free);

/* Do Some Parsing... */

mpc_delete(ident);
```

Notice that previous parsers are used as input to new parsers we construct from the combinators. Note that only the final parser `ident` must be deleted. When we input a parser into a combinator we should consider it to be part of the output of that combinator.

请注意，之前的解析器被用作我们从组合子构造的新解析器的输入。请注意，只有最后一个解析器`ident`必须删除。当我们将解析器输入组合子时，我们应该将其视为该组合子输出的一部分。

Because of this we shouldn't create a parser and input it into multiple places, or it will be doubly freed.

因此，我们不应该创建解析器并将其输入到多个位置，否则它将被双重释放。

Regex Method
------------

There is an easier way to do this than the above method. _mpc_ comes with a handy regex function for constructing parsers using regex syntax. We can specify an identifier using a regex pattern as shown below.

有一种比上述方法更简单的方法_mpc附带了一个方便的正则表达式函数，用于使用正则表达式语法构建解析器。我们可以使用正则表达式模式指定标识符，如下所示。

```c
mpc_parser_t *ident = mpc_re("[a-zA-Z_][a-zA-Z_0-9]*");

/* Do Some Parsing... */

mpc_delete(ident);
```


Library Method
--------------

Although if we really wanted to create a parser for C identifiers, a function for creating this parser comes included in _mpc_ along with many other common parsers.

虽然如果我们真的想为C标识符创建一个解析器，但_mpc_中包含了创建此解析器的函数以及许多其他常见的解析器。

```c
mpc_parser_t *ident = mpc_ident();

/* Do Some Parsing... */

mpc_delete(ident);
```

Parser References
=================

Building parsers in the above way can have issues with self-reference or cyclic-reference. To overcome this we can separate the construction of parsers into two different steps. Construction and Definition.

以上述方式构建解析器可能会出现自引用或循环引用的问题。为了克服这个问题，我们可以将解析器的构建分为两个不同的步骤。构造和定义。

* * *

```c
mpc_parser_t *mpc_new(const char *name);
```

This will construct a parser called `name` which can then be used as input to others, including itself, without fear of being deleted. Any parser created using `mpc_new` is said to be _retained_. This means it will behave differently to a normal parser when referenced. When deleting a parser that includes a _retained_ parser, the _retained_ parser will not be deleted along with it. To delete a retained parser `mpc_delete` must be used on it directly.

这将构造一个名为`name`的解析器，然后可以将其用作其他人的输入，包括它自己，而不必担心被删除。任何使用`mpc_new`创建的解析器都被称为_retaind_。这意味着当被引用时，它的行为将与普通解析器不同。删除包含_retained_解析器的解析器时，_retained_ 解析器不会与其一起删除。要删除保留的解析器，必须直接对其使用 `mpc_delete`。

A _retained_ parser can then be _defined_ using...

然后，可以使用…对_retaind_解析器进行_defined_。。。

* * *

```c
mpc_parser_t *mpc_define(mpc_parser_t *p, mpc_parser_t *a);
```

This assigns the contents of parser `a` to `p`, and deletes `a`. With this technique parsers can now reference each other, as well as themselves, without trouble.

这将解析器`a`的内容分配给`p`，并删除`a`。通过这种技术，解析器现在可以毫无困难地相互引用，也可以引用自己。

* * *

```c
mpc_parser_t *mpc_undefine(mpc_parser_t *p);
```

A final step is required. Parsers that reference each other must all be undefined before they are deleted. It is important to do any undefining before deletion. The reason for this is that to delete a parser it must look at each sub-parser that is used by it. If any of these have already been deleted a segfault is unavoidable - even if they were retained beforehand.

需要最后一步。相互引用的解析器在删除之前必须全部未定义。在删除之前进行任何未定义的操作都很重要。这样做的原因是，要删除一个解析器，它必须查看它使用的每个子解析器。如果其中任何一个子解析器已经被删除，那么segfault是不可避免的，即使它们事先被保留了。

* * *

```c
void mpc_cleanup(int n, ...);
```

To ease the task of undefining and then deleting parsers `mpc_cleanup` can be used. It takes `n` parsers as input, and undefines them all, before deleting them all.

为了简化定义和删除解析器的任务，可以使用`mpc_cleap`。它接收`n`个解析器作为输入，并在删除它们之前取消定义它们。

* * *

```c
mpc_parser_t *mpc_copy(mpc_parser_t *a);
```

This function makes a copy of a parser `a`. This can be useful when you want to use a parser as input for some other parsers multiple times without retaining it.

此函数生成解析器`a`的副本。当您想多次使用解析器作为其他解析器的输入而不保留它时，这可能很有用。

* * *

```c
mpc_parser_t *mpc_re(const char *re);
mpc_parser_t *mpc_re_mode(const char *re, int mode);
```

This function takes as input the regular expression `re` and builds a parser for it. With the `mpc_re_mode` function optional mode flags can also be given.

此函数将正则表达式`re`作为输入，并为其构建解析器。使用`mpc_re_mode`函数，还可以给出可选的模式标志。

Available flags are `MPC_RE_MULTILINE` / `MPC_RE_M` where the start of input character `^` also matches the beginning of new lines and the end of input `$` character also matches new lines, and `MPC_RE_DOTALL` / `MPC_RE_S` where the any character token `.` also matches newlines (by default it doesn't).

可用的标志是`MPC_RE_MULTILINE` / `MPC_RE_M`，其中输入字符`^`的开头也与新行的开头匹配，输入字符`$`的结尾也与新行匹配，以及`MPC_RE_DOTALL` / `MPC_RE_S`，其中包含任何字符标记`.`也匹配换行符（默认情况下不匹配）。


Library Reference 库参考
=================

Common Parsers 常见解析器
--------------


<table>

  <tr><td><code>mpc_soi</code></td><td>Matches only the start of input, returns <code>NULL</code></td></tr>
  <tr><td></td><td>仅匹配输入的开头,返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_eoi</code></td><td>Matches only the end of input, returns <code>NULL</code></td></tr>
  <tr><td></td><td>仅匹配输入的结尾,返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_boundary</code></td><td>Matches only the boundary between words, returns <code>NULL</code></td></tr>
  <tr><td></td><td>仅匹配单词之间的边界,返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_boundary_newline</code></td><td>Matches the start of a new line, returns <code>NULL</code></td></tr>
  <tr><td></td><td>匹配新行的开头,返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_whitespace</code></td><td>Matches any whitespace character <code>" \f\n\r\t\v"</code></td></tr>
  <tr><td></td><td>匹配任意的空白字符<code>" \f\n\r\t\v"</code></td></tr>
  <tr><td><code>mpc_whitespaces</code></td><td>Matches zero or more whitespace characters</td></tr>
  <tr><td></td><td>匹配零个或多个空白字符</td></tr>
  <tr><td><code>mpc_blank</code></td><td>Matches whitespaces and frees the result, returns <code>NULL</code></td></tr>
  <tr><td></td><td>匹配并释空白字符放结果,返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_newline</code></td><td>Matches <code>'\n'</code></td></tr>
  <tr><td></td><td>匹配 <code>'\n'</code></td></tr>
  <tr><td><code>mpc_tab</code></td><td>Matches <code>'\t'</code></td></tr>
  <tr><td></td><td>匹配 <code>'\t'</code></td></tr>
  <tr><td><code>mpc_escape</code></td><td>Matches a backslash followed by any character</td></tr>
  <tr><td></td><td>匹配反斜杠后跟任何字符</td></tr>
  <tr><td><code>mpc_digit</code></td><td>Matches any character in the range <code>'0'</code> - <code>'9'</code></td></tr>
  <tr><td></td><td>匹配 <code>'0'</code> - <code>'9'</code>之间的任意字符</td></tr>
  <tr><td><code>mpc_hexdigit</code></td><td>Matches any character in the range <code>'0</code> - <code>'9'</code> as well as <code>'A'</code> - <code>'F'</code> and <code>'a'</code> - <code>'f'</code></td></tr>
  <tr><td></td><td>匹配16进制的字符</td></tr>
  <tr><td><code>mpc_octdigit</code></td><td>Matches any character in the range <code>'0'</code> - <code>'7'</code></td></tr>
  <tr><td></td><td>匹配8进制的字符</td></tr>
  <tr><td><code>mpc_digits</code></td><td>Matches one or more digit</td></tr>
  <tr><td></td><td>匹配1个或多个数字</td></tr>
  <tr><td><code>mpc_hexdigits</code></td><td>Matches one or more hexdigit</td></tr>
  <tr><td></td><td>匹配1个或多个16进制数字</td></tr>
  <tr><td><code>mpc_octdigits</code></td><td>Matches one or more octdigit</td></tr>
  <tr><td></td><td>匹配1个或多个8进制数字</td></tr>
  <tr><td><code>mpc_lower</code></td><td>Matches any lower case character</td></tr>
  <tr><td></td><td>匹配任意小写字符</td></tr>
  <tr><td><code>mpc_upper</code></td><td>Matches any upper case character</td></tr>
  <tr><td></td><td>匹配任意大写字符</td></tr>
  <tr><td><code>mpc_alpha</code></td><td>Matches any alphabet character</td></tr>
  <tr><td></td><td>匹配任意字母表字符</td></tr>
  <tr><td><code>mpc_underscore</code></td><td>Matches <code>'_'</code></td></tr>
  <tr><td></td><td>匹配 <code>'_'</code></td></tr>
  <tr><td><code>mpc_alphanum</code></td><td>Matches any alphabet character, underscore or digit</td></tr>
  <tr><td></td><td>匹配字母表 <code>'_'</code> 和数字</td></tr>
  <tr><td><code>mpc_int</code></td><td>Matches digits and returns an <code>int*</code></td></tr>
  <tr><td></td><td>匹配数字,返回 <code>int*</code></td></tr>
  <tr><td><code>mpc_hex</code></td><td>Matches hexdigits and returns an <code>int*</code></td></tr>
  <tr><td></td><td>匹配16进制,返回 <code>int*</code></td></tr>
  <tr><td><code>mpc_oct</code></td><td>Matches octdigits and returns an <code>int*</code></td></tr>
  <tr><td></td><td>匹配8进制,返回 <code>int*</code></td></tr>
  <tr><td><code>mpc_number</code></td><td>Matches <code>mpc_int</code>, <code>mpc_hex</code> or <code>mpc_oct</code></td></tr>
  <tr><td></td><td>匹配 <code>mpc_int</code>, <code>mpc_hex</code> or <code>mpc_oct</code></td></tr>
  <tr><td><code>mpc_real</code></td><td>Matches some floating point number as a string</td></tr>
  <tr><td></td><td>将某个浮点数匹配为字符串</td></tr>
  <tr><td><code>mpc_float</code></td><td>Matches some floating point number and returns a <code>float*</code></td></tr>
  <tr><td></td><td>匹配浮点数,返回 <code>float*</code></td></tr>
  <tr><td><code>mpc_char_lit</code></td><td>Matches some character literal surrounded by <code>'</code></td></tr>
  <tr><td></td><td>匹配由 <code>'</code>包围的字符 </td></tr>
  <tr><td><code>mpc_string_lit</code></td><td>Matches some string literal surrounded by <code>"</code></td></tr>
  <tr><td></td><td>匹配由 <code>"</code>包围的字符串</td></tr>
  <tr><td><code>mpc_regex_lit</code></td><td>Matches some regex literal surrounded by <code>/</code></td></tr>
  <tr><td></td><td>匹配一些被<code>/</code>包围的正则表达式文字 </td></tr>
  <tr><td><code>mpc_ident</code></td><td>Matches a C style identifier</td></tr>
  <tr><td></td><td>匹配C样式标识符</td></tr>

</table>


Useful Parsers 有用的解析器
--------------

<table>

  <tr><td><code>mpc_startswith(mpc_parser_t *a);</code></td><td>Matches the start of input followed by <code>a</code></td></tr>
  <tr><td></td><td>匹配开头</td></tr>
  <tr><td><code>mpc_endswith(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches <code>a</code> followed by the end of input</td></tr>
  <tr><td></td><td>匹配结尾</td></tr>
  <tr><td><code>mpc_whole(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches the start of input, <code>a</code>, and the end of input</td></tr>
  <tr><td></td><td>匹配开头和结尾</td></tr>
  <tr><td><code>mpc_stripl(mpc_parser_t *a);</code></td><td>Matches <code>a</code> first consuming any whitespace to the left</td></tr>
  <tr><td></td><td>删除左边的空白字符</td></tr>
  <tr><td><code>mpc_stripr(mpc_parser_t *a);</code></td><td>Matches <code>a</code> then consumes any whitespace to the right</td></tr>
  <tr><td></td><td>删除右边的空白字符</td></tr>
  <tr><td><code>mpc_strip(mpc_parser_t *a);</code></td><td>Matches <code>a</code> consuming any surrounding whitespace</td></tr>
  <tr><td></td><td>删除周围的空白字符</td></tr>
  <tr><td><code>mpc_tok(mpc_parser_t *a);</code></td><td>Matches <code>a</code> and consumes any trailing whitespace</td></tr>
  <tr><td></td><td>删除尾随的空白字符</td></tr>
  <tr><td><code>mpc_sym(const char *s);</code></td><td>Matches string <code>s</code> and consumes any trailing whitespace</td></tr>
  <tr><td></td><td>匹配字符串，然后删除尾随的空白字符</td></tr>
  <tr><td><code>mpc_total(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches the whitespace consumed <code>a</code>, enclosed in the start and end of input</td></tr>
  <tr><td></td><td>匹配空白字符删除 <code>a</code>, 包含在输入的开头和结尾</td></tr>
  <tr><td><code>mpc_between(mpc_parser_t *a, mpc_dtor_t ad, <br /> const char *o, const char *c);</code></td><td> Matches <code>a</code> between strings <code>o</code> and <code>c</code></td></tr>
  <tr><td></td><td>匹配 <code>a</code> 在字符串 <code>o</code> 和 <code>c</code>之间</td></tr>
  <tr><td><code>mpc_parens(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"("</code> and <code>")"</code></td></tr>
  <tr><td></td><td>匹配小括号</td></tr>
  <tr><td><code>mpc_braces(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"<"</code> and <code>">"</code></td></tr>
  <tr><td></td><td>匹配尖括号</td></tr>
  <tr><td><code>mpc_brackets(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"{"</code> and <code>"}"</code></td></tr>
  <tr><td></td><td>匹配大括号</td></tr>
  <tr><td><code>mpc_squares(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"["</code> and <code>"]"</code></td></tr>
  <tr><td></td><td>匹配中括号</td></tr>
  <tr><td><code>mpc_tok_between(mpc_parser_t *a, mpc_dtor_t ad, <br /> const char *o, const char *c);</code></td><td>Matches <code>a</code> between <code>o</code> and <code>c</code>, where <code>o</code> and <code>c</code> have their trailing whitespace striped.</td></tr>
  <tr><td></td><td>匹配 <code>o</code>和 <code>c</code>之间的 <code>a</code>，其中 <code>o</code>和 <code>c</code>的尾部空格被剥离。</td></tr>
  <tr><td><code>mpc_tok_parens(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"("</code> and <code>")"</code></td></tr>
  <tr><td></td><td>匹配<code>"("</code> 和 <code>")"</code>之间的尾随空格</td></tr>
  <tr><td><code>mpc_tok_braces(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"<"</code> and <code>">"</code></td></tr>
  <tr><td></td><td>匹配<code>"<>"</code> 和 <code>">"</code>之间的尾随空格</td></tr>
  <tr><td><code>mpc_tok_brackets(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"{"</code> and <code>"}"</code></td></tr>
  <tr><td></td><td>匹配<code>"{"</code> 和 <code>"}"</code>之间的尾随空格</td></tr>
  <tr><td><code>mpc_tok_squares(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"["</code> and <code>"]"</code></td></tr>
  <tr><td></td><td>匹配<code>"["</code> 和 <code>"]"</code>之间的尾随空格</td></tr>

</table>


Apply Functions 应用函数
---------------

<table>

  <tr><td><code>void mpcf_dtor_null(mpc_val_t *x);</code></td><td>Empty destructor. Does nothing</td></tr>
  <tr><td></td><td>空白析构函数 什么都不做</td></tr>
  <tr><td><code>mpc_val_t *mpcf_ctor_null(void);</code></td><td>Returns <code>NULL</code></td></tr>
  <tr><td></td><td>返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_ctor_str(void);</code></td><td>Returns <code>""</code></td></tr>
  <tr><td></td><td>返回 <code>""</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_free(mpc_val_t *x);</code></td><td>Frees <code>x</code> and returns <code>NULL</code></td></tr>
  <tr><td></td><td>释放 <code>x</code> 然后返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_int(mpc_val_t *x);</code></td><td>Converts a decimal string <code>x</code> to an <code>int*</code></td></tr>
  <tr><td></td><td>转换10进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_hex(mpc_val_t *x);</code></td><td>Converts a hex string <code>x</code> to an <code>int*</code></td></tr>
  <tr><td></td><td>转换16进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_oct(mpc_val_t *x);</code></td><td>Converts a oct string <code>x</code> to an <code>int*</code></td></tr>
  <tr><td></td><td>转换8进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_float(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to a <code>float*</code></td></tr>
  <tr><td></td><td>转换字符串 <code>x</code> 为 <code>float*</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_escape(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to an escaped version</td></tr>
  <tr><td></td><td>转换字符串 <code>x</code> 为转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_escape_regex(mpc_val_t *x);</code></td><td>Converts a regex <code>x</code> to an escaped version</td></tr>
  <tr><td></td><td>转换正则表达式<code>x</code> 为转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_escape_string_raw(mpc_val_t *x);</code></td><td>Converts a raw string <code>x</code> to an escaped version</td></tr>
  <tr><td></td><td>转换原始字符串 <code>x</code> 为转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_escape_char_raw(mpc_val_t *x);</code></td><td>Converts a raw character <code>x</code> to an escaped version</td></tr>
  <tr><td></td><td>转换原始字符 <code>x</code> 为转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_unescape(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to an unescaped version</td></tr>
  <tr><td></td><td>转换字符串 <code>x</code> 为未转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_unescape_regex(mpc_val_t *x);</code></td><td>Converts a regex <code>x</code> to an unescaped version</td></tr>
  <tr><td></td><td>转换正则表达式 <code>x</code> 为未转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_unescape_string_raw(mpc_val_t *x);</code></td><td>Converts a raw string <code>x</code> to an unescaped version</td></tr>
  <tr><td></td><td>转换原始字符串 <code>x</code> 为未转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_unescape_char_raw(mpc_val_t *x);</code></td><td>Converts a raw character <code>x</code> to an unescaped version</td></tr>
  <tr><td></td><td>转换原始字符 <code>x</code> 为未转义版本</td></tr>
  <tr><td><code>mpc_val_t *mpcf_strtriml(mpc_val_t *x);</code></td><td>Trims whitespace from the left of string <code>x</code></td></tr>
  <tr><td></td><td>修剪字符串左侧的空白</td></tr>
  <tr><td><code>mpc_val_t *mpcf_strtrimr(mpc_val_t *x);</code></td><td>Trims whitespace from the right of string <code>x</code></td></tr>
  <tr><td></td><td>修剪字符串右侧的空格</td></tr>
  <tr><td><code>mpc_val_t *mpcf_strtrim(mpc_val_t *x);</code></td><td>Trims whitespace from either side of string <code>x</code></td></tr>
  <tr><td></td><td>修剪字符串两侧的空格</td></tr>
</table>


Fold Functions 折叠函数
--------------

<table>


  <tr><td><code>mpc_val_t *mpcf_null(int n, mpc_val_t** xs);</code></td><td>Returns <code>NULL</code></td></tr>
  <tr><td></td><td>返回 <code>NULL</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_fst(int n, mpc_val_t** xs);</code></td><td>Returns first element of <code>xs</code></td></tr>
  <tr><td></td><td>返回第一个元素</td></tr>
  <tr><td><code>mpc_val_t *mpcf_snd(int n, mpc_val_t** xs);</code></td><td>Returns second element of <code>xs</code></td></tr>
  <tr><td></td><td>返回第二个元素</td></tr>
  <tr><td><code>mpc_val_t *mpcf_trd(int n, mpc_val_t** xs);</code></td><td>Returns third element of <code>xs</code></td></tr>
  <tr><td></td><td>返回第三个元素</td></tr>
  <tr><td><code>mpc_val_t *mpcf_fst_free(int n, mpc_val_t** xs);</code></td><td>Returns first element of <code>xs</code> and calls <code>free</code> on others</td></tr>
  <tr><td></td><td>返回第一个元素并释放其他</td></tr>
  <tr><td><code>mpc_val_t *mpcf_snd_free(int n, mpc_val_t** xs);</code></td><td>Returns second element of <code>xs</code> and calls <code>free</code> on others</td></tr>
  <tr><td></td><td>返回第二个元素并释放其他</td></tr>
  <tr><td><code>mpc_val_t *mpcf_trd_free(int n, mpc_val_t** xs);</code></td><td>Returns third element of <code>xs</code> and calls <code>free</code> on others</td></tr>
  <tr><td></td><td>返回第三个元素并释放其他</td></tr>
  <tr><td><code>mpc_val_t *mpcf_all_free(int n, mpc_val_t** xs);</code></td><td>Calls <code>free</code> on all elements of <code>xs</code> and returns <code>NULL</code></td></tr>
  <tr><td></td><td>释放所有元素并返回<code>NULL</code></td></tr>
  <tr><td><code>mpc_val_t *mpcf_strfold(int n, mpc_val_t** xs);</code></td><td>Concatenates all <code>xs</code> together as strings and returns result </td></tr>
  <tr><td></td><td>将所有<code>xs</code>连接在一起作为字符串并返回结果</td></tr>

</table>


Case Study - Maths Language 案例研究 - 数学语言
===========================

Combinator Approach 组合方法
-------------------

Passing around all these function pointers might seem clumsy, but having parsers be type-generic is important as it lets users define their own output types for parsers. For example we could design our own syntax tree type to use. We can also use this method to do some specific house-keeping or data processing in the parsing phase.

传递所有这些函数指针可能看起来很笨拙，但让解析器具有类型泛型很重要，因为它允许用户为解析器定义自己的输出类型。例如，我们可以设计自己的语法树类型来使用。我们还可以在解析阶段使用这种方法进行一些特定的内务管理或数据处理。

As an example of this power, we can specify a simple maths grammar, that outputs `int *`, and computes the result of the expression as it goes along.

作为这种能力的一个例子，我们可以指定一个简单的数学语法，输出`int*`，并计算表达式的结果。

We start with a fold function that will fold two `int *` into a new `int *` based on some `char *` operator.

我们从一个fold函数开始，该函数将根据一些`char*`运算符将两个`int*`折叠成一个新的`int*`。

```c
mpc_val_t *fold_maths(int n, mpc_val_t **xs) {

  int **vs = (int**)xs;

  if (strcmp(xs[1], "*") == 0) { *vs[0] *= *vs[2]; }
  if (strcmp(xs[1], "/") == 0) { *vs[0] /= *vs[2]; }
  if (strcmp(xs[1], "%") == 0) { *vs[0] %= *vs[2]; }
  if (strcmp(xs[1], "+") == 0) { *vs[0] += *vs[2]; }
  if (strcmp(xs[1], "-") == 0) { *vs[0] -= *vs[2]; }

  free(xs[1]); free(xs[2]);

  return xs[0];
}
```

And then we use this to specify a basic grammar, which folds together any results.

然后我们用它来指定一个基本语法，它将任何结果折叠在一起。

```c
mpc_parser_t *Expr   = mpc_new("expr");
mpc_parser_t *Factor = mpc_new("factor");
mpc_parser_t *Term   = mpc_new("term");
mpc_parser_t *Maths  = mpc_new("maths");

mpc_define(Expr, mpc_or(2,
  mpc_and(3, fold_maths,
    Factor, mpc_oneof("+-"), Factor,
    free, free),
  Factor
));

mpc_define(Factor, mpc_or(2,
  mpc_and(3, fold_maths,
    Term, mpc_oneof("*/"), Term,
    free, free),
  Term
));

mpc_define(Term, mpc_or(2, mpc_int(), mpc_parens(Expr, free)));
mpc_define(Maths, mpc_whole(Expr, free));

/* Do Some Parsing... */

mpc_delete(Maths);
```

If we supply this function with something like `(4*2)+5`, we can expect it to output `13`.

如果我们为这个函数提供类似于`(4*2)+5`的东西，我们可以期望它输出`13`。


Language Approach 语言方法
-----------------

It is possible to avoid passing in and around all those function pointers, if you don't care what type is output by _mpc_. For this, a generic Abstract Syntax Tree type `mpc_ast_t` is included in _mpc_. The combinator functions which act on this don't need information on how to destruct or fold instances of the result as they know it will be a `mpc_ast_t`. So there are a number of combinator functions which work specifically (and only) on parsers that return this type. They reside under `mpca_*`.

如果你不在乎_mpc_输出什么类型，可以避免传入和绕过所有这些函数指针。为此，_mpc_中包含了一个通用的抽象语法树类型`mpc_ast_t`。作用于此的组合子函数不需要关于如何销毁或折叠结果实例的信息，因为它们知道这将是一个`mpc_ast_t`。因此，有许多组合子函数专门（且仅）在返回此类型的解析器上工作。它们使用`mpca_*`类型的名称。

Doing things via this method means that all the data processing must take place after the parsing. In many instances this is not an issue, or even preferable.

通过这种方法做事意味着所有的数据处理都必须在解析后进行。在许多情况下，这不是问题，甚至更可取。

It also allows for one more trick. As all the fold and destructor functions are implicit, the user can simply specify the grammar of the language in some nice way and the system can try to build a parser for the AST type from this alone. For this there are a few functions supplied which take in a string, and output a parser. The format for these grammars is simple and familiar to those who have used parser generators before. It looks something like this.

它还允许再耍一个花招。由于所有的fold和析构函数函数都是隐式的，用户可以简单地以某种好的方式指定语言的语法，系统可以尝试仅凭此为AST类型构建解析器。为此，提供了一些函数，它们接收字符串并输出解析器。这些语法的格式对于那些以前使用过解析器生成器的人来说是简单而熟悉的。它看起来像这样。

```
number "number" : /[0-9]+/ ;
expression      : <product> (('+' | '-') <product>)* ;
product         : <value>   (('*' | '/')   <value>)* ;
value           : <number> | '(' <expression> ')' ;
maths           : /^/ <expression> /$/ ;
```

The syntax for this is defined as follows.

其语法定义如下。

<table class='table'>
  <tr><td><code>"ab"</code></td><td>The string <code>ab</code> is required.</td></tr>
  <tr><td><code>'a'</code></td><td>The character <code>a</code> is required.</td></tr>
  <tr><td><code>'a' 'b'</code></td><td>First <code>'a'</code> is required, then <code>'b'</code> is required..</td></tr>
  <tr><td><code>'a' | 'b'</code></td><td>Either <code>'a'</code> is required, or <code>'b'</code> is required.</td></tr>
  <tr><td><code>'a'*</code></td><td>Zero or more <code>'a'</code> are required.</td></tr>
  <tr><td><code>'a'+</code></td><td>One or more <code>'a'</code> are required.</td></tr>
  <tr><td><code>'a'?</code></td><td>Zero or one <code>'a'</code> is required.</td></tr>
  <tr><td><code>'a'{x}</code></td><td>Exactly <code>x</code> (integer) copies of <code>'a'</code> are required.</td></tr>
  <tr><td><code>&lt;abba&gt;</code></td><td>The rule called <code>abba</code> is required.</td></tr>
</table>

Rules are specified by rule name, optionally followed by an _expected_ string, followed by a colon `:`, followed by the definition, and ending in a semicolon `;`. Multiple rules can be specified. The _rule names_ must match the names given to any parsers created by `mpc_new`, otherwise the function will crash.

规则由规则名称指定，后面可选地跟有_expected_字符串，后跟冒号`:`，后跟定义，最后以分号`;`结尾。可以指定多个规则。_rule names_必须与`mpc_new`创建的任何解析器的名称匹配，否则函数将崩溃。

The flags variable is a set of flags `MPCA_LANG_DEFAULT`, `MPCA_LANG_PREDICTIVE`, or `MPCA_LANG_WHITESPACE_SENSITIVE`. For specifying if the language is predictive or whitespace sensitive.

标志变量是一组标志`MPCA_LANG_DEFAULT`、`MPCA_LONG_PREDICTIVE`或`MPCA_ANG_WHITESPACE_SENSITIVE`。用于指定语言是预测性的还是空格敏感的。

Like with the regular expressions, this user input is parsed by existing parts of the _mpc_ library. It provides one of the more powerful features of the library.

与正则表达式一样，此用户输入由_mpc_库的现有部分解析。它是库更强大的原因之一。

* * *

```c
mpc_parser_t *mpca_grammar(int flags, const char *grammar, ...);
```

This takes in some single right hand side of a rule, as well as a list of any of the parsers referenced, and outputs a parser that does what is specified by the rule. The list of parsers referenced can be terminated with `NULL` to get an error instead of a crash when a parser required is not supplied.

这接收规则的某个右侧，以及引用的任何解析器的列表，并输出一个执行规则指定操作的解析器。当没有提供所需的解析器时，引用的解析器列表可以用`NULL`终止，以获得错误而不是崩溃。

* * *

```c
mpc_err_t *mpca_lang(int flags, const char *lang, ...);
```

This takes in a full language (zero or more rules) as well as any parsers referred to by either the right or left hand sides. Any parsers specified on the left hand side of any rule will be assigned a parser equivalent to what is specified on the right. On valid user input this returns `NULL`, while if there are any errors in the user input it will return an instance of `mpc_err_t` describing the issues. The list of parsers referenced can be terminated with `NULL` to get an error instead of a crash when a parser required is not supplied.

这需要一个完整的语言（零个或多个规则）以及右侧或左侧引用的任何解析器。在任何规则的左侧指定的任何解析器都将被分配一个与右侧指定的解析器等效的解析器。在有效的用户输入时，这将返回`NULL`，而如果用户输入中有任何错误，它将返回一个描述问题的`mpc_err_t`实例。当没有提供所需的解析器时，引用的解析器列表可以用`NULL`终止，以获得错误而不是崩溃。

* * *

```c
mpc_err_t *mpca_lang_file(int flags, FILE* f, ...);
```

This reads in the contents of file `f` and inputs it into `mpca_lang`.

这将读取文件`f`的内容并将其输入到`mpca_lang`中。

* * *

```c
mpc_err_t *mpca_lang_contents(int flags, const char *filename, ...);
```

This opens and reads in the contents of the file given by `filename` and passes it to `mpca_lang`.

这将打开并读取由`filename`给出的文件内容，并将其传递给`mpca_lang`。

Case Study - Tokenizer 案例研究 - 分词器
======================

Another common task we might be interested in doing is tokenizing some block of text (splitting the text into individual elements) and performing some function on each one of these elements as it is read. We can do this with `mpc` too.

我们可能感兴趣的另一个常见任务是标记一些文本块（将文本拆分为单个元素），并在读取时对每个元素执行一些功能。我们也可以用mpc来实现这一点。

First, we can build a regular expression which parses an individual token. For example if our tokens are identifiers, integers, commas, periods and colons we could build something like this `mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")`.

首先，我们可以构建一个解析单个令牌的正则表达式。例如，如果我们的标记是标识符、整数、逗号、句点和冒号，我们可以构建这样的`mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")`。

Next we can strip any whitespace, and add a callback function using `mpc_apply` which gets called every time this regex is parsed successfully `mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token)`.

接下来，我们可以去掉任何空格，并使用`mpc_apply`添加一个回调函数，每次成功解析此正则表达式时都会调用`mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token)`。

Finally we can surround all of this in `mpc_many` to parse it zero or more times. The final code might look something like this:

最后，我们可以将所有这些放在“mpc_mony”中，对其进行零次或多次解析。最终的代码可能看起来像这样：


```c
static mpc_val_t *print_token(mpc_val_t *x) {
  printf("Token: '%s'\n", (char*)x);
  return x;
}

int main(int argc, char **argv) {

  const char *input = "  hello 4352 ,  \n foo.bar   \n\n  test:ing   ";

  mpc_parser_t* Tokens = mpc_many(
    mpcf_all_free,
    mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token));

  mpc_result_t r;
  mpc_parse("input", input, Tokens, &r);

  mpc_delete(Tokens);

  return 0;
}
```

Running this program will produce an output something like this:

运行此程序将产生如下输出：

```
Token: 'hello'
Token: '4352'
Token: ','
Token: 'foo'
Token: '.'
Token: 'bar'
Token: 'test'
Token: ':'
Token: 'ing'
```

By extending the regex we can easily extend this to parse many more types of tokens and quickly and easily build a tokenizer for whatever language we are interested in.

通过扩展正则表达式，我们可以轻松地扩展它来解析更多类型的标记，并快速轻松地为我们感兴趣的任何语言构建标记器。

Error Reporting 错误报告
===============

_mpc_ provides some automatic generation of error messages. These can be enhanced by the user, with use of `mpc_expect`, but many of the defaults should provide both useful and readable. An example of an error message might look something like this:

_mpc提供了一些自动生成错误消息的功能。用户可以通过使用`mpc_expect`来增强这些功能，但许多默认值应该既有用又可读。错误消息的示例可能如下：

```
<test>:0:3: error: expected one or more of 'a' or 'd' at 'k'
```

Misc 杂项
====

Here are some other misc functions that mpc provides. These functions are susceptible to change between versions so use them with some care.

以下是mpc提供的一些其他杂项功能。这些功能很容易在版本之间发生变化，因此请谨慎使用。


* * *

```c
void mpc_print(mpc_parser_t *p);
```

Prints out a parser in some weird format. This is generally used for debugging so don't expect to be able to understand the output right away without looking at the source code a little bit.

以某种奇怪的格式打印出解析器。这通常用于调试，所以不要指望在不看一点源代码的情况下就能立即理解输出。

* * *

```c
void mpc_stats(mpc_parser_t *p);
```

Prints out some basic stats about a parser. Again used for debugging and optimisation.

打印出一些关于解析器的基本统计数据。再次用于调试和优化。


* * *

```c
void mpc_optimise(mpc_parser_t *p);
```

Performs some basic optimisations on a parser to reduce it's size and increase its running speed.

对解析器执行一些基本优化，以减小其大小并提高其运行速度。


Limitations & FAQ 限制和常见问题
=================

### I'm getting namespace issues due to `libmpc`, what can I do?

There is a re-naming of this project to `pcq` hosted on the [pcq branch](https://github.com/orangeduck/mpc/tree/pcq) which should be usable without namespace issues.

### Does _mpc_ support Unicode?

_mpc_ Only supports ASCII. Sorry! Writing a parser library that supports Unicode is pretty difficult. I welcome contributions!


### Is _mpc_ binary safe?

No. Sorry! Including NULL characters in a string or a file will probably break it. Avoid this if possible.


### The Parser is going into an infinite loop!

While it is certainly possible there is an issue with _mpc_, it is probably the case that your grammar contains _left recursion_. This is something _mpc_ cannot deal with. _Left recursion_ is when a rule directly or indirectly references itself on the left hand side of a derivation. For example consider this left recursive grammar intended to parse an expression.

```
expr : <expr> '+' (<expr> | <int> | <string>);
```

When the rule `expr` is called, it looks the first rule on the left. This happens to be the rule `expr` again. So again it looks for the first rule on the left. Which is `expr` again. And so on. To avoid left recursion this can be rewritten (for example) as the following. Note that rewriting as follows also changes the operator associativity.

```
value : <int> | <string> ;
expr  : <value> ('+' <expr>)* ;
```

Avoiding left recursion can be tricky, but is easy once you get a feel for it. For more information you can look on [wikipedia](http://en.wikipedia.org/wiki/Left_recursion) which covers some common techniques and more examples. Possibly in the future _mpc_ will support functionality to warn the user or re-write grammars which contain left recursion, but it wont for now.


### Backtracking isn't working!

_mpc_ supports backtracking, but it may not work as you expect. It isn't a silver bullet, and you still must structure your grammar to be unambiguous. To demonstrate this behaviour examine the following erroneous grammar, intended to parse either a C style identifier, or a C style function call.

```
factor : <ident>
       | <ident> '('  <expr>? (',' <expr>)* ')' ;
```

This grammar will never correctly parse a function call because it will always first succeed parsing the initial identifier and return a factor. At this point it will encounter the parenthesis of the function call, give up, and throw an error. Even if it were to try and parse a factor again on this failure it would never reach the correct function call option because it always tries the other options first, and always succeeds with the identifier.

The solution to this is to always structure grammars with the most specific clause first, and more general clauses afterwards. This is the natural technique used for avoiding left-recursive grammars and unambiguity, so is a good habit to get into anyway.

Now the parser will try to match a function first, and if this fails backtrack and try to match just an identifier.

```
factor : <ident> '('  <expr>? (',' <expr>)* ')'
       | <ident> ;
```

An alternative, and better option is to remove the ambiguity completely by factoring out the first identifier. This is better because it removes any need for backtracking at all! Now the grammar is predictive!

```
factor : <ident> ('('  <expr>? (',' <expr>)* ')')? ;
```


### How can I avoid the maximum string literal length?

Some compilers limit the maximum length of string literals. If you have a huge language string in the source file to be passed into `mpca_lang` you might encounter this. The ANSI standard says that 509 is the maximum length allowed for a string literal. Most compilers support greater than this. Visual Studio supports up to 2048 characters, while gcc allocates memory dynamically and so has no real limit.

There are a couple of ways to overcome this issue if it arises. You could instead use `mpca_lang_contents` and load the language from file or you could use a string literal for each line and let the preprocessor automatically concatenate them together, avoiding the limit. The final option is to upgrade your compiler. In C99 this limit has been increased to 4095.


### The automatic tags in the AST are annoying!

When parsing from a grammar, the abstract syntax tree is tagged with different tags for each primitive type it encounters. For example a regular expression will be automatically tagged as `regex`. Character literals as `char` and strings as `string`. This is to help people wondering exactly how they might need to convert the node contents.

If you have a rule in your grammar called `string`, `char` or `regex`, you may encounter some confusion. This is because nodes will be tagged with (for example) `string` _either_ if they are a string primitive, _or_ if they were parsed via your `string` rule. If you are detecting node type using something like `strstr`, in this situation it might break. One solution to this is to always check that `string` is the innermost tag to test for string primitives, or to rename your rule called `string` to something that doesn't conflict.

Yes it is annoying but its probably not going to change!
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
+								Micro Parser Combinators
 								========================
-												version number

											
										
										
											2018-12-16 09:53:56 -05:00
+								Version 0.9.0
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
 								About
 								-----
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								_mpc_ is a lightweight and powerful Parser Combinator library for C.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
 								Using _mpc_ might be of interest to you if you are...
 								* Building a new programming language
 								* Building a new data format
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								* Parsing an existing programming language
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								* Parsing an existing data format
 								* Embedding a Domain Specific Language
-												WIP prediction stuff

											
										
										
											2013-09-30 17:58:52 +01:00
+								* Implementing [Greenspun's Tenth Rule](http://en.wikipedia.org/wiki/Greenspun%27s_tenth_rule)
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
+								Features
 								--------
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Various fixes

											
										
										
											2014-01-09 11:12:59 +00:00
+								* Type-Generic
 								* Predictive, Recursive Descent
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								* Easy to Integrate (One Source File in ANSI C)
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								* Automatic Error Message Generation
-												Various fixes

											
										
										
											2014-01-09 11:12:59 +00:00
+								* Regular Expression Parser Generator
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								* Language/Grammar Parser Generator
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
 								Alternatives
 								------------
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								The current main alternative for a C based parser combinator library is a branch of [Cesium3](https://github.com/wbhart/Cesium3/tree/combinators).
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								_mpc_ provides a number of features that this project does not offer, and also overcomes a number of potential downsides:
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								* _mpc_ Works for Generic Types
 								* _mpc_ Doesn't rely on Boehm-Demers-Weiser Garbage Collection
 								* _mpc_ Doesn't use `setjmp` and `longjmp` for errors
 								* _mpc_ Doesn't pollute the namespace
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								Quickstart
 								==========
 								Here is how one would use _mpc_ to create a parser for a basic mathematical expression language.
 								```c
 								mpc_parser_t *Expr  = mpc_new("expression");
 								mpc_parser_t *Prod  = mpc_new("product");
 								mpc_parser_t *Value = mpc_new("value");
 								mpc_parser_t *Maths = mpc_new("maths");
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								mpca_lang(MPCA_LANG_DEFAULT,
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								  " expression : <product> (('+' | '-') <product>)*; "
-												Update README.md
											
										
										
											2014-08-20 14:50:23 +01:00
+								  " product    : <value>   (('*' | '/')   <value>)*; "
 								  " value      : /[0-9]+/ | '(' <expression> ')';    "
 								  " maths      : /^/ <expression> /$/;               ",
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  Expr, Prod, Value, Maths, NULL);
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
 								mpc_result_t r;
 								if (mpc_parse("input", input, Maths, &r)) {
 								  mpc_ast_print(r.output);
 								  mpc_ast_delete(r.output);
 								} else {
 								  mpc_err_print(r.error);
 								  mpc_err_delete(r.error);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								}
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
 								mpc_cleanup(4, Expr, Prod, Value, Maths);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								If you were to set `input` to the string `(4 * 2 * 11 + 2) - 5`, the printed output would look like this.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								```
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
+								>
 								  regex
 								  expression|>
 								    value|>
 								      char:1:1 '('
 								      expression|>
 								        product|>
 								          value|regex:1:2 '4'
 								          char:1:4 '*'
 								          value|regex:1:6 '2'
 								          char:1:8 '*'
 								          value|regex:1:10 '11'
 								        char:1:13 '+'
 								        product|value|regex:1:15 '2'
 								      char:1:16 ')'
 								    char:1:18 '-'
 								    product|value|regex:1:20 '5'
 								  regex
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
+								```
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								Getting Started
 								===============
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								Introduction
 								------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Parser Combinators are structures that encode how to parse particular languages. They can be combined using intuitive operators to create new parsers of increasing complexity. Using these operators detailed grammars and languages can be parsed and processed in a quick, efficient, and easy way.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								解析器组合器是对如何解析特定语言进行编码的结构。它们可以使用直观的运算符组合在一起，以创建越来越复杂的新解析器。使用这些运算符，可以快速、高效、简单地解析和处理详细的语法和语言。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								The trick behind Parser Combinators is the observation that by structuring the library in a particular way, one can make building parser combinators look like writing a grammar itself. Therefore instead of describing _how to parse a language_, a user must only specify _the language itself_, and the library will work out how to parse it ... as if by magic!
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Parser Combinators背后的诀窍是观察到，通过以特定的方式构建库，可以使构建解析器组合子看起来像编写语法本身。因此，用户只需指定语言本身，而不是描述如何解析语言，库将计算出如何解析它。。。仿佛施了魔法！
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								_mpc_ can be used in this mode, or, as shown in the above example, you can specify the grammar directly as a string or in a file.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								_mpc可以在这种模式下使用，或者，如上例所示，您可以直接将语法指定为字符串或文件。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								Basic Parsers
 								-------------
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								### String Parsers
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								All the following functions construct new basic parsers of the type `mpc_parser_t *`. All of those parsers return a newly allocated `char *` with the character(s) they manage to match. If unsuccessful they will return an error. They have the following functionality.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								以下所有函数都构造了类型为“mpc_parser_t*”的新基本解析器。所有这些解析器都返回一个新分配的“char*”，其中包含它们设法匹配的字符。如果失败，他们将返回错误。它们具有以下功能。
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								* * *
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_any(void);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Matches any individual character
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配任何单个字符
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								* * *
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_char(char c);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Matches a single given character `c`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配单个给定字符`c`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_range(char s, char e);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Matches any single given character in the range `s` to `e` (inclusive)
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配范围`s`到`e`（含）内的任何单个给定字符
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_oneof(const char *s);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Matches any single given character in the string  `s`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配字符串中的任何单个给定字符`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_noneof(const char *s);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Matches any single given character not in the string `s`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配不在字符串中的任何单个给定字符`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_satisfy(int(*f)(char));
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Matches any single given character satisfying function `f`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								匹配满足函数`f`的任何单个给定字符
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_string(const char *s);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								Matches exactly the string `s`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								完全匹配字符串`s`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
+								### Other Parsers
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Several other functions exist that construct parsers with some other special functionality.
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								还有其他几个函数可以构造具有其他特殊功能的解析器。
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_pass(void);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Consumes no input, always successful, returns `NULL`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗输入，总是成功，返回`NULL`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_fail(const char *m);
 								mpc_parser_t *mpc_failf(const char *fmt, ...);
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Consumes no input, always fails with message `m` or formatted string `fmt`.
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗任何输入，总是以消息`m`或格式化字符串`fmt`失败。
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								* * *
 								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_lift(mpc_ctor_t f);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Consumes no input, always successful, returns the result of function `f`
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗输入，总是成功，返回函数`f`的结果
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_lift_val(mpc_val_t *x);
-												Updated README

											
										
										
											2013-09-26 14:39:39 +01:00
+								```
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Consumes no input, always successful, returns `x`
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗输入，总是成功，返回`x`
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
+								* * *
 								```c
 								mpc_parser_t *mpc_state(void);
 								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Consumes no input, always successful, returns a copy of the parser state as a `mpc_state_t *`. This state is newly allocated and so needs to be released with `free` when finished with.
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗任何输入，总是成功的，返回解析器状态的副本作为`mpc_state_t*`。此状态是新分配的，因此在完成时需要用“free”释放。
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
+								* * *
 								```c
-												Refactored boundary stuff into more general anchor

											
										
										
											2014-04-16 23:20:52 +01:00
+								mpc_parser_t *mpc_anchor(int(*f)(char,char));
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Consumes no input. Successful when function `f` returns true. Always returns `NULL`.
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								不消耗任何输入。当函数`f`返回true时成功。始终返回`NULL`。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Function `f` is a _anchor_ function. It takes as input the last character parsed, and the next character in the input, and returns success or failure. This function can be set by the user to ensure some condition is met. For example to test that the input is at a boundary between words and non-words.
-												Refactored boundary stuff into more general anchor

											
										
										
											2014-04-16 23:20:52 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								函数`f`是一个_anchor_函数。它将解析的最后一个字符和输入中的下一个字符作为输入，并返回成功或失败。此功能可由用户设置，以确保满足某些条件。例如，测试输入是否位于单词和非单词之间的边界。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								At the start of the input the first argument is set to `'\0'`. At the end of the input the second argument is set to `'\0'`.
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								在输入开始时，第一个参数设置为`'\0'`。在输入的末尾，第二个参数被设置为`'\0'`。
-												Initial commit for boundary support

											
										
										
											2014-04-16 18:16:16 +01:00
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Parsing
 								-------
 								Once you've build a parser, you can run it on some input using one of the following functions. These functions return `1` on success and `0` on failure. They output either the result, or an error to a `mpc_result_t` variable. This type is defined as follows.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								构建解析器后，您可以使用以下函数之一在某些输入上运行它。这些函数成功时返回`1`，失败时返回`0`。它们将结果或错误输出到`mpc_result_t`变量。这种类型定义如下。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								```c
 								typedef union {
 								  mpc_err_t *error;
 								  mpc_val_t *output;
 								} mpc_result_t;
 								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								where `mpc_val_t *` is synonymous with `void *` and simply represents some pointer to data - the exact type of which is dependant on the parser.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								其中`mpc_val_t*`与`void*`同义，只是表示指向数据的某个指针，其确切类型取决于解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								* * *
 								```c
 								int mpc_parse(const char *filename, const char *string, mpc_parser_t *p, mpc_result_t *r);
 								```
 								Run a parser on some string.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								对某个字符串运行解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								* * *
 								```c
 								int mpc_parse_file(const char *filename, FILE *file, mpc_parser_t *p, mpc_result_t *r);
 								```
 								Run a parser on some file.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								对某个文件运行解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								* * *
 								```c
 								int mpc_parse_pipe(const char *filename, FILE *pipe, mpc_parser_t *p, mpc_result_t *r);
 								```
 								Run a parser on some pipe (such as `stdin`).
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								在某个管道（如`stdin`）上运行解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								* * *
 								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								int mpc_parse_contents(const char *filename, mpc_parser_t *p, mpc_result_t *r);
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								```
 								Run a parser on the contents of some file.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								对某个文件的内容运行解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
+								Combinators
 								-----------
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								Combinators are functions that take one or more parsers and return a new parser of some given functionality.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								组合器是接受一个或多个解析器并返回某个给定功能的新解析器的函数。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								These combinators work independently of exactly what data type the parser(s) supplied as input return. In languages such as Haskell ensuring you don't input one type of data into a parser requiring a different type is done by the compiler. But in C we don't have that luxury. So it is at the discretion of the programmer to ensure that he or she deals correctly with the outputs of different parser types.
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这些组合子的工作与解析器作为输入返回提供的数据类型完全无关。在Haskell等语言中，编译器会确保不将一种类型的数据输入到需要不同类型的解析器中。但在C中，我们没有这种奢侈。因此，程序员可以自行决定是否正确处理不同解析器类型的输出。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								A second annoyance in C is that of manual memory management. Some parsers might get half-way and then fail. This means they need to clean up any partial result that has been collected in the parse. In Haskell this is handled by the Garbage Collector, but in C these combinators will need to take _destructor_ functions as input, which say how clean up any partial data that has been collected.
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								C中的第二个烦恼是手动内存管理。一些解析器可能会中途失败。这意味着他们需要清理在解析中收集到的任何部分结果。在Haskell中，这是由垃圾收集器处理的，但在C中，这些组合子需要将_destructor_函数作为输入，该函数表示如何清理已收集的任何部分数据。
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
+								Here are the main combinators and how to use then.
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								以下是主要的组合子及其使用方法。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_expect(mpc_parser_t *a, const char *e);
 								mpc_parser_t *mpc_expectf(mpc_parser_t *a, const char *fmt, ...);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								Returns a parser that runs `a`, and on success returns the result of `a`, while on failure reports that `e` was expected.
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回一个运行`a`的解析器，成功时返回`a`结果，失败时返回预期的`e`结果。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_apply(mpc_parser_t *a, mpc_apply_t f);
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_apply_to(mpc_parser_t *a, mpc_apply_to_t f, void *x);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								Returns a parser that applies function `f` (optionality taking extra input `x`) to the result of parser `a`.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回一个解析器，该解析器将函数`f`（可选性接受额外的输入`x`）应用于解析器`a`的结果。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
+								```c
-												Added destructor to check combinators

											
										
										
											2019-06-15 14:30:41 -04:00
+								mpc_parser_t *mpc_check(mpc_parser_t *a, mpc_dtor_t da, mpc_check_t f, const char *e);
 								mpc_parser_t *mpc_check_with(mpc_parser_t *a, mpc_dtor_t da, mpc_check_with_t f, void *x, const char *e);
 								mpc_parser_t *mpc_checkf(mpc_parser_t *a, mpc_dtor_t da, mpc_check_t f, const char *fmt, ...);
 								mpc_parser_t *mpc_check_withf(mpc_parser_t *a, mpc_dtor_t da, mpc_check_with_t f, void *x, const char *fmt, ...);
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
+								```
-												Added destructor to check combinators

											
										
										
											2019-06-15 14:30:41 -04:00
+								Returns a parser that applies function `f` (optionally taking extra input `x`) to the result of parser `a`. If `f` returns non-zero, then the parser succeeds and returns the value of `a` (possibly modified by `f`). If `f` returns zero, then the parser fails with message `e`, and the result of `a` is destroyed with the destructor `da`.
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回一个解析器，该解析器将函数`f`（可选地接受额外的输入`x`）应用于解析器`a`的结果。如果`f`返回非零，则解析器成功并返回`a`的值（可能被`f`修改）。如果`f`返回零，那么解析器将失败，并返回消息`e`，`a`的结果将被析构函数`da`销毁。
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_not(mpc_parser_t *a, mpc_dtor_t da);
 								mpc_parser_t *mpc_not_lift(mpc_parser_t *a, mpc_dtor_t da, mpc_ctor_t lf);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								Returns a parser with the following behaviour. If parser `a` succeeds, then it fails and consumes no input. If parser `a` fails, then it succeeds, consumes no input and returns `NULL` (or the result of lift function `lf`). Destructor `da` is used to destroy the result of `a` on success.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回具有以下行为的解析器。如果解析器`a`成功，则它失败并且不消耗任何输入。如果解析器`a`失败，则它成功，不消耗任何输入并返回`NULL`（或提升函数`lf`的结果）。析构函数`da`用于析构`a`对成功的结果。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_maybe(mpc_parser_t *a);
 								mpc_parser_t *mpc_maybe_lift(mpc_parser_t *a, mpc_ctor_t lf);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Returns a parser that runs `a`. If `a` is successful then it returns the result of `a`. If `a` is unsuccessful then it succeeds, but returns `NULL` (or the result of `lf`).
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回一个运行`a`的解析器。如果`a`成功，则返回`a`的结果。如果`a`不成功，则成功，但返回`NULL`（或`lf`的结果）。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_many(mpc_fold_t f, mpc_parser_t *a);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Runs `a` zero or more times until it fails. Results are combined using fold function `f`. See the _Function Types_ section for more details.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								运行`a`零次或多次，直到失败。使用折叠函数`f`组合结果。有关更多详细信息，请参阅_Function Types_部分。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_many1(mpc_fold_t f, mpc_parser_t *a);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Runs `a` one or more times until it fails. Results are combined with fold function `f`.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								运行`a`一次或多次，直到失败。结果与折叠函数`f`相结合。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												feat: add sepby1 combinator

`sepby1` is a common reusable combinator in Haskell Parsec.

This adds `mpc_sepby1(mpc_fold_t f, mpc_parser_t *sep, mpc_parser_t *a)` according to Haskell's implementation:

https://hackage.haskell.org/package/parsec-3.1.16.1/docs/src/Text.Parsec.Combinator.html#sepBy1

Reuses existing `mpc_and`, `mpc_many`, and `mpcf_snd_free`.

											
										
										
											2023-08-08 14:15:55 -05:00
+								```c
 								mpc_parser_t *mpc_sepby1(mpc_fold_t f, mpc_parser_t *sep, mpc_parser_t *a);
 								```
 								Runs `a` one or more times, separated by `sep`. Results are combined with fold function `f`.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								运行`a`一次或多次，用`sep`分隔。结果与折叠函数`f`相结合。
-												feat: add sepby1 combinator

`sepby1` is a common reusable combinator in Haskell Parsec.

This adds `mpc_sepby1(mpc_fold_t f, mpc_parser_t *sep, mpc_parser_t *a)` according to Haskell's implementation:

https://hackage.haskell.org/package/parsec-3.1.16.1/docs/src/Text.Parsec.Combinator.html#sepBy1

Reuses existing `mpc_and`, `mpc_many`, and `mpcf_snd_free`.

											
										
										
											2023-08-08 14:15:55 -05:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_count(int n, mpc_fold_t f, mpc_parser_t *a, mpc_dtor_t da);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Runs `a` exactly `n` times. If this fails, any partial results are destructed with `da`. If successful results of `a` are combined using fold function `f`.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								运行`a`正好`n`次。如果失败，任何部分结果都将用`da`销毁。如果成功使用折叠函数`f`组合`a`。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_or(int n, ...);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Attempts to run `n` parsers in sequence, returning the first one that succeeds. If all fail, returns an error.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								尝试按顺序运行`n`个解析器，返回第一个成功的解析器。如果全部失败，则返回错误。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_and(int n, mpc_fold_t f, ...);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Attempts to run `n` parsers in sequence, returning the fold of the results using fold function `f`. First parsers must be specified, followed by destructors for each parser, excluding the final parser. These are used in case of partial success. For example: `mpc_and(3, mpcf_strfold, mpc_char('a'), mpc_char('b'), mpc_char('c'), free, free);` would attempt to match `'a'` followed by `'b'` followed by `'c'`, and if successful would concatenate them using `mpcf_strfold`. Otherwise would use `free` on the partial results.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								尝试按顺序运行`n`个解析器，使用fold函数`f`返回结果的倍数。必须指定第一个解析器，然后为每个解析器指定析构函数，不包括最后一个解析器。这些用于部分成功的情况。例如：`mpc_and（3，mpcf_strfold，mpc_char（'a'），mpc_char（'b'），mpc_char（'c'），free，free）；`将尝试匹配`'a'`、`'b'`和`'c'`，如果成功，将使用`mpcf_strfold`将它们连接起来。否则，将在部分结果上使用`free`。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_predictive(mpc_parser_t *a);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Returns a parser that runs `a` with backtracking disabled. This means if `a` consumes more than one character, it will not be reverted, even on failure. Turning backtracking off has good performance benefits for grammars which are `LL(1)`. These are grammars where the first character completely determines the parse result - such as the decision of parsing either a C identifier, number, or string literal. This option should not be used for non `LL(1)` grammars or it will produce incorrect results or crash the parser.
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								返回一个在禁用回溯的情况下运行`a`的解析器。这意味着，如果`a`消耗了多个字符，即使失败，它也不会被还原。对于`LL（1）`语法，关闭回溯具有良好的性能优势。这些语法中，第一个字符完全决定了解析结果，例如解析C标识符、数字或字符串文字的决定。此选项不应用于非`LL（1）`语法，否则会产生不正确的结果或使解析器崩溃。
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Another way to think of `mpc_predictive` is that it can be applied to a parser (for a performance improvement) if either successfully parsing the first character will result in a completely successful parse, or all of the referenced sub-parsers are also `LL(1)`.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								另一种理解`mpc_cpredictive`的方法是，如果成功解析第一个字符将导致完全成功的解析，或者所有引用的子解析器都是`LL（1）`，则可以将其应用于解析器（以提高性能）。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
 								Function Types
 								--------------
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								The combinator functions take a number of special function types as function pointers. Here is a short explanation of those types are how they are expected to behave. It is important that these behave correctly otherwise it is easy to introduce memory leaks or crashes into the system.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								组合函数接受许多特殊的函数类型作为函数指针。以下是对这些类型的简要解释，即它们应该如何表现。重要的是，这些行为必须正确，否则很容易在系统中引入内存泄漏或崩溃。
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								* * *
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
 								typedef void(*mpc_dtor_t)(mpc_val_t*);
 								```
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								Given some pointer to a data value it will ensure the memory it points to is freed correctly.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								给定一个指向数据值的指针，它将确保它指向的内存被正确释放。
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
 								```c
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								typedef mpc_val_t*(*mpc_ctor_t)(void);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Returns some data value when called. It can be used to create _empty_ versions of data types when certain combinators have no known default value to return. For example it may be used to return a newly allocated empty string.
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								调用时返回一些数据值。当某些组合子没有已知的默认值可返回时，它可用于创建数据类型的_empty_版本。例如，它可用于返回新分配的空字符串。
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								* * *
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								typedef mpc_val_t*(*mpc_apply_t)(mpc_val_t*);
 								typedef mpc_val_t*(*mpc_apply_to_t)(mpc_val_t*,void*);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								This takes in some pointer to data and outputs some new or modified pointer to data, ensuring to free the input data if it is no longer used. The `apply_to` variation takes in an extra pointer to some data such as global state.
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这会接收一些指向数据的指针，并输出一些新的或修改过的指向数据的指示器，确保在不再使用输入数据时释放它。`apply_to`变量引入了一个指向全局状态等数据的额外指针。
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
+								```c
 								typedef int(*mpc_check_t)(mpc_val_t**);
 								typedef int(*mpc_check_with_t)(mpc_val_t**,void*);
 								```
 								This takes in some pointer to data and outputs 0 if parsing should stop with an error. Additionally, this may change or free the input data. The `check_with` variation takes in an extra pointer to some data such as global state.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这会接收一些指向数据的指针，如果解析因错误而停止，则输出0。此外，这可能会更改或释放输入数据。`check_with`变量引入了一个指向某些数据（如全局状态）的额外指针。
-												Add `mpc_check` and `mpc_check_with` combinators.

											
										
										
											2018-03-23 10:50:09 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								typedef mpc_val_t*(*mpc_fold_t)(int,mpc_val_t**);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								This takes a list of pointers to data values and must return some combined or folded version of these data values. It must ensure to free any input data that is no longer used once the combination has taken place.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这需要一个指向数据值的指针列表，并且必须返回这些数据值的组合或折叠版本。它必须确保在组合发生后释放不再使用的任何输入数据。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								Case Study - Identifier
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								=======================
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								Combinator Method
 								-----------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Using the above combinators we can create a parser that matches a C identifier.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								使用上述组合子，我们可以创建一个与C标识符匹配的解析器。
-												small readme tweaks

											
										
										
											2014-01-26 11:47:55 +00:00
+								When using the combinators we need to supply a function that says how to combine two `char *`.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								当使用组合子时，我们需要提供一个函数，说明如何组合两个`char*`。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								For this we build a fold function that will concatenate zero or more strings together. For this sake of this tutorial we will write it by hand, but this (as well as many other useful fold functions), are actually included in _mpc_ under the `mpcf_*` namespace, such as `mpcf_strfold`.
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								为此，我们构建了一个fold函数，将零个或多个字符串连接在一起。为了本教程的目的，我们将手工编写它，但这个（以及许多其他有用的折叠函数）实际上包含在“mpcf_*”命名空间下的_mpc_中，例如“mpcf_strfold”。
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
+								```c
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								mpc_val_t *strfold(int n, mpc_val_t **xs) {
 								  char *x = calloc(1, 1);
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								  int i;
 								  for (i = 0; i < n; i++) {
 								    x = realloc(x, strlen(x) + strlen(xs[i]) + 1);
 								    strcat(x, xs[i]);
 								    free(xs[i]);
 								  }
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								  return x;
 								}
 								```
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								We can use this to specify a C identifier, making use of some combinators to say how the basic parsers are combined.
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								我们可以用它来指定一个C标识符，利用一些组合子来说明基本解析器是如何组合的。
-												Readme tweaks

											
										
										
											2013-09-26 14:28:01 +01:00
+								```c
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								mpc_parser_t *alpha = mpc_or(2, mpc_range('a', 'z'), mpc_range('A', 'Z'));
 								mpc_parser_t *digit = mpc_range('0', '9');
 								mpc_parser_t *underscore = mpc_char('_');
 								mpc_parser_t *ident = mpc_and(2, strfold,
 								  mpc_or(2, alpha, underscore),
 								  mpc_many(strfold, mpc_or(3, alpha, digit, underscore)),
 								  free);
 								/* Do Some Parsing... */
 								mpc_delete(ident);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Notice that previous parsers are used as input to new parsers we construct from the combinators. Note that only the final parser `ident` must be deleted. When we input a parser into a combinator we should consider it to be part of the output of that combinator.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								请注意，之前的解析器被用作我们从组合子构造的新解析器的输入。请注意，只有最后一个解析器`ident`必须删除。当我们将解析器输入组合子时，我们应该将其视为该组合子输出的一部分。
-												Update README.md

typo
											
										
										
											2018-10-09 21:00:55 -04:00
+								Because of this we shouldn't create a parser and input it into multiple places, or it will be doubly freed.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								因此，我们不应该创建解析器并将其输入到多个位置，否则它将被双重释放。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								Regex Method
 								------------
-												small readme tweaks

											
										
										
											2014-01-26 11:47:55 +00:00
+								There is an easier way to do this than the above method. _mpc_ comes with a handy regex function for constructing parsers using regex syntax. We can specify an identifier using a regex pattern as shown below.
-												Language now uses full grammar

											
										
										
											2013-09-27 10:41:42 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								有一种比上述方法更简单的方法_mpc附带了一个方便的正则表达式函数，用于使用正则表达式语法构建解析器。我们可以使用正则表达式模式指定标识符，如下所示。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								```c
 								mpc_parser_t *ident = mpc_re("[a-zA-Z_][a-zA-Z_0-9]*");
-												Language now uses full grammar

											
										
										
											2013-09-27 10:41:42 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								/* Do Some Parsing... */
 								mpc_delete(ident);
 								```
 								Library Method
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								--------------
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Although if we really wanted to create a parser for C identifiers, a function for creating this parser comes included in _mpc_ along with many other common parsers.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								虽然如果我们真的想为C标识符创建一个解析器，但_mpc_中包含了创建此解析器的函数以及许多其他常见的解析器。
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								```c
 								mpc_parser_t *ident = mpc_ident();
 								/* Do Some Parsing... */
 								mpc_delete(ident);
 								```
 								Parser References
 								=================
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Building parsers in the above way can have issues with self-reference or cyclic-reference. To overcome this we can separate the construction of parsers into two different steps. Construction and Definition.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								以上述方式构建解析器可能会出现自引用或循环引用的问题。为了克服这个问题，我们可以将解析器的构建分为两个不同的步骤。构造和定义。
-												More Readme tweaks

											
										
										
											2013-09-26 14:34:34 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpc_new(const char *name);
-												Updated testing framework

											
										
										
											2013-09-23 22:41:58 +01:00
+								```
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								This will construct a parser called `name` which can then be used as input to others, including itself, without fear of being deleted. Any parser created using `mpc_new` is said to be _retained_. This means it will behave differently to a normal parser when referenced. When deleting a parser that includes a _retained_ parser, the _retained_ parser will not be deleted along with it. To delete a retained parser `mpc_delete` must be used on it directly.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这将构造一个名为`name`的解析器，然后可以将其用作其他人的输入，包括它自己，而不必担心被删除。任何使用`mpc_new`创建的解析器都被称为_retaind_。这意味着当被引用时，它的行为将与普通解析器不同。删除包含_retained_解析器的解析器时，_retained_ 解析器不会与其一起删除。要删除保留的解析器，必须直接对其使用 `mpc_delete`。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								A _retained_ parser can then be _defined_ using...
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								然后，可以使用…对_retaind_解析器进行_defined_。。。
-												More Readme tweaks

											
										
										
											2013-09-26 14:34:34 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_define(mpc_parser_t *p, mpc_parser_t *a);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								This assigns the contents of parser `a` to `p`, and deletes `a`. With this technique parsers can now reference each other, as well as themselves, without trouble.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这将解析器`a`的内容分配给`p`，并删除`a`。通过这种技术，解析器现在可以毫无困难地相互引用，也可以引用自己。
-												More Readme tweaks

											
										
										
											2013-09-26 14:34:34 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												A couple more fixes and edits

											
										
										
											2014-01-21 11:29:08 +00:00
+								mpc_parser_t *mpc_undefine(mpc_parser_t *p);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								A final step is required. Parsers that reference each other must all be undefined before they are deleted. It is important to do any undefining before deletion. The reason for this is that to delete a parser it must look at each sub-parser that is used by it. If any of these have already been deleted a segfault is unavoidable - even if they were retained beforehand.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								需要最后一步。相互引用的解析器在删除之前必须全部未定义。在删除之前进行任何未定义的操作都很重要。这样做的原因是，要删除一个解析器，它必须查看它使用的每个子解析器。如果其中任何一个子解析器已经被删除，那么segfault是不可避免的，即使它们事先被保留了。
-												More Readme tweaks

											
										
										
											2013-09-26 14:34:34 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
 								void mpc_cleanup(int n, ...);
 								```
-												small readme updates

											
										
										
											2013-09-26 14:20:21 +01:00
+								To ease the task of undefining and then deleting parsers `mpc_cleanup` can be used. It takes `n` parsers as input, and undefines them all, before deleting them all.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								为了简化定义和删除解析器的任务，可以使用`mpc_cleap`。它接收`n`个解析器作为输入，并在删除它们之前取消定义它们。
-												Added copy function

											
										
										
											2016-02-20 14:44:08 +00:00
+								* * *
 								```c
 								mpc_parser_t *mpc_copy(mpc_parser_t *a);
 								```
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								This function makes a copy of a parser `a`. This can be useful when you want to use a parser as input for some other parsers multiple times without retaining it.
 								此函数生成解析器`a`的副本。当您想多次使用解析器作为其他解析器的输入而不保留它时，这可能很有用。
-												Added copy function

											
										
										
											2016-02-20 14:44:08 +00:00
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								* * *
 								```c
 								mpc_parser_t *mpc_re(const char *re);
 								mpc_parser_t *mpc_re_mode(const char *re, int mode);
 								```
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								This function takes as input the regular expression `re` and builds a parser for it. With the `mpc_re_mode` function optional mode flags can also be given.
 								此函数将正则表达式`re`作为输入，并为其构建解析器。使用`mpc_re_mode`函数，还可以给出可选的模式标志。
 								Available flags are `MPC_RE_MULTILINE` / `MPC_RE_M` where the start of input character `^` also matches the beginning of new lines and the end of input `$` character also matches new lines, and `MPC_RE_DOTALL` / `MPC_RE_S` where the any character token `.` also matches newlines (by default it doesn't).
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								可用的标志是`MPC_RE_MULTILINE` / `MPC_RE_M`，其中输入字符`^`的开头也与新行的开头匹配，输入字符`$`的结尾也与新行匹配，以及`MPC_RE_DOTALL` / `MPC_RE_S`，其中包含任何字符标记`.`也匹配换行符（默认情况下不匹配）。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
 								Library Reference 库参考
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								=================
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Common Parsers 常见解析器
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								--------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
 								<table>
 								  <tr><td><code>mpc_soi</code></td><td>Matches only the start of input, returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>仅匹配输入的开头,返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_eoi</code></td><td>Matches only the end of input, returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>仅匹配输入的结尾,返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_boundary</code></td><td>Matches only the boundary between words, returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>仅匹配单词之间的边界,返回 <code>NULL</code></td></tr>
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								  <tr><td><code>mpc_boundary_newline</code></td><td>Matches the start of a new line, returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配新行的开头,返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_whitespace</code></td><td>Matches any whitespace character <code>" \f\n\r\t\v"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配任意的空白字符<code>" \f\n\r\t\v"</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_whitespaces</code></td><td>Matches zero or more whitespace characters</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配零个或多个空白字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_blank</code></td><td>Matches whitespaces and frees the result, returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配并释空白字符放结果,返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_newline</code></td><td>Matches <code>'\n'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>'\n'</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_tab</code></td><td>Matches <code>'\t'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>'\t'</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_escape</code></td><td>Matches a backslash followed by any character</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配反斜杠后跟任何字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_digit</code></td><td>Matches any character in the range <code>'0'</code> - <code>'9'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>'0'</code> - <code>'9'</code>之间的任意字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_hexdigit</code></td><td>Matches any character in the range <code>'0</code> - <code>'9'</code> as well as <code>'A'</code> - <code>'F'</code> and <code>'a'</code> - <code>'f'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配16进制的字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_octdigit</code></td><td>Matches any character in the range <code>'0'</code> - <code>'7'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配8进制的字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_digits</code></td><td>Matches one or more digit</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配1个或多个数字</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_hexdigits</code></td><td>Matches one or more hexdigit</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配1个或多个16进制数字</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_octdigits</code></td><td>Matches one or more octdigit</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配1个或多个8进制数字</td></tr>
-												Update README.md
											
										
										
											2015-03-11 09:43:01 +00:00
+								  <tr><td><code>mpc_lower</code></td><td>Matches any lower case character</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配任意小写字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_upper</code></td><td>Matches any upper case character</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配任意大写字符</td></tr>
-												Update README.md
											
										
										
											2015-03-11 09:43:44 +00:00
+								  <tr><td><code>mpc_alpha</code></td><td>Matches any alphabet character</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配任意字母表字符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_underscore</code></td><td>Matches <code>'_'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>'_'</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_alphanum</code></td><td>Matches any alphabet character, underscore or digit</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配字母表 <code>'_'</code> 和数字</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_int</code></td><td>Matches digits and returns an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配数字,返回 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_hex</code></td><td>Matches hexdigits and returns an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配16进制,返回 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_oct</code></td><td>Matches octdigits and returns an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配8进制,返回 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_number</code></td><td>Matches <code>mpc_int</code>, <code>mpc_hex</code> or <code>mpc_oct</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>mpc_int</code>, <code>mpc_hex</code> or <code>mpc_oct</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_real</code></td><td>Matches some floating point number as a string</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>将某个浮点数匹配为字符串</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_float</code></td><td>Matches some floating point number and returns a <code>float*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配浮点数,返回 <code>float*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_char_lit</code></td><td>Matches some character literal surrounded by <code>'</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配由 <code>'</code>包围的字符 </td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_string_lit</code></td><td>Matches some string literal surrounded by <code>"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配由 <code>"</code>包围的字符串</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_regex_lit</code></td><td>Matches some regex literal surrounded by <code>/</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配一些被<code>/</code>包围的正则表达式文字 </td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_ident</code></td><td>Matches a C style identifier</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配C样式标识符</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
 								</table>
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Useful Parsers 有用的解析器
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								--------------
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								<table>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_startswith(mpc_parser_t *a);</code></td><td>Matches the start of input followed by <code>a</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配开头</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_endswith(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches <code>a</code> followed by the end of input</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配结尾</td></tr>
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								  <tr><td><code>mpc_whole(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches the start of input, <code>a</code>, and the end of input</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配开头和结尾</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_stripl(mpc_parser_t *a);</code></td><td>Matches <code>a</code> first consuming any whitespace to the left</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>删除左边的空白字符</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_stripr(mpc_parser_t *a);</code></td><td>Matches <code>a</code> then consumes any whitespace to the right</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>删除右边的空白字符</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_strip(mpc_parser_t *a);</code></td><td>Matches <code>a</code> consuming any surrounding whitespace</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>删除周围的空白字符</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok(mpc_parser_t *a);</code></td><td>Matches <code>a</code> and consumes any trailing whitespace</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>删除尾随的空白字符</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_sym(const char *s);</code></td><td>Matches string <code>s</code> and consumes any trailing whitespace</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配字符串，然后删除尾随的空白字符</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_total(mpc_parser_t *a, mpc_dtor_t da);</code></td><td>Matches the whitespace consumed <code>a</code>, enclosed in the start and end of input</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配空白字符删除 <code>a</code>, 包含在输入的开头和结尾</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_between(mpc_parser_t *a, mpc_dtor_t ad, <br /> const char *o, const char *c);</code></td><td> Matches <code>a</code> between strings <code>o</code> and <code>c</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>a</code> 在字符串 <code>o</code> 和 <code>c</code>之间</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_parens(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"("</code> and <code>")"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配小括号</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_braces(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"<"</code> and <code>">"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配尖括号</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_brackets(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"{"</code> and <code>"}"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配大括号</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_squares(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between <code>"["</code> and <code>"]"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配中括号</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok_between(mpc_parser_t *a, mpc_dtor_t ad, <br /> const char *o, const char *c);</code></td><td>Matches <code>a</code> between <code>o</code> and <code>c</code>, where <code>o</code> and <code>c</code> have their trailing whitespace striped.</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配 <code>o</code>和 <code>c</code>之间的 <code>a</code>，其中 <code>o</code>和 <code>c</code>的尾部空格被剥离。</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok_parens(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"("</code> and <code>")"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配<code>"("</code> 和 <code>")"</code>之间的尾随空格</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok_braces(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"<"</code> and <code>">"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配<code>"<>"</code> 和 <code>">"</code>之间的尾随空格</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok_brackets(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"{"</code> and <code>"}"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配<code>"{"</code> 和 <code>"}"</code>之间的尾随空格</td></tr>
-												Fixed README table format.
											
										
										
											2017-06-28 13:22:07 +08:00
+								  <tr><td><code>mpc_tok_squares(mpc_parser_t *a, mpc_dtor_t ad);</code></td><td>Matches <code>a</code> between trailing whitespace consumed <code>"["</code> and <code>"]"</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>匹配<code>"["</code> 和 <code>"]"</code>之间的尾随空格</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
 								</table>
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Apply Functions 应用函数
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								---------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								<table>
 								  <tr><td><code>void mpcf_dtor_null(mpc_val_t *x);</code></td><td>Empty destructor. Does nothing</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>空白析构函数 什么都不做</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_ctor_null(void);</code></td><td>Returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_ctor_str(void);</code></td><td>Returns <code>""</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回 <code>""</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_free(mpc_val_t *x);</code></td><td>Frees <code>x</code> and returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>释放 <code>x</code> 然后返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_int(mpc_val_t *x);</code></td><td>Converts a decimal string <code>x</code> to an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换10进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_hex(mpc_val_t *x);</code></td><td>Converts a hex string <code>x</code> to an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换16进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_oct(mpc_val_t *x);</code></td><td>Converts a oct string <code>x</code> to an <code>int*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换8进制字符串 <code>x</code> 为 <code>int*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_float(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to a <code>float*</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换字符串 <code>x</code> 为 <code>float*</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_escape(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to an escaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换字符串 <code>x</code> 为转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_escape_regex(mpc_val_t *x);</code></td><td>Converts a regex <code>x</code> to an escaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换正则表达式<code>x</code> 为转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_escape_string_raw(mpc_val_t *x);</code></td><td>Converts a raw string <code>x</code> to an escaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换原始字符串 <code>x</code> 为转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_escape_char_raw(mpc_val_t *x);</code></td><td>Converts a raw character <code>x</code> to an escaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换原始字符 <code>x</code> 为转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_unescape(mpc_val_t *x);</code></td><td>Converts a string <code>x</code> to an unescaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换字符串 <code>x</code> 为未转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_unescape_regex(mpc_val_t *x);</code></td><td>Converts a regex <code>x</code> to an unescaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换正则表达式 <code>x</code> 为未转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_unescape_string_raw(mpc_val_t *x);</code></td><td>Converts a raw string <code>x</code> to an unescaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换原始字符串 <code>x</code> 为未转义版本</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_unescape_char_raw(mpc_val_t *x);</code></td><td>Converts a raw character <code>x</code> to an unescaped version</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>转换原始字符 <code>x</code> 为未转义版本</td></tr>
-												Update README.md
											
										
										
											2015-03-11 09:48:10 +00:00
+								  <tr><td><code>mpc_val_t *mpcf_strtriml(mpc_val_t *x);</code></td><td>Trims whitespace from the left of string <code>x</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>修剪字符串左侧的空白</td></tr>
-												Update README.md
											
										
										
											2015-03-11 09:48:10 +00:00
+								  <tr><td><code>mpc_val_t *mpcf_strtrimr(mpc_val_t *x);</code></td><td>Trims whitespace from the right of string <code>x</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>修剪字符串右侧的空格</td></tr>
-												Update README.md
											
										
										
											2015-03-11 09:48:10 +00:00
+								  <tr><td><code>mpc_val_t *mpcf_strtrim(mpc_val_t *x);</code></td><td>Trims whitespace from either side of string <code>x</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>修剪字符串两侧的空格</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								</table>
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Fold Functions 折叠函数
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								--------------
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								<table>
 								  <tr><td><code>mpc_val_t *mpcf_null(int n, mpc_val_t** xs);</code></td><td>Returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回 <code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_fst(int n, mpc_val_t** xs);</code></td><td>Returns first element of <code>xs</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第一个元素</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_snd(int n, mpc_val_t** xs);</code></td><td>Returns second element of <code>xs</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第二个元素</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_trd(int n, mpc_val_t** xs);</code></td><td>Returns third element of <code>xs</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第三个元素</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_fst_free(int n, mpc_val_t** xs);</code></td><td>Returns first element of <code>xs</code> and calls <code>free</code> on others</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第一个元素并释放其他</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_snd_free(int n, mpc_val_t** xs);</code></td><td>Returns second element of <code>xs</code> and calls <code>free</code> on others</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第二个元素并释放其他</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_trd_free(int n, mpc_val_t** xs);</code></td><td>Returns third element of <code>xs</code> and calls <code>free</code> on others</td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>返回第三个元素并释放其他</td></tr>
-												fixed renaming of mpcf_freefold to mpcf_all_free

											
										
										
											2020-07-19 08:53:43 -04:00
+								  <tr><td><code>mpc_val_t *mpcf_all_free(int n, mpc_val_t** xs);</code></td><td>Calls <code>free</code> on all elements of <code>xs</code> and returns <code>NULL</code></td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>释放所有元素并返回<code>NULL</code></td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>mpc_val_t *mpcf_strfold(int n, mpc_val_t** xs);</code></td><td>Concatenates all <code>xs</code> together as strings and returns result </td></tr>
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								  <tr><td></td><td>将所有<code>xs</code>连接在一起作为字符串并返回结果</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
 								</table>
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Case Study - Maths Language 案例研究 - 数学语言
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								===========================
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Combinator Approach 组合方法
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								-------------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								Passing around all these function pointers might seem clumsy, but having parsers be type-generic is important as it lets users define their own output types for parsers. For example we could design our own syntax tree type to use. We can also use this method to do some specific house-keeping or data processing in the parsing phase.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								传递所有这些函数指针可能看起来很笨拙，但让解析器具有类型泛型很重要，因为它允许用户为解析器定义自己的输出类型。例如，我们可以设计自己的语法树类型来使用。我们还可以在解析阶段使用这种方法进行一些特定的内务管理或数据处理。
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								As an example of this power, we can specify a simple maths grammar, that outputs `int *`, and computes the result of the expression as it goes along.
-												small readme tweaks

											
										
										
											2014-01-26 11:47:55 +00:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								作为这种能力的一个例子，我们可以指定一个简单的数学语法，输出`int*`，并计算表达式的结果。
-												small readme tweaks

											
										
										
											2014-01-26 11:47:55 +00:00
+								We start with a fold function that will fold two `int *` into a new `int *` based on some `char *` operator.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								我们从一个fold函数开始，该函数将根据一些`char*`运算符将两个`int*`折叠成一个新的`int*`。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_val_t *fold_maths(int n, mpc_val_t **xs) {
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								  int **vs = (int**)xs;
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								  if (strcmp(xs[1], "*") == 0) { *vs[0] *= *vs[2]; }
 								  if (strcmp(xs[1], "/") == 0) { *vs[0] /= *vs[2]; }
 								  if (strcmp(xs[1], "%") == 0) { *vs[0] %= *vs[2]; }
 								  if (strcmp(xs[1], "+") == 0) { *vs[0] += *vs[2]; }
 								  if (strcmp(xs[1], "-") == 0) { *vs[0] -= *vs[2]; }
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								  free(xs[1]); free(xs[2]);
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								  return xs[0];
 								}
 								```
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								And then we use this to specify a basic grammar, which folds together any results.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								然后我们用它来指定一个基本语法，它将任何结果折叠在一起。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *Expr   = mpc_new("expr");
 								mpc_parser_t *Factor = mpc_new("factor");
 								mpc_parser_t *Term   = mpc_new("term");
 								mpc_parser_t *Maths  = mpc_new("maths");
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								mpc_define(Expr, mpc_or(2,
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								  mpc_and(3, fold_maths,
-												Fix a minor README error.

Precedence of operators was flipped around in the fold_maths example.

											
										
										
											2015-11-17 15:43:25 -07:00
+								    Factor, mpc_oneof("+-"), Factor,
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								    free, free),
 								  Factor
 								));
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								mpc_define(Factor, mpc_or(2,
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								  mpc_and(3, fold_maths,
-												Fix a minor README error.

Precedence of operators was flipped around in the fold_maths example.

											
										
										
											2015-11-17 15:43:25 -07:00
+								    Term, mpc_oneof("*/"), Term,
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								    free, free),
 								  Term
 								));
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_define(Term, mpc_or(2, mpc_int(), mpc_parens(Expr, free)));
 								mpc_define(Maths, mpc_whole(Expr, free));
 								/* Do Some Parsing... */
 								mpc_delete(Maths);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												Some language stuff

											
										
										
											2013-09-26 17:58:14 +01:00
+								If we supply this function with something like `(4*2)+5`, we can expect it to output `13`.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								如果我们为这个函数提供类似于`(4*2)+5`的东西，我们可以期望它输出`13`。
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Language Approach 语言方法
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								-----------------
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								It is possible to avoid passing in and around all those function pointers, if you don't care what type is output by _mpc_. For this, a generic Abstract Syntax Tree type `mpc_ast_t` is included in _mpc_. The combinator functions which act on this don't need information on how to destruct or fold instances of the result as they know it will be a `mpc_ast_t`. So there are a number of combinator functions which work specifically (and only) on parsers that return this type. They reside under `mpca_*`.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								如果你不在乎_mpc_输出什么类型，可以避免传入和绕过所有这些函数指针。为此，_mpc_中包含了一个通用的抽象语法树类型`mpc_ast_t`。作用于此的组合子函数不需要关于如何销毁或折叠结果实例的信息，因为它们知道这将是一个`mpc_ast_t`。因此，有许多组合子函数专门（且仅）在返回此类型的解析器上工作。它们使用`mpca_*`类型的名称。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Doing things via this method means that all the data processing must take place after the parsing. In many instances this is not an issue, or even preferable.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								通过这种方法做事意味着所有的数据处理都必须在解析后进行。在许多情况下，这不是问题，甚至更可取。
-												small readme tweaks

											
										
										
											2014-01-26 11:47:55 +00:00
+								It also allows for one more trick. As all the fold and destructor functions are implicit, the user can simply specify the grammar of the language in some nice way and the system can try to build a parser for the AST type from this alone. For this there are a few functions supplied which take in a string, and output a parser. The format for these grammars is simple and familiar to those who have used parser generators before. It looks something like this.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								它还允许再耍一个花招。由于所有的fold和析构函数函数都是隐式的，用户可以简单地以某种好的方式指定语言的语法，系统可以尝试仅凭此为AST类型构建解析器。为此，提供了一些函数，它们接收字符串并输出解析器。这些语法的格式对于那些以前使用过解析器生成器的人来说是简单而熟悉的。它看起来像这样。
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								```
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								number "number" : /[0-9]+/ ;
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
+								expression      : <product> (('+' | '-') <product>)* ;
 								product         : <value>   (('*' | '/')   <value>)* ;
 								value           : <number> | '(' <expression> ')' ;
 								maths           : /^/ <expression> /$/ ;
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								The syntax for this is defined as follows.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								其语法定义如下。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								<table class='table'>
 								  <tr><td><code>"ab"</code></td><td>The string <code>ab</code> is required.</td></tr>
 								  <tr><td><code>'a'</code></td><td>The character <code>a</code> is required.</td></tr>
 								  <tr><td><code>'a' 'b'</code></td><td>First <code>'a'</code> is required, then <code>'b'</code> is required..</td></tr>
 								  <tr><td><code>'a' | 'b'</code></td><td>Either <code>'a'</code> is required, or <code>'b'</code> is required.</td></tr>
 								  <tr><td><code>'a'*</code></td><td>Zero or more <code>'a'</code> are required.</td></tr>
 								  <tr><td><code>'a'+</code></td><td>One or more <code>'a'</code> are required.</td></tr>
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								  <tr><td><code>'a'?</code></td><td>Zero or one <code>'a'</code> is required.</td></tr>
 								  <tr><td><code>'a'{x}</code></td><td>Exactly <code>x</code> (integer) copies of <code>'a'</code> are required.</td></tr>
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								  <tr><td><code>&lt;abba&gt;</code></td><td>The rule called <code>abba</code> is required.</td></tr>
 								</table>
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								Rules are specified by rule name, optionally followed by an _expected_ string, followed by a colon `:`, followed by the definition, and ending in a semicolon `;`. Multiple rules can be specified. The _rule names_ must match the names given to any parsers created by `mpc_new`, otherwise the function will crash.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								规则由规则名称指定，后面可选地跟有_expected_字符串，后跟冒号`:`，后跟定义，最后以分号`;`结尾。可以指定多个规则。_rule names_必须与`mpc_new`创建的任何解析器的名称匹配，否则函数将崩溃。
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								The flags variable is a set of flags `MPCA_LANG_DEFAULT`, `MPCA_LANG_PREDICTIVE`, or `MPCA_LANG_WHITESPACE_SENSITIVE`. For specifying if the language is predictive or whitespace sensitive.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								标志变量是一组标志`MPCA_LANG_DEFAULT`、`MPCA_LONG_PREDICTIVE`或`MPCA_ANG_WHITESPACE_SENSITIVE`。用于指定语言是预测性的还是空格敏感的。
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								Like with the regular expressions, this user input is parsed by existing parts of the _mpc_ library. It provides one of the more powerful features of the library.
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								与正则表达式一样，此用户输入由_mpc_库的现有部分解析。它是库更强大的原因之一。
-												yet More Readme tweaks

											
										
										
											2013-09-26 14:44:44 +01:00
+								* * *
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```c
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								mpc_parser_t *mpca_grammar(int flags, const char *grammar, ...);
-												big update to readme

											
										
										
											2013-09-26 13:15:00 +01:00
+								```
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								This takes in some single right hand side of a rule, as well as a list of any of the parsers referenced, and outputs a parser that does what is specified by the rule. The list of parsers referenced can be terminated with `NULL` to get an error instead of a crash when a parser required is not supplied.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这接收规则的某个右侧，以及引用的任何解析器的列表，并输出一个执行规则指定操作的解析器。当没有提供所需的解析器时，引用的解析器列表可以用`NULL`终止，以获得错误而不是崩溃。
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								* * *
 								```c
-												Update README.md
											
										
										
											2014-04-24 21:24:12 +01:00
+								mpc_err_t *mpca_lang(int flags, const char *lang, ...);
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								```
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								This takes in a full language (zero or more rules) as well as any parsers referred to by either the right or left hand sides. Any parsers specified on the left hand side of any rule will be assigned a parser equivalent to what is specified on the right. On valid user input this returns `NULL`, while if there are any errors in the user input it will return an instance of `mpc_err_t` describing the issues. The list of parsers referenced can be terminated with `NULL` to get an error instead of a crash when a parser required is not supplied.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这需要一个完整的语言（零个或多个规则）以及右侧或左侧引用的任何解析器。在任何规则的左侧指定的任何解析器都将被分配一个与右侧指定的解析器等效的解析器。在有效的用户输入时，这将返回`NULL`，而如果用户输入中有任何错误，它将返回一个描述问题的`mpc_err_t`实例。当没有提供所需的解析器时，引用的解析器列表可以用`NULL`终止，以获得错误而不是崩溃。
-												Various Updates

											
										
										
											2013-10-16 13:53:00 +01:00
+								* * *
 								```c
-												Update README.md
											
										
										
											2014-04-24 21:24:12 +01:00
+								mpc_err_t *mpca_lang_file(int flags, FILE* f, ...);
-												Various Updates

											
										
										
											2013-10-16 13:53:00 +01:00
+								```
 								This reads in the contents of file `f` and inputs it into `mpca_lang`.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这将读取文件`f`的内容并将其输入到`mpca_lang`中。
-												Various Updates

											
										
										
											2013-10-16 13:53:00 +01:00
+								* * *
 								```c
-												Update README.md
											
										
										
											2014-04-24 21:24:12 +01:00
+								mpc_err_t *mpca_lang_contents(int flags, const char *filename, ...);
-												Various Updates

											
										
										
											2013-10-16 13:53:00 +01:00
+								```
 								This opens and reads in the contents of the file given by `filename` and passes it to `mpca_lang`.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								这将打开并读取由`filename`给出的文件内容，并将其传递给`mpca_lang`。
 								Case Study - Tokenizer 案例研究 - 分词器
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								======================
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Another common task we might be interested in doing is tokenizing some block of text (splitting the text into individual elements) and performing some function on each one of these elements as it is read. We can do this with `mpc` too.
 								我们可能感兴趣的另一个常见任务是标记一些文本块（将文本拆分为单个元素），并在读取时对每个元素执行一些功能。我们也可以用mpc来实现这一点。
 								First, we can build a regular expression which parses an individual token. For example if our tokens are identifiers, integers, commas, periods and colons we could build something like this `mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")`.
 								首先，我们可以构建一个解析单个令牌的正则表达式。例如，如果我们的标记是标识符、整数、逗号、句点和冒号，我们可以构建这样的`mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")`。
 								Next we can strip any whitespace, and add a callback function using `mpc_apply` which gets called every time this regex is parsed successfully `mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token)`.
 								接下来，我们可以去掉任何空格，并使用`mpc_apply`添加一个回调函数，每次成功解析此正则表达式时都会调用`mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token)`。
 								Finally we can surround all of this in `mpc_many` to parse it zero or more times. The final code might look something like this:
 								最后，我们可以将所有这些放在“mpc_mony”中，对其进行零次或多次解析。最终的代码可能看起来像这样：
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
 								```c
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								static mpc_val_t *print_token(mpc_val_t *x) {
 								  printf("Token: '%s'\n", (char*)x);
 								  return x;
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								}
 								int main(int argc, char **argv) {
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								  const char *input = "  hello 4352 ,  \n foo.bar   \n\n  test:ing   ";
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								  mpc_parser_t* Tokens = mpc_many(
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
+								    mpcf_all_free,
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								    mpc_apply(mpc_strip(mpc_re("\\s*([a-zA-Z_]+|[0-9]+|,|\\.|:)")), print_token));
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								  mpc_result_t r;
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								  mpc_parse("input", input, Tokens, &r);
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								  mpc_delete(Tokens);
-												Add documentation for ? and {d} in language approach. Fix EOL issues

											
										
										
											2018-12-02 12:34:09 +00:00
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								  return 0;
 								}
 								```
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								Running this program will produce an output something like this:
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								运行此程序将产生如下输出：
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
+								```
-												Added mode option to regex and also changed example from a line reader to a tokenizer.

											
										
										
											2018-10-14 17:20:11 -04:00
+								Token: 'hello'
 								Token: '4352'
 								Token: ','
 								Token: 'foo'
 								Token: '.'
 								Token: 'bar'
 								Token: 'test'
 								Token: ':'
 								Token: 'ing'
 								```
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								By extending the regex we can easily extend this to parse many more types of tokens and quickly and easily build a tokenizer for whatever language we are interested in.
-												Added line-reader example. Changed behaviour of eof on regex to parse either eof or a newline followed be eof (better matchers other regex engines).

											
										
										
											2018-10-13 18:27:42 -04:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								通过扩展正则表达式，我们可以轻松地扩展它来解析更多类型的标记，并快速轻松地为我们感兴趣的任何语言构建标记器。
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Error Reporting 错误报告
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								===============
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Completed refactoring

											
										
										
											2013-11-11 16:56:20 +00:00
+								_mpc_ provides some automatic generation of error messages. These can be enhanced by the user, with use of `mpc_expect`, but many of the defaults should provide both useful and readable. An example of an error message might look something like this:
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								_mpc提供了一些自动生成错误消息的功能。用户可以通过使用`mpc_expect`来增强这些功能，但许多默认值应该既有用又可读。错误消息的示例可能如下：
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
+								```
 								<test>:0:3: error: expected one or more of 'a' or 'd' at 'k'
 								```
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Misc 杂项
-												Added basic optimise function for parsers.

											
										
										
											2015-11-07 16:57:09 +00:00
+								====
 								Here are some other misc functions that mpc provides. These functions are susceptible to change between versions so use them with some care.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								以下是mpc提供的一些其他杂项功能。这些功能很容易在版本之间发生变化，因此请谨慎使用。
-												Added basic optimise function for parsers.

											
										
										
											2015-11-07 16:57:09 +00:00
+								* * *
 								```c
 								void mpc_print(mpc_parser_t *p);
 								```
 								Prints out a parser in some weird format. This is generally used for debugging so don't expect to be able to understand the output right away without looking at the source code a little bit.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								以某种奇怪的格式打印出解析器。这通常用于调试，所以不要指望在不看一点源代码的情况下就能立即理解输出。
-												Added basic optimise function for parsers.

											
										
										
											2015-11-07 16:57:09 +00:00
+								* * *
 								```c
 								void mpc_stats(mpc_parser_t *p);
 								```
 								Prints out some basic stats about a parser. Again used for debugging and optimisation.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								打印出一些关于解析器的基本统计数据。再次用于调试和优化。
-												Added basic optimise function for parsers.

											
										
										
											2015-11-07 16:57:09 +00:00
+								* * *
 								```c
 								void mpc_optimise(mpc_parser_t *p);
 								```
 								Performs some basic optimisations on a parser to reduce it's size and increase its running speed.
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								对解析器执行一些基本优化，以减小其大小并提高其运行速度。
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
-												翻译 readme.md

											
										
										
											2024-10-28 16:01:33 +08:00
+								Limitations & FAQ 限制和常见问题
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								=================
-												Update README.md
											
										
										
											2021-03-27 20:40:08 -04:00
+								### I'm getting namespace issues due to `libmpc`, what can I do?
 								There is a re-naming of this project to `pcq` hosted on the [pcq branch](https://github.com/orangeduck/mpc/tree/pcq) which should be usable without namespace issues.
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								### Does _mpc_ support Unicode?
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
+								_mpc_ Only supports ASCII. Sorry! Writing a parser library that supports Unicode is pretty difficult. I welcome contributions!
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												Refactored boundary stuff into more general anchor

											
										
										
											2014-04-16 23:20:52 +01:00
+								### Is _mpc_ binary safe?
-												Readme updates

											
										
										
											2014-04-23 11:56:24 +01:00
+								No. Sorry! Including NULL characters in a string or a file will probably break it. Avoid this if possible.
-												Refactored boundary stuff into more general anchor

											
										
										
											2014-04-16 23:20:52 +01:00
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								### The Parser is going into an infinite loop!
 								While it is certainly possible there is an issue with _mpc_, it is probably the case that your grammar contains _left recursion_. This is something _mpc_ cannot deal with. _Left recursion_ is when a rule directly or indirectly references itself on the left hand side of a derivation. For example consider this left recursive grammar intended to parse an expression.
 								```
 								expr : <expr> '+' (<expr> | <int> | <string>);
 								```
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								When the rule `expr` is called, it looks the first rule on the left. This happens to be the rule `expr` again. So again it looks for the first rule on the left. Which is `expr` again. And so on. To avoid left recursion this can be rewritten (for example) as the following. Note that rewriting as follows also changes the operator associativity.
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
 								```
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								value : <int> | <string> ;
 								expr  : <value> ('+' <expr>)* ;
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								```
 								Avoiding left recursion can be tricky, but is easy once you get a feel for it. For more information you can look on [wikipedia](http://en.wikipedia.org/wiki/Left_recursion) which covers some common techniques and more examples. Possibly in the future _mpc_ will support functionality to warn the user or re-write grammars which contain left recursion, but it wont for now.
 								### Backtracking isn't working!
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								_mpc_ supports backtracking, but it may not work as you expect. It isn't a silver bullet, and you still must structure your grammar to be unambiguous. To demonstrate this behaviour examine the following erroneous grammar, intended to parse either a C style identifier, or a C style function call.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								```
 								factor : <ident>
 								       | <ident> '('  <expr>? (',' <expr>)* ')' ;
 								```
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								This grammar will never correctly parse a function call because it will always first succeed parsing the initial identifier and return a factor. At this point it will encounter the parenthesis of the function call, give up, and throw an error. Even if it were to try and parse a factor again on this failure it would never reach the correct function call option because it always tries the other options first, and always succeeds with the identifier.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								The solution to this is to always structure grammars with the most specific clause first, and more general clauses afterwards. This is the natural technique used for avoiding left-recursive grammars and unambiguity, so is a good habit to get into anyway.
 								Now the parser will try to match a function first, and if this fails backtrack and try to match just an identifier.
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								```
 								factor : <ident> '('  <expr>? (',' <expr>)* ')'
 								       | <ident> ;
 								```
-												Rephrased section about backtracking

											
										
										
											2014-04-23 13:06:10 +01:00
+								An alternative, and better option is to remove the ambiguity completely by factoring out the first identifier. This is better because it removes any need for backtracking at all! Now the grammar is predictive!
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
 								```
 								factor : <ident> ('('  <expr>? (',' <expr>)* ')')? ;
 								```
-												Readme updates

											
										
										
											2014-02-12 20:51:45 +00:00
+								### How can I avoid the maximum string literal length?
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								Some compilers limit the maximum length of string literals. If you have a huge language string in the source file to be passed into `mpca_lang` you might encounter this. The ANSI standard says that 509 is the maximum length allowed for a string literal. Most compilers support greater than this. Visual Studio supports up to 2048 characters, while gcc allocates memory dynamically and so has no real limit.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Added flags to language specifiction. Added optional expect string to language specification. Added some exaple grammars for testing and demos

											
										
										
											2014-01-26 11:25:50 +00:00
+								There are a couple of ways to overcome this issue if it arises. You could instead use `mpca_lang_contents` and load the language from file or you could use a string literal for each line and let the preprocessor automatically concatenate them together, avoiding the limit. The final option is to upgrade your compiler. In C99 this limit has been increased to 4095.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								### The automatic tags in the AST are annoying!
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								When parsing from a grammar, the abstract syntax tree is tagged with different tags for each primitive type it encounters. For example a regular expression will be automatically tagged as `regex`. Character literals as `char` and strings as `string`. This is to help people wondering exactly how they might need to convert the node contents.
-												WIP conversion to while loop

											
										
										
											2013-10-04 17:58:27 +01:00
-												Initial commit for recording parse state in ast

											
										
										
											2014-04-15 16:04:07 +01:00
+								If you have a rule in your grammar called `string`, `char` or `regex`, you may encounter some confusion. This is because nodes will be tagged with (for example) `string` _either_ if they are a string primitive, _or_ if they were parsed via your `string` rule. If you are detecting node type using something like `strstr`, in this situation it might break. One solution to this is to always check that `string` is the innermost tag to test for string primitives, or to rename your rule called `string` to something that doesn't conflict.
-												First commit

											
										
										
											2013-09-19 20:57:40 +01:00
-												Fixed bug in state tagging. Updated examples to use concatinated preprocessor strings

											
										
										
											2014-04-16 17:06:16 +01:00
+								Yes it is annoying but its probably not going to change!