List of Regular Expressions

Term

Representation/Use

Any character

The given character, unless it is a regular expression metacharacter (listed in the table that follows).

.

Any single character except a line break or a paragraph break. For example, the search term "eg.n" matches both "egin" and "egon".

^

The beginning of a paragraph or cell. Special objects such as empty fields or character-anchored frames at the beginning of a paragraph are ignored. Example: "^Peter" matches the word "Peter" only when it is the first word of a paragraph.

$

The end of a paragraph or cell. Special objects such as empty fields or character-anchored frames at the end of a paragraph are ignored. Example: "Peter$" matches only when the word "Peter" is the last word of a paragraph. Note that "Peter" cannot be followed by a period.

$ on its own matches the end of a paragraph. This makes it possible to search for and replace paragraph breaks.

*

Zero or more of the regular expression term immediately preceding it. For example, "Ab*c" matches "Ac", "Abc", "Abbc", "Abbbc", and so on.

+

One or more of the regular expression term immediately preceding it. For example, "AX.+4" finds "AXx4", but not "AX4".

The longest possible string that matches this regular expression in a paragraph is always matched. If the paragraph contains the string "AX 4 AX4", the entire passage is highlighted.
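The longest-match behaviour can be checked with a small sketch. It uses Python's `re` module as a stand-in for the search engine described here, which is an assumption; both treat "+" as greedy by default.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
paragraph = "AX 4 AX4"

# ".+" is greedy, so the match runs to the last "4" in the paragraph.
longest = re.search(r"AX.+4", paragraph).group(0)
print(longest)

# "AX4" has nothing between "AX" and "4", so ".+" has no character to consume.
no_match = re.search(r"AX.+4", "AX4")
print(no_match)
```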

?

Zero or one of the regular expression term immediately preceding it. For example, "Texts?" matches "Text" and "Texts", and "x(ab|c)?y" finds "xy", "xaby", or "xcy".
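A minimal sketch of the optional-term examples, again assuming Python's `re` module behaves like the engine described here for "?". The word lists are illustrative only.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
plural = re.compile(r"Texts?")
optional_group = re.compile(r"x(ab|c)?y")

# fullmatch requires the whole candidate to match the expression.
plural_hits = [w for w in ["Text", "Texts", "Textss"] if plural.fullmatch(w)]
group_hits = [w for w in ["xy", "xaby", "xcy", "xabcy"] if optional_group.fullmatch(w)]

print(plural_hits)
print(group_hits)
```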

\

The special character that follows it is interpreted as a normal character and not as a regular expression meta character (except for the combinations "\n", "\t", "\b", "\>" and "\<"). For example, "tree\." matches "tree.", not "treed" or "trees".

\n

A line break that was inserted with the Shift+Enter key combination when in the Find text box.

A paragraph break that can be entered with the Enter or Return key when in the Replace text box.

To change line breaks into paragraph breaks, enter \n in both the Find and Replace boxes, and then perform a search and replace.

\t

A tab character. It can also be used in the Replace box.

\b

A word boundary. For example, "\betxe" matches "etxeko" and "etxetik" but not "gaztetxe", whereas "etxe\b" matches "gaztetxe" and "etxe" but not "etxeberri".

Note that this form replaces the obsolete forms "\>" (match end of word) and "\<" (match start of word), although these still work for now.
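The word-boundary examples can be reproduced with the sketch below. It assumes Python's `re` module treats "\b" the same way as the engine described here; the word list is taken from the examples above.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
words = ["etxeko", "etxetik", "gaztetxe", "etxe", "etxeberri"]

# "\betxe": "etxe" must begin at a word boundary.
starts_with = [w for w in words if re.search(r"\betxe", w)]

# "etxe\b": "etxe" must end at a word boundary.
ends_with = [w for w in words if re.search(r"etxe\b", w)]

print(starts_with)
print(ends_with)
```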

^$

Finds an empty paragraph.

^.

Finds the first character of a paragraph.

& or $0

Adds the string that was found by the search criteria in the Find box to the term in the Replace box when you make a replacement.

For example, if you enter "window" in the Find box and "&frame" in the Replace box, the word "window" is replaced with "windowframe".

You can also enter an "&" in the Replace box to modify the Attributes or the Format of the string found by the search criteria.
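The "whole match" reference can be illustrated outside the dialog with a short sketch. Note the assumption: Python's `re.sub` does not accept "&" or "$0" in the replacement and writes the whole match as `\g<0>` instead; the sample sentence is invented for illustration.

```python
import re

# Assumption: Python's re.sub stands in for the Find & Replace dialog.
# Python spells the "whole match" reference \g<0> rather than & or $0.
sentence = "open the window"
replaced = re.sub(r"window", r"\g<0>frame", sentence)
print(replaced)
```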

[...]

Any single occurrence of any one of the characters that are between the brackets. For example: "[abc123]" matches the characters ‘a’, ‘b’, ‘c’, ‘1’, ‘2’ and ‘3’. "[a-e]" matches single occurrences of the characters a through e, inclusive (the range must be specified with the character having the smallest Unicode code number first). "[a-eh-x]" matches any single occurrence of the characters that are in the ranges ‘a’ through ‘e’ and ‘h’ through ‘x’.

[^...]

Any single occurrence of a character, including Tab, Space and Line Break characters, that is not in the list of characters specified. Inclusive ranges are permitted. For example, "[^a-syz]" matches all characters not in the inclusive range ‘a’ through ‘s’ and not the characters ‘y’ or ‘z’.
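A small sketch of positive and negated character sets, assuming Python's `re` module handles bracket expressions the same way as the engine described here. The test characters are chosen for illustration.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
two_ranges = re.compile(r"[a-eh-x]")
negated = re.compile(r"[^a-syz]")

# Characters inside either range match; "f", "g" and "y" fall outside both.
in_ranges = [ch for ch in "aehxfgy" if two_ranges.fullmatch(ch)]

# The negated set also accepts tab, space and line break characters.
not_listed = [ch for ch in "asyztx\t \n" if negated.fullmatch(ch)]

print(in_ranges)
print(not_listed)
```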

\uXXXX

\UXXXXXXXX

The character represented by the four-digit hexadecimal Unicode code (XXXX).

The character represented by the eight-digit hexadecimal Unicode code (XXXXXXXX).

For certain symbol fonts the Unicode code for special characters may depend on the font in use. The Unicode codes can be viewed by choosing Insert - Special Character.
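The two Unicode escape forms can be tried with the sketch below. Python's `re` module happens to accept the same "\uXXXX" and "\UXXXXXXXX" escapes in string patterns; using it as a stand-in for the engine described here is an assumption, and the sample strings are illustrative.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
# Four-digit form: U+00E9, LATIN SMALL LETTER E WITH ACUTE.
e_acute = re.search(r"\u00E9", "caf\u00e9")

# Eight-digit form: U+0001F600, a code point outside the four-digit range.
face = re.search(r"\U0001F600", "smile: \U0001F600")

print(e_acute.start(), face.start())
```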

|

The infix operator delimiting alternatives. Matches the term preceding the "|" or the term following the "|". For example, "this|that" matches occurrences of both "this" and "that".

{N}

The post-fix repetition operator that specifies an exact number of occurrences ("N") of the regular expression term immediately preceding it must be present for a match to occur. For example, "tre{2}" matches "tree".

{N,M}

The post-fix repetition operator that specifies a range (minimum of "N" to a maximum of "M") of occurrences of the regular expression term immediately preceding it that can be present for a match to occur. For example, "tre{1,2}" matches "tre" and "tree".

{N,}

The post-fix repetition operator that specifies a range (minimum "N" to an unspecified maximum) of occurrences of the regular expression term immediately preceding it that can be present for a match to occur. (The maximum number of occurrences is limited only by the size of the document). For example, "tre{2,}" matches "tree", "treee", and "treeeee".
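The three repetition forms can be compared side by side. This is a sketch under the assumption that Python's `re` module interprets "{N}", "{N,M}" and "{N,}" as the engine described here does; the candidate list is illustrative.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
candidates = ["tr", "tre", "tree", "treee", "treeeee"]

def whole_matches(expression):
    """Return the candidates that the expression matches in full."""
    return [w for w in candidates if re.fullmatch(expression, w)]

exactly_two = whole_matches(r"tre{2}")     # exactly two "e"
one_or_two = whole_matches(r"tre{1,2}")    # one or two "e"
two_or_more = whole_matches(r"tre{2,}")    # at least two "e"

print(exactly_two, one_or_two, two_or_more)
```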

(...)

The grouping construct that serves three purposes.

  1. To enclose a set of ‘|’ alternatives. For example, the regular expression "b(oo|ac)k" matches both "book" and "back".

  2. To group terms in a complex expression to be operated on by the post-fix operators "*", "+" and "?" along with the post-fix repetition operators. For example, the regular expression "a(bc)?d" matches both "ad" and "abcd" in a search, and the regular expression "M(iss){2}ippi" matches "Mississippi".

  3. To record the matched sub string inside the parentheses as a reference for later use in the Find box using the "\n" construct or in the Replace box using the "$n" construct, where the reference to the first matched sub string in the current expression in the Find box is represented by "\1" in the Find box and by "$1" in the Replace box, the reference to the second matched sub string by "\2" and "$2" respectively, and so on.

For example, the regular expression "(890)7\1\1" matches "8907890890".

With the regular expression "\b(fruit|truth)\b" in the Find box and the regular expression "$1ful" in the Replace box, occurrences of the words "fruit" and "truth" can be replaced with the words "fruitful" and "truthful" respectively without affecting the words "fruitfully" and "truthfully".
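The grouping and back-reference examples can be sketched as follows. Two assumptions apply: Python's `re` module stands in for the engine described here, and Python writes a reference in the replacement as "\1" where the Replace box uses "$1". The sample sentence is invented for illustration.

```python
import re

# Assumption: Python's re module stands in for the engine described above.

# Grouping with a repetition operator.
grouped = re.fullmatch(r"M(iss){2}ippi", "Mississippi")

# "\1" in the search expression repeats what the first group matched.
back_reference = re.fullmatch(r"(890)7\1\1", "8907890890")

# Python spells the replacement reference \1; the Replace box uses $1.
text = "fruit truth fruitfully truthfully"
replaced = re.sub(r"\b(fruit|truth)\b", r"\1ful", text)

print(bool(grouped), bool(back_reference))
print(replaced)
```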

[:alpha:]

Represents an alphabetic character. Use [:alpha:]+ to find one or more of them.

[:digit:]

Represents a decimal digit. Use [:digit:]+ to find one or more of them.

[:alnum:]

Represents an alphanumeric character ([:alpha:] and [:digit:]).

[:space:]

Represents a space character (but not other whitespace characters).

[:print:]

Represents a printable character.

[:cntrl:]

Represents a nonprinting character.

[:lower:]

Represents a lowercase character if Match case is selected in Options.

[:upper:]

Represents an uppercase character if Match case is selected in Options.


For the full list of supported metacharacters and syntax, see the ICU regular expressions documentation.

Note that currently all named character class terms, [:alpha:] through [:upper:], must be enclosed in parentheses when used in a regular expression; see the examples that follow.

Regular expression terms can be combined to form complex and sophisticated regular expressions for searches, as shown in the following examples.

Examples

Expression

Meaning

^$

An empty paragraph.

^ specifies that the match must be at the start of a paragraph,

$ specifies that the matched string must be followed by a paragraph marker or the end of a cell.

^.

The first character of a paragraph.

^ specifies that the match must be at the start of a paragraph,

. specifies any single character.

e([:digit:])?

Matches "e" by itself or an "e" followed by one digit.

e specifies the character "e",

[:digit:] specifies any decimal digit,

? specifies zero or one occurrences of [:digit:].

^([:digit:])$

Matches any paragraph or cell that contains exactly one digit.

^ specifies that the match must be at the start of a paragraph,

[:digit:] specifies any decimal digit,

$ specifies that the matched string must be followed by a paragraph marker or the end of a cell.

^([:digit:]){3}$

Matches any paragraph or cell that contains exactly three digits.

^ specifies that the match must be at the start of a paragraph,

[:digit:] specifies any decimal digit,

{3} specifies that [:digit:] must occur three times,

$ specifies that the matched string must be followed by a paragraph marker or the end of a cell.
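The three-digit example can be approximated with the sketch below. It rests on two assumptions: Python's `re` module stands in for the engine described here, and "\d" is used in place of the named class [:digit:], which Python does not accept in this form. Each string in the hypothetical `paragraphs` list plays the role of one paragraph or cell.

```python
import re

# Assumption: Python's re module stands in for the engine described above,
# and "\d" substitutes for the named class [:digit:].
paragraphs = ["123", "1234", "12a", "007", ""]

# ^ anchors at the start, {3} requires three digits, $ anchors at the end.
three_digits = [p for p in paragraphs if re.search(r"^\d{3}$", p)]
print(three_digits)
```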

\bconst(itu|ruc)tion\b

Matches the words "constitution" and "construction" but not the word "constitutional".

\b specifies that the match must begin at a word boundary,

const specifies the characters "const",

( starts the group,

itu specifies the characters "itu",

| specifies the alternative,

ruc specifies the characters "ruc",

) ends the group,

tion specifies the characters "tion",

\b specifies that the match must end at a word boundary.
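The combined expression can be tested against running text with a short sketch, assuming Python's `re` module agrees with the engine described here on "\b", grouping and alternation. The sample sentence is invented for illustration.

```python
import re

# Assumption: Python's re module stands in for the engine described above.
expression = re.compile(r"\bconst(itu|ruc)tion\b")

sentence = "The constitution governs construction, not constitutional law."

# "constitutional" is skipped: after "constitution" comes "a", not a word boundary.
found = [m.group(0) for m in expression.finditer(sentence)]
print(found)
```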