wp_spaces_regexp()

Returns the regexp for common whitespace characters.


Description

By default, the pattern matches spaces, carriage returns, new lines, tabs, the &nbsp; entity, and the UTF-8 encoded non-breaking space (bytes 0xC2 0xA0). It is designed to replace the PCRE \s sequence, which https://core.trac.wordpress.org/ticket/22692 found to be unreliable because it inconsistently includes the 0xA0 byte.


Return

(string) The spaces regexp.
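
The returned string is a pattern fragment without delimiters, and it contains top-level alternation, so it usually needs to be wrapped in a group before a quantifier is applied. The following is a minimal sketch of one way to use it, not code from core: example_collapse_spaces() is a hypothetical helper, and the function_exists() fallback only stands in for the default pattern so the snippet can run outside WordPress.

```php
<?php
// Stand-in for running this sketch outside WordPress (assumption: it mirrors
// the unfiltered default pattern). Inside WordPress the core function is used.
if ( ! function_exists( 'wp_spaces_regexp' ) ) {
	function wp_spaces_regexp() {
		return '[\r\n\t ]|\xC2\xA0|&nbsp;';
	}
}

// Hypothetical helper: collapse any run of "common whitespace" to one space.
function example_collapse_spaces( $text ) {
	$spaces = wp_spaces_regexp();

	// Wrap the fragment in a non-capturing group so that "+" applies to the
	// whole alternation rather than only to its last branch.
	return preg_replace( '/(?:' . $spaces . ')+/', ' ', $text );
}

echo example_collapse_spaces( "Hello&nbsp;\t\xC2\xA0 world\r\nagain" );
// prints "Hello world again"
```

The "/" delimiter is safe for the default pattern; a filtered pattern could contain that character, in which case a different delimiter would be needed.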


Source

File: wp-includes/formatting.php

function wp_spaces_regexp() {
	static $spaces = '';

	if ( empty( $spaces ) ) {
		/**
		 * Filters the regexp for common whitespace characters.
		 *
		 * This string is substituted for the \s sequence as needed in regular
		 * expressions. For websites not written in English, different characters
		 * may represent whitespace. For websites not encoded in UTF-8, the 0xC2 0xA0
		 * sequence may not be in use.
		 *
		 * @since WP-4.0.0
		 *
		 * @param string $spaces Regexp pattern for matching common whitespace characters.
		 */
		$spaces = apply_filters( 'wp_spaces_regexp', '[\r\n\t ]|\xC2\xA0|&nbsp;' );
	}

	return $spaces;
}
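
As a hedged illustration of the wp_spaces_regexp filter, the sketch below appends one more alternative to the pattern. The callback name example_add_ideographic_space() and the choice of the ideographic space (U+3000, UTF-8 bytes 0xE3 0x80 0x80) are assumptions made for this example only. Because the function caches its result in a static variable, the filter has to be registered before wp_spaces_regexp() is first called.

```php
<?php
// Hypothetical callback: extend the whitespace pattern with the ideographic
// space, written as PCRE byte escapes to match the style of the default.
function example_add_ideographic_space( $spaces ) {
	return $spaces . '|\xE3\x80\x80';
}

// Register only when WordPress is loaded, so the sketch also runs standalone.
if ( function_exists( 'add_filter' ) ) {
	add_filter( 'wp_spaces_regexp', 'example_add_ideographic_space' );
}
```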


Changelog

Version Description
WP-4.0.0 Introduced.