String prcessing routines
use NetSDS::Util::String qw(); # Read from standard input my $string = <STDIN>; # Encode string to internal structure $string = string_encode($tring);
\*(C`NetSDS::Util::String\*(C' module contains functions may be used to quickly solve string processing tasks like parsing, recoding, formatting.
As in other NetSDS modules standard encoding is \s-1UTF-8\s0.
By default this function treat first argument as byte string in \s-1UTF-8\s0 and return it's internal Unicode representation. In case of external character set isn't \s-1UTF-8\s0 it should be added as second argument of function. # Convert UTF-8 byte string to internal Unicode representation $uni_string = str_encode($byte_string);
# Convert KOI8-U byte string to internal $uni_string = str_encode($koi8_string, 'KOI8-U'); After \*(C`str_encode()\*(C' it's possible to process this string correctly including regular expressions. All characters will be understood as \s-1UTF-8\s0 symbols instead of byte sequences. By default this function treat first argument as string in internal \s-1UTF-8\s0 and return it in byte string (external) representation. In case of external character set isn't \s-1UTF-8\s0 it should be added as second argument of function. # Get UTF-8 byte string from internal Unicode representation $byte_string = str_decode($uni_string);
# Convert to KOI8-U byte string from internal Unicode $koi8_string = str_encode($uni_string, 'KOI8-U'); It's recommended to use \*(C`str_encode()\*(C' when preparing data for communication with external systems (especially networking). Translate string between different encodings. If target encoding is not set \s-1UTF-8\s0 used as default one.
$orig_str = " string with spaces "; $new_str = str_trim($orig_str);
# Output: "string with spaces" print $new_str;
This function is similar to \*(C`str_trim()\*(C' except of it removes only leading space characters and leave trailing ones.
This function is similar to \*(C`str_trim()\*(C' except of it removes only trailing space characters and leave leading ones.
Function is similar to \*(C`str_trim()\*(C' but also changes all spacing chains inside string to single spaces.
If pass undef - return undef. If pass '' - return ''. Examples: # returns 'getValue' str_camelize( 'get_value' )
# returns 'addUserAction' str_camelize( 'ADD_User_actION' )
If pass undef - return undef. If pass '' - return ''. Examples: # returns 'get_value' str_decamelize( 'getValue' )
None yet
Unknown yet
Implement examples and tests.
Encode, perlunicode
Valentyn Solomko <[email protected]>
Michael Bochkaryov <[email protected]>