3

The Full IP Core Network

3.1. Fixed mobile convergence

Fixed mobile convergence (FMC) is set of a technology bringing seamless connectivity between fixed and wireless communication networks.

FMC is the answer to the desire of consumers to be connected anytime anywhere and from any device.

This requires:

fixed and wireless broadband;
IP Multimedia Subsystem (IMS);
Session Initiation Protocol (SIP):
- Voice over Internet Protocol (VoIP),
- common service delivery platform.

FMC, to be seamless to the end user, involves personalization for the user services.

FMC includes:

Unlicenced Mobile Access (UMA), with UMA-capable devices;
SIP-based services;
seamless roaming between mobile accesses (cellular, WiFi, etc.).

3.2. IP multimedia subsystem

3.2.1. General description of IMS

3GPP has taken IMS (Internet Multimedia Subsystem) as the core of its telecommunication standards. This trend has been followed by ETSI and other standardization bodies such as ANSI.

Figure 3.1. IMS

image

IMS was introduced by 3GPP with release 5 and developed to date. It is also developed by ETSI TISPAN (for fixed telecommunication). IMS is based on IETF protocols. OMA specify the service layer application.

Although originally specified by 3GPP, a number of other organizations around the world support IMS. In addition to the IETF, which specifies key protocols such as SIP, and the OMA, which specifies end-to-end service-layer applications, other organizations supporting IMS include the GSM Association (GSMA), the ETSI, CableLabs, 3GPP2, the Parlay Group, the International Telecommunication Union (ITU), the American National Standards Institute (ANSI), the TISPAN and the Java Community Process (JCP).

According to an ABI Research analyst, an advantage of IMS is that it enables rapid development and deployment of new services and is essential to the success of mobile and fixed operators who are losing revenue from traditional sources. It is forecast by ABI that mobile operators will reap $300 billion revenues over five years ending in 2013 due to new services delivered with IMS.

The benefits of using IMS include handling all communication in the packet domain, tighter integration with the Internet and a lower cost infrastructure that is based on IP building blocks used for both voice and data services. This allows operators to potentially deliver data and voice services at lower cost, in turn providing these services at lower prices and further driving demand and usage.

IMS applications can reside either in the operator’s network or in third-party networks.

IMS separates the access network from the service layer. The horizontal control layer isolates the access from the service. It is a common horizontal control layer. Logically, services do not need to have their own control functions.

IMS uses IETF protocols wherever possible, especially SIP. But other technologies apply, in particular soft switches or Generic Access Network.

However in implementation this does not necessarily map into greater reduced cost and complexity. IMS is unable to master the OTT mechanisms, which provide access to contents and contacts outside the control of wireless/fixed operators.

3.2.2. Session Initiation Protocol

SIP is the core networking protocol used within the IMS and is one of the more widely known examples of a session control protocol. Session control refers to the process used to create, modify and terminate IP-based communication sessions, and a session can include two-way voice communication, multimedia (text, audio or video) conference collaboration, instant messaging, application sharing and other contemplated but not yet fully specified services. Session control is accomplished through signaling between various network elements and endpoints using a session control protocol.

Although SIP is the most widely known session control protocol, SIP has a major limitation that is of great importance to any GSMUMTS operator. It does not provide any method of directly inter-working with the Public Switched Telephone Network (PSTN) because it was not created with the intention of it being fully backwards-compatible with legacy PSTN signaling mechanisms.

In addition to SIP, other examples of session control protocols include BICC, SIP-I and SIP-T.

BICC, or Bearer Independent Call Control, is the protocol standardized in the 3GPP Release 4 architecture and deployed in some networks today. BICC, however, is not an optimal choice for ongoing evolution because it has been limited to, and is predicted to remain limited to, operation within a GSM-UMTS context. BICC does not address domains beyond GSM-UMTS such as LTE; as a result, it does not automatically offer the future level of flexibility of continued development and evolution that would accompany the SIP with ISUP encapsulation variants (i.e. either SIP-T, SIP for Telephones or SIP-I, SIP with ISUP encapsulation).

With a technical analysis of capabilities existing within the two SIP technologies with ISUP encapsulation variants, 4G Americas recommends SIP-I as the direction for evolution. There are four areas where SIP-I is better suited for a GSM-UMTS environment than SIP-T:

assumptions regarding the trust and security environment;
encapsulation procedures and message mapping;
Support of Request for Comments (RFCs);
user plane interoperability.

3.2.3. IMS components and interfaces

The IMS is an architectural framework for delivering IP multimedia services. It was originally designed by 3GPP as a part of the vision for evolving mobile networks beyond GSM. Its original formulation appears in Release 5. This vision was later updated by 3GPP and TISPAN. In particular, IMS applies to fixed lines standards (3GPP Release 7 and TISPAN R1.1), and further updates.

Since it is becoming increasingly easier to access content and contacts using mechanisms outside the control of traditional wireless/fixed operators, the interest of IMS is being challenged.

Figure 3.2. IMS wide scope

image

Figure 3.3. IMS functions

image

The IP multimedia CN subsystem is a collection of different functions, linked by standardized interfaces, which grouped form one IMS administrative network. A function is not a node (hardware box). An implementer is free to combine two functions in one node, or to split a single function into two or more nodes. Each node can also be present multiple times in a single network, for dimensioning, load balancing or organizational issues.

3.2.3.1. Access network

The user can connect to IMS in various ways, most of which use the standard IP. IMS terminals (such as mobile phones, personal digital assistants (PDAs) and computers) can register directly on IMS, even when they are roaming in another network or country (the visited network). The only requirement is that they can use IP and run SIP user agents (SIP UAs). Fixed access (e.g., Digital Subscriber Line (DSL), cable modems, Ethernet), mobile access and wireless access (e.g. WiMAX) are all supported. Other phone systems like plain old telephone service (POTS—the old analog telephones), H.323 and non IMS-compatible VoIP systems, are supported through gateways.

3.2.3.2. Core network

The Home Subscriber Server (HSS), or User Profile Server Function (UPSF), is a master user database that supports the IMS network entities that actually handle calls. It contains the subscription-related information (subscriber profiles), performs authentication and authorization of the user, and can provide information about the subscribers location and IP information. It is similar to the GSM Home Location Register (HLR) and Authentication Centre (AuC).

A Subscriber Location Function (SLF) is needed to map user addresses when multiple HSSs are used.

Various User identities may be associated with IMS: IP Multimedia Private Identity (IMPI), IP Multimedia Public Identity (IMPU), Globally Routable User Agent URI (GRUU), Wildcarded Public User Identity. Both IMPI and IMPU are not phone numbers or other series of digits, but Uniform Resource Identifiers (URIs), that can be digits (a Tel. URI, such as Tel.: +1-555-123-4567) or alphanumeric identifiers (a SIP URI, such as sip:[email protected]).

The IMPI is a unique permanently allocated global identity assigned by the home network operator, and is used, for example, for Registration, Authorization, Administration and Accounting purposes. Every IMS user will have one IMPI.

The IMPU is used by any user for requesting communications to other users (e.g. this might be included on a business card). There can be multiple IMPU per IMPI. The IMPU can also be shared with another phone, so that both can be reached with the same identity (for example, a single phone-number for an entire family).

GRUU is an identity that identifies a unique combination of IMPU and UE instance. There are two types of GRUU: Public-GRUU (P-GRUU) and Temporary GRUU (T-GRUU):

P-GRUU reveal the IMPU and are very long lived.
T-GRUU do not reveal the IMPU and are valid until the contact is explicitly de-registered or the current registration expires.

A wildcarded Public User Identity expresses a set of IMPU grouped together.

The HSS subscriber database contains the IMPU, IMPI, IMSI, MSISDN, subscriber service profiles, service triggers and other information.

CSCF – Call Session Control Function

The Call Session Control Function (CSCF) is a collection of functional capabilities that play an essential role in the IMS CN. The CSCF is responsible for the signaling controlling the communication of IMS User Equipment (UE) with IMS enhanced services across different network accesses and domains. The CSCF controls the session establishment and teardown, as well as user authentication, network security and Quality of Service (QoS).

Several roles of SIP servers or proxies, collectively called CSCF, are used to process SIP signaling packets in the IMS.

A Proxy-CSCF (P-CSCF) is a SIP proxy that is the first point of contact for the IMS terminal and can be located either in the visited network (in full IMS networks) or in the home network (when the visited network is not IMS compliant yet). Some networks may use a Session Border Controller (SBC) for this function. The P-CSCF is at its core a specialized SBC for the user–network interface which not only protects the network, but also the IMS terminal. The use of an additional SBC between the IMS terminal and the P-CSCF is unnecessary and infeasible due to the signaling being encrypted on this leg. The terminal discovers its P-CSCF with either DHCP, or it may be configured (e.g. during initial provisioning or via a 3GPP IMS Management Object (MO)) or in the ISIM or assigned in the PDP Context (e.g. General Packet Radio Service (GPRS)).
It is assigned to an IMS terminal before registration, and does not change for the duration of the registration.
It sits on the path of all signaling, and can inspect every signal; the IMS terminal must ignore any other unencrypted signaling.
It provides subscriber authentication and may establish an IPsec or TLS security association with the IMS terminal. This prevents spoofing attacks and replay attacks and protects the privacy of the subscriber.
It inspects the signaling and ensures that the IMS terminals do not misbehave (e.g. change normal signaling routes, do not obey home network’s routing policy).
It can also compress and decompress SIP messages using SigComp, which reduces the round-trip over slow radio links.
It may include a Policy Decision Function (PDF), which authorizes media plane resources, for example, QoS over the media plane. It is used for policy control, bandwidth management, etc. The PDF can also be a separate function.
It also generates charging records.

To summarize, the P-CSCF is the first point of contact between the UEs and the IMS network. Acting as a SIP proxy, all the SIP requests and responses from/to UEs traverse the P-CSCF. The P-CSCF may be located in either the user’s home network or in the visited network for handling roaming.

The P-CSCF supports several important functions:

validates the correctness of SIP messages with IMS UEs according to SIP standard rules;
ensures the security of the messages between UEs and the IMS network using IPsec or TLS security associations;
authenticates and asserts the identity of the UE;
compresses the messages ensuring the efficient transmission of SIP messages over narrowband channels.

The P-CSCF may support Policy Enforcement capabilities for authorizing media plane resources, bandwidth and QoS management.

In addition, the P-CSCF can also generate charging information to be collected by charging network nodes.

A Serving-CSCF (S-CSCF) is the central node of the signaling plane. It is a SIP server, but performs session control too. It is always located in the home network. It uses Diameter Cx and Dx interfaces to the HSS to download user profiles and upload user-to-S-CSCF associations (the user profile is only cached locally for processing reasons only and is not changed). All necessary subscriber profile information is loaded from the HSS.

It handles SIP registrations, which allows it to bind the user location (e.g., the IP address of the terminal) and the SIP address;
It sits on the path of all signaling messages of the locally registered users, and can inspect every message.
It decides to which application server(s) the SIP message will be forwarded, in order to provide their services.
It provides routing services, typically using Electronic Numbering (ENUM) lookups.
It enforces the policy of the network operator.
There can be multiple S-CSCFs in the network for load distribution and high availability reasons. It is the HSS that assigns the S-CSCF to a user, when it is queried by the I-CSCF. There are multiple options for this purpose, including a mandatory/optional capabilities to be matched between subscribers and S-CSCFs.

To summarize, the S-CSCF is a central function of the signaling plane in the IMS CN. A S-CSCF node acts as a SIP registrar, and in some cases as a SIP redirect server. It is responsible for processing the location registration of each UE, user authentication and call routing and processing. Similar to the I-CSCF, the S-CSCF supports Diameter Cx and Dx interfaces to the HSS to download the authentication information and user profile of the registering UEs from the HSS for authentication purpose. All of the SIP signaling from/to the IMS UEs traverses their serving S-CSCF allocated during the registration process. The S-CSCF also provides SIP message routing and services triggering. It also enforces the policy of the network operator and keep users from performing unauthorized operations.

The S-CSCF is always located in the home network. A number of S-CSCFs may be deployed for the sake of scalability and redundancy.

An Interrogating-CSCF (I-CSCF) is another SIP function located at the edge of an administrative domain. Its IP address is published in the Domain Name System (DNS) of the domain (using NAPTR and SRV type of DNS records), so that remote servers can find it, and use it as a forwarding point (e.g. registering) for SIP packets to this domain.

It queries the HSS to retrieve the address of the S-CSCF and assign it to a user performing SIP registration.
It also forwards SIP request or response to the S-CSCF.
Up to Release 6 it can also be used to hide the internal network from the outside world (encrypting parts of the SIP message), in which case it is called a Topology Hiding Inter-network Gateway (THIG). From Release 7 onwards this “entry point” function is removed from the I-CSCF and is now part of the Interconnection Border Control Function (IBCF). The IBCF is used as gateway to external networks, and provides NAT and firewall functions (pinholing). The IBCF is practically a Session Border Controller specialized for the NNI.

To summarize, The I-CSCF is a SIP proxy located in the edge of an administrative IMS domain. Its IP address is published in the DNS of the domain (using NAPTR and SRV type of DNS records), so that remote servers can find and use it as a forwarding point (e.g., registering) for SIP packets to this IMS domain. The I-CSCF implements a Diameter (RFC 3588) interface to the HSS, and queries the HSS to retrieve the address of the S-CSCF for an UE to perform SIP registration. Being a SIP proxy, the I-CSCF forwards SIP message requests and responses to the S-CSCF. Additionally, the I-CSCF may encrypt parts of the SIP messages securing any sensitive information.

Typically the IMS network includes a number of I-CSCF nodes for the purpose of scalability and redundancy. The I-CSCF is usually located in the IMS home network.

An Emergency-CSCF, the E-CSCF is a newly defined entity in the IMS network. As its name indicates, the E-CSCF is responsible for handling of emergency call service. Once the P-CSCF detects that the received SIP message request is for an emergency call it forwards that SIP message to the E-CSCF. Then, the E-CSCF contacts the Locating Retrieval Function (LRF) to get the location of the UE for routing the emergency call appropriately. The E-CSCF can be located either in a home network or in a visited network.

SIP ASs host and execute services (Next Generation Network Services), and interface with the S-CSCF using SIP. An example of an AS that is being developed in 3GPP is the Voice call continuity Function (VCC Server). Depending on the actual service, the AS can operate in SIP proxy mode, SIP UA mode or SIP B2BUA mode. An AS can be located in the home network or in an external third-party network. If located in the home network, it can query the HSS with the Diameter Sh or Si interfaces (for a SIP-AS).

SIP AS: host and execute IMS specific services.
IP Multimedia Service Switching Function (IM-SSF): interfaces SIP to CAP to communicate with CAMEL ASs.
OSA Service Capability Server (OSA SCS): interfaces SIP to the OSA framework.

Functional model: the AS-ILCM and AS-OLCM store the transaction state, and may optionally store the session state depending on the specific service being executed. The AS-ILCM interfaces to the S-CSCF (ILCM) for an incoming leg and the AS-OLCM interfaces to the S-CSCF (OLCM) for an outgoing leg. Application Logic provides the service(s) and interacts between the AS-ILCM and AS-OLCM.

Public Service Identities (PSI) are identities that identify services, which are hosted by ASs. As user identities, PSIs will take the form of either a SIP or Tel URI. PSIs are stored in the HSS either as a distinct PSI or as a wildcarded PSI:

a distinct PSI contains the PSI that is used in routing;
a wildcarded PSI represents a collection of PSIs.

Media servers

The Media Resource Function (MRF) provides media-related functions such as media manipulation (e.g. voice stream mixing) and playing of tones and announcements.

Each MRF is further divided into a Media Resource Function Controller (MRFC) and a Media Resource Function Processor (MRFP).

The MRFC is a signaling plane node that interprets information coming from an AS and S-CSCF to control the MRFP.
The MRFP is a media plane node used to mix, source or process media streams. It can also manage access rights to shared resources.

The Media Resource Broker (MRB) is a functional entity that is responsible for both collection of appropriate published MRF information and supplying of appropriate MRF information to consuming entities such as the AS. MRB can be used in two modes:

Query mode: AS queries the MRB for media and sets up the call using the response of MRB.
In-Line Mode: AS sends a SIP INVITE to the MRB. The MRB sets up the call.

A Breakout Gateway Control Function (BGCF) is a SIP proxy which processes requests for routing from an S-CSCF when the S-CSCF has determined that the session cannot be routed using DNS or ENUM/DNS. It includes routing functionality based on telephone numbers.

A PSTN/CS gateway interfaces with PSTN circuit switched (CS) networks. For signaling, CS networks use ISDN User Part (ISUP) (or BICC) over Message Transfer Part (MTP), while IMS uses SIP over IP. For media, CS networks use Pulse-code modulation (PCM), while IMS uses Real-time Transport Protocol (RTP).

A Signaling Gateway (SGW) interfaces with the signaling plane of the CS. It transforms lower layer protocols as Stream Control Transmission Protocol (SCTP, an IP protocol) into MTP (a Signaling System 7 (SS7) protocol), to pass ISDN User Part (ISUP) from the MGCF to the CS network.
A Media Gateway Controller Function (MGCF) is a SIP endpoint that does call control protocol conversion between SIP and ISUP/BICC and interfaces with the SGW over SCTP. It also controls the resources in a Media Gateway (MGW) across an H.248 interface.
A Media Gateway (MGW) interfaces with the media plane of the CS network, by converting between RTP and PCM. It can also transcode when the codecs don’t match (e.g., IMS might use AMR, PSTN might use G.711).

Media Resources are those components that operate on the media plane and are under the control of IMS Core functions. Specifically, Media Server (MS) and MGW.

IMS has two types of Next Generation Networking interconnection :

Service oriented Interconnection (SoIx): the physical and logical linking of next generation networks (NGN) domains that allows carriers and service providers to offer services over NGN (i.e., IMS and PES) platforms with control, signaling (i.e., session based), which provides defined levels of interoperability. For instance, this is the case of “carrier grade” voice and/or multimedia services over IP interconnection. “Defined levels of interoperability” are dependent upon the service or the QoS or the Security, etc.
Connectivity oriented Interconnection (CoIx): the physical and logical linking of carriers and service providers based on simple IP connectivity irrespective of the levels of interoperability. For example, an IP interconnection of this type is not aware of the specific end to end service and, as a consequence, service specific network performance, QoS and security requirements are not necessarily assured. This definition does not exclude that some services may provide a defined level of interoperability. However only SoIx fully satisfies NGN interoperability requirements.

An NGN interconnection mode can be direct or indirect. Direct interconnection refers to the interconnection between two network domains without any intermediate network domain. Indirect interconnection at one layer refers to the interconnection between two network domains with one or more intermediate network domain(s) acting as transit networks. The intermediate network domain(s) provide(s) transit functionality to the two other network domains. Different interconnection modes may be used for carrying service layer signaling and media traffic.

Offline charging is applied to users who pay for their services periodically (e.g. at the end of the month). Online charging, also known as credit-based charging, is used for prepaid services, or realtime credit control of postpaid services. Both may be applied to the same session.

Charging function addresses are addresses distributed to each IMS entitly and provide a common location for each entity to send charging information. Charging Data Function (CDF) addresses are used for offline billing and Online Charging Function (OCF) for online billing.

Offline Charging: all the SIP network entities (P-CSCF, I-CSCF, S-CSCF, BGCF, MRFC, MGCF and AS) involved in the session use the Diameter Rf interface to send accounting information to a CDF located in the same domain. The CDF will collect all this information, and build a Call Detail Record (CDR), which is sent to the billing system (BS) of the domain. Each session carries an IMS Charging Identifier (ICID) as a unique identifier generated by the first IMS entity involved in a SIP transaction and used for the correlation with CDRs. Inter Operator Identifier (IOI) is a globally unique identifier shared between sending and receiving networks. Each domain has its own charging network. BSs in different domains will also exchange information, so that roaming charges can be applied.
Online charging: The S-CSCF talks to a IMS Gateway Function (IMS-GWF) which looks like a regular SIP application server. The IMS-GWF can signal the S-CSCF to terminate the session when the user runs out of credits during a session. The AS and MRFC use the Diameter Ro interface toward an OCF.
When Immediate Event Charging (IEC) is used, a number of credit units is immediately deducted from the users account by the ECF and the MRFC or AS is then authorized to provide the service. The service is not authorized when not enough credit units are available.
When Event Charging with Unit Reservation (ECUR) is used, the Event Charging Function (ECF) first reserves a number of credit units in the user’s account and then authorizes the MRFC or the AS. After the service is over, the number of spent credit units is reported and deducted from the account; the reserved credit units are then cleared.

IMS-based PSTN Emulation System (PES) provides IP network services to analog devices. IMS-based PES allows non-IMS devices to appear to IMS as normal SIP users. Analog terminal using standard analog interfaces can connect to IMS-based PES in two ways:

Via Access Media Gateway (A-MGW) that is linked and controlled by AGCF. AGCF is placed within the operators’ network and controls multiple A-MGW. A-MGW and AGCF communicate using H.248.1 (Megaco) over the P1 reference point. POTS phone connect to A-MGW over the z interface. The signaling is converted to H.248 in the A-MGW and passed to AGCF. AGCF interprets the H.248 signal and other inputs from the A-MGW to format H.248 messages into appropriate SIP messages. AGCF presents itself as P-CSCF to the S-CSCF and passes generated SIP messages to S-CSCF or to IP border via IBCF. Service presented to S-CSCF in SIP messages trigger PES AS. AGCF also has a certain service independent logic, for example on receipt of an off-hook event from A-MGW, the AGCF requests the A-MGW to play a dial tone.
Via VoIP-Gateway (VGW) or SIP Gateway/Adapter on customer premises. POTS phones via VOIP Gateway connect to P-CSCF directly. Operators mostly use Session Border Controllers between VoIP Gateways and P-CSCFs for security and to hide network topology. VGW link to IMS using SIP over Gm reference point. The conversion from POTS service over the z interface to SIP occurs in the customer premises VoIP Gateway. POTS signaling is converted to SIP and passed on to P-CSCF. VGW acts as SIP UA and appears to P-CSCF as SIP terminal.

Both A-MGW and VGW are unaware of the services. They only relay call control signaling to and from the PSTN terminal. Session control and handling is done by IMS components.

Session handling

One of the most important features of IMS, that of allowing for a SIP application to be dynamically and differentially (based on the user’s profile) triggered, is implemented as a filter-and-redirect signaling mechanism in the S-CSCF.

The S-CSCF might apply filter criteria to determine the need to forward SIP requests to AS. It is important to note that services for the originating party will be applied in the originating network, while the services for the terminating party will be applied in the terminating network, all in the respective S-CSCFs.

An initial filter criteria (iFC) is an XML-based format used for describing control logic. iFCs represent a provisioned subscription of a user to an application. They are stored in the HSS as part of the IMS Subscription Profile and are downloaded to the S-CSCF upon user registration (for registered users) or on processing demand (for services, acting as unregistered users). iFCs are valid throughout the registration lifetime or until the User Profile is changed.

The iFC is composed of:

Priority: determines the order of checking the trigger.
Trigger Point: logical condition(s) which is verified against initial dialog creating SIP requests or stand-alone SIP requests.
AS URI: specifies the AS to be forwarded to when the Trigger Point matches.

There are two types of iFCs:

Shared: when provisioning, only a reference number (the Shared iFC number) is assigned to the subscriber. During registration, only the number is sent to the CSCF, not the entire XML description. The complete XML will have previously been stored on the CSCF.
Non-Shared: when provisioning, the entire XML description of the iFC is assigned to the subscriber. During registration, the entire XML description is sent to the CSCF.

Table 3.1. The chart describes the interfaces involved in IMS and figure 3.4 shows their place in the overall processing system

image
image
image
image
image

It is envisaged that security defined in TS 33.203 may not be available for a while especially because of the lack of USIM/ISIM interfaces and prevalence of devices that support IPv4. For this situation, to provide some protection against the most significant threats, 3GPP defines some security mechanisms, which are informally known as “early IMS security”, in TR33.978. This mechanism relies on the authentication performed during the network attachment procedures, which binds between the user’s profile and its IP address. This mechanism is also weak because the signaling is not protected on the User–network interface.

Figure 3.4. Security aspects of early IMS and non-3GPP systems

image

CableLabs in PacketCable 2.0, which also adopted the IMS architecture but has no USIM/ISIM capabilities in their terminals, published deltas to the 3GPP specifications where the Digest-MD5 is a valid authentication option. Later on, TISPAN also made a similar effort given their Fixed Networks scopes, although the procedures are different. To compensate for the lack of IPsec capabilities, TLS has been added as an option for securing the Gm interface. Later 3GPP Releases have included the Digest-MD5 method, toward a Common-IMS platform, yet in its own and again different approach. Although all 3 variants of Digest-MD5 authentication have the same functionality and are the same from the IMS terminal’s perspective, the implementations on the Cx interface between the S-CSCF and the HSS are different.

For a more detailed description see:

3GPP TS 23.228, IP Multimedia Subsystem (IMS), Stage 2;
3GPP TS 29.228, Stage 2 specifications ;
3GPP TS 24.229, IP multimedia call control protocol based on Session Initiation Protocol (SIP) and Session Description Protocol (SDP).

3.3. Evolved packet system in 3GPP standards

The new blocks specific to Evolved UMTS evolution, also known as the evolved packet system (EPS), are the evolved packet core (EPC) and the Evolved UTRAN (or E-UTRAN). Other blocks from the classical UMTS architecture are also displayed, such as the UTRAN (the UMTS Access Network), the PS and the CS CNs, respectively, connected to the public (or any private) IP and Telephone Networks. The IMS (IP Multimedia Subsystem) is located on top of the Packet Core blocks and provide access to both public or private IP networks, and the public telephone network via Media Gateway network entities. The HSS, managing user subscription information is shown as a central node, providing services to all CN blocks of 3G and evolved 3G architecture.

3.3.1. Policy and charging rules function

EPS introduces the Policy and Charging Rules Function (PCRF) as a software node designed to determine in real-time the policy rules in an IMS network. Earlier policy engines were added onto an existing network, which was sometimes cumbersome to manage and update.

Figure 3.5. Full scope of EPS

image

Being a policy tool, PCRF plays a central role in the NGN. It accesses subscribers’ databases and charging system. In real-time, PCRF aggregates information to and from the network, from operational support systems and other sources such as portals. It supports the creation of rules and automatically makes policy decisions for each active subscriber on the network, e.g. authorization to access one or more among the multiple services and QoSs levels offered by the network with application of the charging rules.

Figure 3.6. PCRF connections in LTE’s EPC

image

PCRF is available both on wired and wireless networks. It can be integrated in different platforms (billing, rating, charging, subscribers’ database) or operate on a stand alone device.

3.3.2. Release 8 system architecture evolution and evolved packet system

System Architecture Evolution (SAE) is synonymous with Evolved Packet Core (EPC). SAE/EPC is defined by 3GPP in Release 8 (Rel-8) as an entirely new CN with a flatter all-IP architecture enabling a higher data rate, lower latency packet-optimized system that supports multiple radio-access technologies, focusing on the packet-switched domain, with the assumption that the system will support all services – including voice – in this domain.

3GPP has made significant progress in Rel-8 toward the standards development and definition of a new flatter-IP CN to support the Evolved UMTS Terrestrial Radio Access Network (EUTRAN) through the SAE work item, which was later renamed the EPC Architecture. In parallel, 3GPP has made significant progress toward the standards development and definition of a new OFDMA-based technology through the Long Term Evolution (LTE) work item. This new OFDMA based air interface (LTE) is also often referred to as the EUTRAN. Note that the complete packet system consisting of the EUTRAN/LTE and the SAE/EPC is called the EPS.

The combination of LTE and SAE/EPC provides the long-term vision for 3GPP to an all-IP, packet only wideband OFDMA system expected to further improve performance by providing higher data rates, improved spectral efficiency and reduced latency. LTE’s ability to support bandwidths wider than 5 MHz is of particular importance as the demands for higher wireless data speeds and spectral efficiencies continue to grow.

As mobile operators add LTE to their radio access networks, they will simultaneously evolve the rest of their networks and subscriber devices. They will update their core and backhaul networks to handle the exponential increases in IP traffic enabled by LTE. To keep their networks performing optimally, mobile operators will flatten their CN architectures considerably by using EPC technology. EPC reduces the number of nodes in the core, which reduces latency even as the amount of data traffic increases. It simplifies deployment of IP-based networks and reduces the cost of their deployments.

Figure 3.7. Evolved packet core

image

EPC uses IMS as a component. It also manages QoS across the whole system, which will be essential for enabling a rich set of multimedia-based services. The EPS will be optimized for all services to be delivered via IP in a manner that is as efficient as possible – through minimization of latency within the system, for example.

Although it will most likely be deployed in conjunction with LTE, EPC may also be deployed for use with HSPA+, where it would provide a stepping-stone to LTE. It will support service continuity across heterogeneous networks, important for LTE operators that must simultaneously support GSM/GPRS/EDGE/UMTS/HSPA customers.

The key features and capabilities of SAE/EPC include:

reduced latency and higher data performance through a flatter all-IP architecture. 3GPP has targeted user-plane latency at 10 ms;
support for both LTE radio-access networks and interworking with GSM-UMTS radio-access networks;
the ability to integrate non-3GPP networks;
optimization for all services provided via IP.

The EPC is a flat all-IP-based CN that can be accessed through 3GPP radio access (LTE, 3G and 2G) and non-3GPP radio access (e.g. WiMAX and WLAN), allowing handover procedures within and between both radio access types. The access flexibility to the EPC is attractive for operators since it enables them to have a single CN through which different services are supported. The main components of the EPC are: (1) Mobility Management Entity; (2) Serving Gateway; and (3) Packet Data Network.

Figure 3.8. EPC components

image

3.3.2.1. Mobility Management Entity (MME)

Mobility Management Entity (MME) is a key control element. It is in charge of managing security functions (authentication, authorization and Network Access Server (NAS) signaling security), idle state mobility handling, roaming and handovers among other functions. The S1-MME interface connects the EPC with the evolved Node Bs (eNBs, base stations in LTE).

MME is responsible for tracking and paging procedure including retransmissions, and also for idle mode of UE. MME is also involved in bearer activation and deactivation procedures. Its task also belongs to choosing the SGW for a UE in the initial attach process and when the intra-handover takes place which involves CN node relocation.

MME is responsible for authenticating the user to the HSS, if the user is roaming MME terminates S6a interface to the user’s home HSS. All Non Access Stratum (NAS) signaling terminates at the MME point, which is also responsible for generation and allocation of temporary UE identities (GUTI). Among its duties is also authorization UE to Public Land Mobile Network (PLMN) and enforcing UE roaming restrictions if there are any. MME is also the termination point of ciphering and integrity protection for NAS signaling. Lawful interception (LI) of signaling could also be supported by MME entity. It also provides the control plane function for mobility between LTE and 2G/3G networks by the S3 interface (from SGSN to MME).

MME functions, according to 23.401 3GPP documentation, include:

NAS signaling;
NAS signaling security; MME acts the termination point for ciphering protection for NAS signaling;
inter CN node signaling for mobility between 3GPP access networks (terminating S3); the S3 interface terminates in the MME, providing the control plane function for mobility between LTE and 2G/3G access networks;
UE Reach ability in ECM-IDLE state (including control and execution of paging retransmission);
paging procedure;
providing temporary identities for UEs;
tracking area list management;
mapping from UE location (e.g. TAI) to time zone, and signaling a UE time zone change associated with mobility;
PDN GW and serving GW selection for the UE;
MME selection for handovers with MME change; intra-LTE handover involving core network node location;
SGSN selection for handovers to GSM or UMTS access networks;
roaming (S6a toward home HSS); interaction with HSS to authenticate user on attachment; implement roaming restrictions;
authentication;
authorization;
bearer management functions including dedicated bearer establishment, activation/deactivation;
lawful interception of signaling traffic; MME is the point at which lawful interception of signaling may be made;
warning message transfer function (including selection of appropriate eNodeB);
UE reach ability procedures.

The MME will signal a change is UE time zone only in case of mobility and in case of UE triggered service request, PDN disconnection and UE detach. If the MME cannot determine whether the UE time zone has changed (e.g. the UE time zone is not sent by the old MME during MME relocation), the MME should not signal a change in UE time zone. A change in UE time zone caused by a regulatory mandated time change (e.g. daylight saving time or summer time change) will not trigger the MME to initiate signaling procedures due to the actual change. Instead the MME will wait for the UE’s next mobility event or service request procedure and then use these procedures to update the UE time zone information in PDN GW.

3.3.2.2. Serving Gateway (S-GW or SGW)

SGW is the gateway that terminates the EPC interface toward the E-UTRAN via an interface called the S1-U. For each UE that is associated with the EPS there will be a unique S-GW hosting several functions, including mobility anchor point for both local inter-eNB handover and inter-3GPP mobility, inter-operator charging and packet routing and forwarding.

SGW is responsible for handovers with neighboring eNodeBs, also for data transfer in terms of all packets across the user plane. To its duties belongs taking care of mobility interface to other networks such as 2G/3G. SGW is monitoring and maintaining context information related to UE during its idle state and generates paging requests when data arrives for the UE in downlink direction (e.g. somebody’s calling). SGW is also responsible for replication of user traffic in case of LI.

SGW functions, according to 23.401 3GPP documentation include:

the local Mobility Anchor point for inter-eNodeB handover:
- sending of one or more “end marker” to the source eNodeB, source SGSN or source RNC immediately after switching the path during inter-eNodeB and inter-RAT handover, especially to assist the reordering function in eNodeB;
- mobility anchoring for inter-3GPP mobility (terminating S4 and relaying the traffic between 2G/3G system and PDN GW);
- ECM-IDLE mode downlink packet buffering and initiation of network triggered service request procedure;
lawful interception;
packet routing and forwarding;
transport level packet marking in the uplink and the downlink, e.g. setting the DiffServ Code Point, based on the QCI of the associated EPS bearer;
accounting for inter-operator charging. For GTP-based S5/S8, the Serving GW generates accounting data per UE and bearer;
interfacing OFCS according to charging principles and through reference points specified in TS 32.240.

3.3.2.3. Packet Data Network Gateway (PDN-GW or PGW)

Packet Data Network Gateway (PGW) provides the UE with access to a packet data network (PDN) by assigning an IP address from the PDN to the UE among other functions. Additionally, the Evolved Packet Data Gateway (ePDG) provides a security connection between an UE connected from an untrusted non-3GPP access network with the EPC by using IPSec tunnels.

The PGW is the gateway which terminates the SGi interface toward PDN. If UE is accessing multiple PDNs, there may be more than one PGW for that UE, however a mix of S5/S8 connectivity and Gn/Gp connectivity is not supported for that UE simultaneously.

PGW is responsible for acting as an “anchor” of mobility between 3GPP and non-3GPP technologies. PGW provides connectivity from the UE to external PDN by being the point of entry or exit of traffic for the UE. The PGW manages policy enforcement, packet filtration for users, charging support and LI.

Possible non-3GPP technologies are: WiMAX, CDMA 1X and EvDO.

PGW functions, as a list, according to 23.401 3GPP documentation, include:

per-user based packet filtering (by e.g. deep packet inspection);
lawful interception;
UE IP address allocation;
transport level packet marking in the uplink and downlink, e.g. setting the DiffServ Code Point, based on the QCI of the associated EPS bearer;
accounting for inter-operator charging;
UL and DL service level charging as defined in TS 23.203 (e.g. based on SDFs defined by the PCRF, or based on deep packet inspection defined by local policy);
interfacing OFCS through according to charging principles and through reference points specified in TS 32.240 [51];
UL and DL service level gating control as defined in TS 23. 203 [6];
UL and DL service level rate enforcement as defined in TS 23.203 [6] (e.g. by rate policing/shaping per SDF);
UL and DL rate enforcement based on APN-AMBR (e.g. by rate policing/shaping per aggregate of traffic of all SDFs of the same APN that are associated with Non-GBR QCIs);
DL rate enforcement based on the accumulated MBRs of the aggregate of SDFs with the same GBR QCI (e.g. by rate policing/shaping);
DHCPv4 (server and client) and DHCPv6 (client and server) functions;
PPP functionality;
packet screening.

Additionally the PDN GW includes the following functions for the GTP-based S5/S8:

UL and DL bearer binding as defined in TS 23.203;
UL bearer binding verification as defined in TS 23.203;
functionality as defined in RFC 4861 [32];
accounting per UE and bearer.

The PGW provides PDN connectivity to both GERAN/UTRAN only UEs and E-UTRAN capable UEs using any of E-UTRAN, GERAN or UTRAN. The PGW provides PDN connectivity to E-UTRAN capable UEs using E-UTRAN only over the S5/S8 interface.

However, from a user-plane perspective, there are only the base station (eNodeB) and the gateways. The system is considered “flat”. This results in a reduced complexity compared to previous architectures.

For further details regarding these elements and other elements of the EPC, the official specifications can be found in:

3GPP TS 23.401 GPRS enhancements for Evolved Universal Terrestrial Radio Access Network (E-UTRAN) access;
3GPP TS 23.402 Architecture enhancements for non-3GPP accesses.

3.4. Telephony processing

3.4.1. Enhanced voice quality

To ensure compatibility, 3GPP demands at least AMR-NB codec (narrow band), but the recommended speech codec for VoLTE is Adaptive Multi-Rate Wideband, also known as HD Voice. This codec is mandated in 3GPP networks that support 16 kHz sampling.

Fraunhofer IIS has proposed and demonstrated Full-HD Voice, an implementation of the AAC-ELD (Advanced Audio Coding – Enhanced Low Delay) codec for LTE handsets. Where previous cellphone voice codecs only supported frequencies up to 3.5 kHz and upcoming wideband audio services branded as HD Voice up to 7 kHz, Full-HD Voice supports the entire bandwidth range from 20 Hz to 20 kHz. For end-to-end Full-HD Voice calls to succeed however, both the caller and recipient’s handsets as well as networks have to support the feature.

3.4.2. Circuit-switched fallback (CSFB)

In this approach, LTE just provides data services, and when a voice call is to be initiated or received, 192 twill fall back to the circuit switched domain. When using this solution, operators, which have both LTE and 3G systems, just need to upgrade the MSC (in the GSM case) instead of deploying the IMS, and therefore, can provide services quickly. However, the disadvantage is that the subscriber suffers longer call setup delay.

3.4.3. Simultaneous voice and LTE (SVLTE)

The handset works simultaneously in the LTE and circuit switched modes, with the LTE mode providing data services and the circuit switched mode providing the voice service. This is a solution solely based on the handset, which does not have special requirements on the network and does not require the deployment of IMS either. The disadvantage of this solution is that the phone can become expensive with high power consumption.

While the industry has seemingly standardized on VoLTE for the future, the demand for voice calls today has led LTE carriers to introduce Circuit-switched fallback (CSFB) as a stopgap measure. When placing or receiving a voice call, LTE handsets will fall back to old 2G or 3G networks for the duration of the call.

3.4.4. Over-The-Top (OTT) applications

There are multiple options available today for mobile applications. This includes Over-The-Top (OTT) applications, services that are generally offered by third party (non-operator) developers (e.g. Skype) that can be downloaded from popular app stores, often for free. Second, we have Web-based applications. These applications run directly from a Web browser, or are included as part of Cloud-based services that run thin client applications on a device platform. Third are pre-loaded applications which are bundled with a platform and cannot be deleted or disabled. This includes applications such as camera, maps, mail and calendar. Finally, there are embedded applications, which are deeply integrated applications that are part of the native dialer; VoLTE falls in this category.

Today we are seeing explosive growth in OTT applications for VoIP/Video/Messaging apps. By quickly browsing through Google Play and Apple Store, we see a couple of hundred (or more) VoIP/Video/Messaging apps available for free. Most of these apps use their own proprietary protocols/infrastructure/network and use the operator’s best-effort data network. Most consumers today have one or more of these apps installed on their mobile device.

OTT VoIP over LTE:

voice +video packets/data packets/steaming web browsing;
wireless channel: Wi-Fi or LTE ; single pipe for data and realtime voice;
output: congestion of packets, packet losses, out of sequence packets.

3.4.4.1. OTT Mobile VoIP

Mobile VoIP or simply mVoIP is an extension of mobility to a VoIP network. There are several methodologies that allow a mobile handset to be integrated into a VoIP network. One implementation turns the mobile device into a standard SIP client, which then uses a data network to send and receive SIP messaging, and to send and receive RTP for the voice path. This methodology of turning a mobile handset into a standard SIP client requires that the mobile handset support, at minimum, high speed IP communications. In this application, standard VoIP protocols (typically SIP) are used over any broadband IP-capable wireless network connection.

High speed services from mobile operators using LTE may have better audio quality and capabilities for metropolitan-wide coverage including fast handoffs among mobile base stations, yet it will cost more than the typical Wi-Fi-based VoIP service.

As device manufacturers exploited more powerful processors and less costly memory, smartphones became capable of sending and receiving email, browsing the Web (albeit at low rates) and allowing a user to watch TV. Mobile VoIP users were predicted to exceed 100 million by 2012 and InStat projects 288 million subscribers by 2013.

The mobile operator industry business model conflicts with the expectations of Internet users that access is free and fast without extra charges for visiting specific sites.

Mobile VoIP, like all VoIP, relies on SIP — the standard used by most VoIP services, and now being implemented on mobile handsets and smartphones.

When moving between IP-based networks, as is typically the case for outdoor applications, two other protocols are required:

IEEE 802.21 handoff, permitting one network to do call setup and initial traffic, handing off to another when the first is about to fall out of range – the underlying network need not be IP-based, but typically the IP stream is guaranteed a certain QoS during the handoff process;
IEEE 802.11u call initiation when the initial contact with a network is not one that the user has subscribed to or been in contact with before.

For indoor or campus (cordless phone equivalent) use, the IEEE P1905 protocol establishes QoS guarantees for home area networks: Wi-Fi, Bluetooth, 3G, 4G and wired backbones using AC powerline networking/HomePlug/IEEE P1901, Ethernet and Power over Ethernet/IEEE 802.3af/IEEE 802.3at, MoCA and G.hn. In combination with IEEE 802.21, P1905 permits a call to be initiated on a wired phone and transferred to a wireless one and then resumed on a wired one, perhaps with additional capabilities such as videoconferencing in another room. In this case the use of mobile VoIP enables a continuous conversation that originates, and ends with, a wired terminal device.

A more popular approach has been full-spectrum handsets that can communicate with any wireless network including mobile VoIP, DECT and satellite phone networks, but which have limited handoff capabilities between networks. The intent of IEEE 802.21 and IEEE 802.11u is that they can be added to such phones running iPhone, QNX, Android or other smartphone operating systems, yielding a phone that is capable of communicating with literally any digital network and maintaining a continuous call at high reliability at a low access cost.

3.5. The requirements of VoLTE and V.VoIP applications

VoLTE requirements:

data packets/steaming web browsing;
dedicated bearer for voice + video; wireless channel (IMS APN);
output: flow of data packets.

VoLTE is based on the IMS network, with specific profiles for control and media planes of voice service on LTE defined by GSMA in PRD IR.92. This approach results in the voice service (control and media planes) being delivered as data flows within the LTE data bearer. This means that there is no dependency on (or ultimately, requirement for) the legacy Circuit Switch voice network to be maintained.

One of the most important requirements for VoIP applications is that they must have guaranteed QoS. Since all OTT apps use VoIP over LTE (on the operators’ data networks) do not offer QoS because the same pipe is used for real-time voice/video communication as is used for web browsing and audio/video streaming. This leads to competition for bandwidth – meaning there can be no guaranteed QoS. Conversely, carrier-grade VoLTE utilizes dedicated bearer/bandwidth thus guaranteeing superior voice and video quality.

Low latency is also a key requirement for V.VoIP. With OTT apps, latency suffers as tasks queue up in the pipeline as they compete for bandwidth. OTT apps are integrated at high-level user space interfaces, whereas the embedded VoLTE integrates with low-level audio drivers for audio capture/rendering and network interfaces, thus reducing the TX/RX processing delays considerably. OTT end-to-end latency is approximately 3x higher than with embedded VoLTE.

Interoperability must also be considered. Almost all OTT apps use proprietary methods, and hence do not interoperate with each other. To overcome this, and make calling a hassle-free experience for consumers, embedded VoLTE has the opportunity to thrive through ecosystem collaboration between chipset, OEM and infrastructure vendors and mobile operators.

3.6. Voice and video over LTE are achieved using voice on IP channels (VoLTE)

The advantages of LTE are high throughput, low latency, plug and play capabilities, a superior end user experience and a simplified network architecture resulting in a 10X improvement cost-per-megabyte to carry data traffic.

Given its ability to effectively carry VoIP, LTE provides mobile carriers with a single network infrastructure for all services, including voice, short message service (SMS), and broadband data for both mobile and fixed end users. Finally, mobile operators can migrate voice and SMS from their congested and costly circuit switched core networks to a more efficient IP-based core.

The de facto standard for delivering voice and video over LTE using IMS is specified in GSMA IR.92;
IMS Profile for Voice and SMS and IR.94:
IMS Profile for Conversational Video Services. Endorsed by the world’s leading mobile operators and network equipment vendors, the goals of IR.92 and IR.94 are to identify the minimum requirements in the 3GPP standards needed to deliver LTE voice, video and SMS services over IMS that is consistent with what is offered today by mobile operators on their circuit switched networks, but at the same time offer additional services that are not currently available such as HD (High Definition) voice and video.

Key technical requirements for Voice and Video over LTE include:

3GPP IMS Release 9 or greater;
MMTel services;
SMS over IP;
IMS Media support for AMR audio codecs. Including AMR-WB for HD;
IMS Media support for H.264 video codec;
Service Centralization and Continuity AS (SCC-AS) support for Single Radio VCC (SRVCC), providing seamless voice handoff between LTE and circuit-switch networks.

Inside the smartphone (or mobile), standards can obviously help in the collaboration between the different applications. Embedded carrier grade VoLTE/Video over LTE and Rich Communication Suite (RCS) applications are compliant to GSMA PRD IR.92 (IMS Profile for Voice, and SMS), GSMA PRD IR.94 (IMS Profile for Conversational Video Service) and RCS 5.1 (Advanced Communications Services and Client Specification) respectively.

As these apps are running on mobile devices, it is clear that optimizing for low-power consumption and optimal bandwidth is key. Some of the techniques that embedded VoLTE/Video over LTE/RCS uses to reduce power consumption and increase bandwidth include:

integration with hardware accelerators (such as video) thus reducing the overall CPU MHz requirements and enabling higherresolution/fps video;
hand-coding assembly for CPU-intensive algorithms (media engine), since the power consumption increases for every MHz used;
discontinuous reception (DRX) in connected mode – stopping the transmission during silence packets and aggregation of packets;
semi-persistent scheduling (SPS) – assignment of a predefined chunk of radio resources for VoIP users with an interval of 20 ms so that devices are not required to request resources at each Transmission Time Interval (TTI);
Robust Header Compression (RoHC) – For VoIP packets, the size of headers (IP/UDP/RTP) is usually larger than the data itself— for IPv4, UDP and RTP, the amount of overhead due to headers is 40 bytes, and RoHC can compress this to 2 or 3 bytes.

Network integration is also critical. All OTT apps drop the realtime voice/video calls when migrating from one network to the other, which is an obvious problem. Embedded VoLTE supports SRVCC which enables a seamless handover to legacy 2G/3G cellular networks when there is no LTE coverage, as well as handover to Wi-Fi with IP2IP and Access Network Discovery and Selection Function (ANDSF).

For a smooth user experience, it is also important to have a single user interface for both circuit switched and packet switched calls. As an integrated native application, embedded VoLTE uses one interface for both circuit switched and packet switched VoLTE calls. However, OTT applications have their own dialer/user interface for VoIP over LTE/Wi-Fi calling, but a native dialer must be used for circuit-switched calls.

The RCS is the operators’ response to real-time VoIP/Video/Messaging OTT apps. It not only supports VoIP/Video/Messaging, but also integrates an enhanced phonebook which includes capability discovery exchange and social presence information. It also supports enhanced messaging (1-1 chat, group chat, emoticons, location share and file sharing and enriched calls) and enables multimedia content sharing during a voice call, video call and video sharing. It also addresses a much larger subscriber base (more than 5 billion) compared to a fragmented OTT user base.

More and more operators across the world are following this track. Trials are ongoing for Video over LTE, with the ultimate goal of providing an integrated super-app that provides a unified VoLTE, Video over LTE and RCS functionality. The community developing Rich Communication Services is called Joyn, it is working under the umbrella of the GSMA with the leadership of prominent operators (Vodafone, Orange, Telefonica, Deutsche Telekom and Telecom Italia) and major manufacturers (Sony, Samsung, HTC, Huawei, Nokia, LG, BlackBerry, ZTE, Apple with the I-Phone).

To further increase customer satisfaction, operators’ embedded VoLTE provides the same user experience across the platforms. It supports all circuit-switched features (voice calling, and supplementary call features – call waiting, transfer, forwarding, emergency calling and SMS). It also supports some of the enhanced features such as HD Voice (AMR-WB) and HD Video, with reduced call setup times and improved voice quality that is 40% better than 3G due to using a wider bandwidth (50–7000 Hz instead of 300–2400 Hz).

Operators have adopted different approaches to migrate from the full circuit-switched voice + data to full LTE voice and data. From a high-level perspective, there are three categories:

the first category includes operators who have gone ahead and launched the VoLTE service even though the LTE coverage is spotty (using CSFB for voice calling).
the second category of operators is planning to adopt the SRVCC to fall back to 2G/3G where there is no LTE coverage.
the last group includes more conservative carriers who are busy rolling-out the LTE data-only service so that they initially provide the same coverage as their existing 3G network and then deploy VoLTE for LTE-only handsets. But CSFB, SVLTE and SRVCC are expensive interim solutions; operators will have to quickly migrate to full VoLTE-only solutions in order to provide the carrier-grade user experience.

As realization in the field, embedded VoLTE was launched in August 2012 by MetroPCS, LGU+, SK Telecom. The world’s first country-wide interoperable RCS network was launched in Spain by Movistar, Orange and Vodafone. This was followed by deployments in Korea – KT, LGU+, SK Telecom, Germany – Vodafone, T-Mobile and US – MetroPCS.

The largest LTE operator, Verizon is already testing VoLTE in trials and plans to launch the service commercially in 2014, at first using traditional CDMA-LTE combo phones, unlike other carriers, Verizon’s LTE-CDMA combo does not support a feature called SRVCC, which would hand over a VoLTE call to the 2G network if a customer walked out of 4G network range. For Verizon, launching VoLTE is dependent on matching its LTE coverage to its 2G coverage, so customers would rarely, if ever, be forced off the 4G network. Most of Verizon’s data traffic (57%) has already migrated to its LTE network. Verizon 4G now covers 95% of the US population (298 million POP).

CDMA traffic has hit its peak and has started declining. Eventually Verizon will shut down a portion of its 3G CDMA data networks, and as voice traffic moves to VoLTE it can do the same with its 2G networks. That PCS spectrum can then be re-farmed for LTE or other purposes.

VoLTE would not just support voice calls. Verizon’s service would include video-casting and other real-time multimedia communications features. As an IP service, VoLTE could be integrated into other applications and could reproduce many of the enhanced messaging and video features in over-the-top applications.

3.7. Cut down version of IMS

In order that IMS was implemented in a fashion that would be acceptable to operators, a cut down version was defined. This not only reduced the number of entities required in the IMS network, but it also simplified the interconnectivity – focusing on the elements required for VoLTE.

Figure 3.9. Cut down version of IMS Reduced IMS network for VoLTE

image

As can be seen there are several entities within the reduced IMS network used for VoLTE:

IP-CAN IP, Connectivity Access Network: this consists of the EUTRAN and the MME.
P-CSCF, Proxy Call State Control Function: the P-CSCF is the user to network proxy. In this respect all SIP signaling to and from the user runs via the P-CSCF whether in the home or a visited network.
I-CSCF, Interrogating Call State Control Function: the I-CSCF is used for forwarding an initial SIP request to the S-CSCF. When the initiator does not know which S-CSCF should receive the request.
S-CSCF, Serving Call State Control Function: the S-CSCF undertakes a variety of actions within the overall system, and it has a number of interfaces to enable it to communicate with other entities within the overall system.
AS, Application Server: it is the AS that handles the voice as an application.
HSS, Home Subscriber Server: the IMS HSS or home subscriber server is the main subscriber database used within IMS. The IMS HSS provides details of the subscribers to the other entities within the IMS network, enabling users to be granted access or not dependent upon their status.

The IMS calls for VoLTE are processed by the subscriber’s S-CSCF in the home network. The connection to the S-CSCF is via the P-CSCF. Dependent upon the network in use and overall location within a network, the P-CSCF will vary, and a key element in the enablement of voice calling capability is the discovery of the P-CSCF.

3.8. Latency management

While much attention has been focused on LTE data rates, another important parameter, latency, is critical to enable a number of applications particularly voice services (VoLTE) or real time television

Latency depends on a number of parameter. It is a statistical measure. Some of the parameters impacting latency include traffic and subscriber load, the type of traffic and the channel radio frequency condition.

LTE includes QoS management with up to 9 classes of service. Conversational voice is very susceptible to delay and packet error loss, particularly for low bit-rate vocoders. Real-time gaming is another highly demanding application both in terms of delay budget and packet error rate.

LTE is supposed to provide a round trip time of less than 10 ms. According to LTE system specification and requirements, user plane latency is defined to be sub-10 ms for two way radio delay. Although this value excludes the latency to the core, the bulk of latency typically occurs on the air interface. Hence, there is much more room for improvements and optimization when it comes to LTE latency. This is a key issue and even failure of the LTE standard if latency is not improved. After all, LTE was designed with a 1 ms subframe to meet the low latency requirement. So far, this goal has not been achieved.

Table 3.2. Priority

image

The measured values exceed this limit.

Table 3.3. Latency

Reported Latency Network / Reference Comments
Average: 49 ms
Min: 40 ms
AT&T Houston (Signals Research) Latency measured to servers in the operators local market.
Average: 23 ms
Max: 38 ms
TeliaSonera Finland (Epitiro) These are reported as “Network Latency” and defined as “the time it takes for a network to respond.” I believe this measurement was done under very controlled conditions since they reported relatively stable average.
Average: 143 ms
Min: 79 ms
Verizon (BTIG) Tests were conducted in-building and near a window.

Figure 3.10. Latency (50 ms)

image

Table 3.4. Measurement

image

3.9. Appendix 1: VoIP tests in UK

For more details about the methodology and results, please visit David Martland’s website: www.sipcentric.com/2013/04/how–well–does–Uoip–work–ove–4g/.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset