Patent application number | Description | Published |
20130142256 | CODING LEAST SIGNFICANT BITS OF PICTURE ORDER COUNT VALUES IDENTIFYING LONG-TERM REFERENCE PICTURES - In general, techniques are described for coding picture order count values identifying long-term reference pictures. A video decoding device comprising a processor may perform the techniques. The processor may be configured to determine a number of bits used to represent least significant bits of the picture order count value that identifies a long-term reference picture to be used when decoding at least a portion of a current picture and parse the determined number of bits from a bitstream representative of the encoded video data. The parsed bits represent the least significant bits of the picture order count value. The processor retrieves the long-term reference picture from a decoded picture buffer based on the least significant bits, and decodes at least the portion of the current picture using the retrieved long-term reference picture. | 06-06-2013 |
20130142257 | CODING PICTURE ORDER COUNT VALUES IDENTIFYING LONG-TERM REFERENCE FRAMES - In general, techniques are described for coding picture order count values identifying long-term reference pictures. A video decoding device comprising a processor may perform the techniques. The processor may determine least significant bits (LSBs) of a picture order count (POC) value that identifies a long-term reference picture (LTRP). The LSBs do not uniquely identify the POC value with respect to the LSBs of any other POC value identifying any other picture in a decoded picture buffer (DPB). The processor may determine most significant bits (MSBs) of the POC value. The MSBs combined with the LSBs is sufficient to distinguish the POC value from any other POC value that identifies any other picture in the DPB. The processor may retrieve the LTRP from the decoded picture buffer based on the LSBs and MSBs of the POC value, and decode a current picture of the video data using the retrieved LTRP. | 06-06-2013 |
20130202035 | REFERENCE PICTURE LIST MODIFICATION FOR VIDEO CODING - A video coder may, in some cases, signal whether one or more initial reference picture lists are to be modified. When an initial list is to be modified, the video coder can signal information indicating a starting position in the initial reference picture list. When the starting position signaled by the video coder is less than a number of pictures included in the initial reference picture list, then the video coder signals the number of pictures to be inserted into the initial reference picture list, and a reference picture source from which a picture can be retrieved to insert into the initial reference picture list to construct a modified reference picture list. | 08-08-2013 |
20130272403 | REFERENCE PICTURE SET PREDICTION FOR VIDEO CODING - In one example, a device for decoding video data includes a video decoder configured to decode one or more syntax elements of a current reference picture set (RPS) prediction data structure, wherein at least one of the syntax elements represents a picture order count (POC) difference between a POC value associated with the current RPS and a POC value associated with a previously decoded RPS, form a current RPS based at least in part on the RPS prediction data structure and the previously decoded RPS, and decode one or more pictures using the current RPS. A video encoder may be configured to perform a substantially similar process during video encoding. | 10-17-2013 |
20130329787 | SIGNALING DATA FOR LONG TERM REFERENCE PICTURES FOR VIDEO CODING - A video coder codes a slice header for a slice of video data. The slice header includes a syntax element comprising identifying information for a long term reference picture, wherein the identifying information is explicitly signaled in the slice header or derived from a sequence parameter set corresponding to the slice. When the syntax element indicates that the identifying information for the long term reference picture is explicitly signaled, to code the slice header, the video coder is further configured to code a value for the identifying information for the long term reference picture in the slice header. | 12-12-2013 |
20140003506 | SIGNALING OF LONG-TERM REFERENCE PICTURES FOR VIDEO CODING | 01-02-2014 |
20140003537 | RANDOM ACCESS AND SIGNALING OF LONG-TERM REFERENCE PICTURES IN VIDEO CODING | 01-02-2014 |
20140003538 | SIGNALING LONG-TERM REFERENCE PICTURES FOR VIDEO CODING | 01-02-2014 |
20140016699 | REFERENCE PICTURE LIST MODIFICATION FOR VIDEO CODING - A video decoder generates an initial reference picture list (RPL). Furthermore, the video decoder determines that an ordered set of reference picture list modification (RPLM) syntax elements does not include any additional syntax elements when a syntax element in the ordered set of RPLM syntax elements has a particular value. Furthermore, the video decoder generates a final RPL. For each respective RPLM syntax element in the ordered set of syntax elements, when the respective RPLM syntax element does not have the particular value, the final RPL includes, at an insertion position for the respective RPLM syntax element, a particular reference picture. The respective syntax element indicates a position in the initial RPL of the particular RPLM reference picture. The insertion position for the respective RPLM syntax element corresponds to a position in the ordered set of RPLM syntax elements of the respective RPLM syntax element. | 01-16-2014 |
20140049604 | CONSTRUCTING REFERENCE PICTURE LISTS FOR MULTI-VIEW OR 3DV VIDEO CODING - In one example, a video coder, such as a video encoder or a video decoder, is configured to code a value for a layer identifier in a slice header for a current slice in a current layer of multi-layer video data, and, when the value for the layer identifier is not equal to zero, code a first set of syntax elements in accordance with a base video coding standard, and code a second set of one or more syntax elements in accordance with an extension to the base video coding standard. The second set of syntax elements may include a syntax element representative of a position for an identifier of an inter-layer reference picture of a reference layer in a reference picture list, and the video coder may construct the reference picture list such that the identifier of the inter-layer reference picture is located in the determined position. | 02-20-2014 |
20140086324 | LONG-TERM REFERENCE PICTURE SIGNALING IN VIDEO CODING - A video encoder signals, in a slice header for a current slice of a current picture, a first long-term reference picture (LTRP) entry, the first LTRP entry indicating that a particular reference picture is in a long-term reference picture set of the current picture. Furthermore, the video encoder signals, in the slice header, a second LTRP entry only if second LTRP entry does not indicate that the particular reference picture is in the long-term reference picture set of the current picture. | 03-27-2014 |
20140192149 | NON-NESTED SEI MESSAGES IN VIDEO CODING - A device obtains, from a bitstream that includes an encoded representation of the video data, a non-nested Supplemental Enhancement Information (SEI) message that is not nested within another SEI message in the bitstream. Furthermore, the device determines a layer of the bitstream to which the non-nested SEI message is applicable. The non-nested SEI message is applicable to layers for which video coding layer (VCL) network abstraction layer (NAL) units of the bitstream have layer identifiers equal to a layer identifier of a SEI NAL unit that encapsulates the non-nested SEI message. A temporal identifier of the SEI NAL unit is equal to a temporal identifier of an access unit containing the SEI NAL unit. Furthermore, the device processes, based in part on one or more syntax elements in the non-nested SEI message, video data of the layer of the bitstream to which the non-nested SEI message is applicable. | 07-10-2014 |
20140192882 | VIDEO BUFFERING OPERATIONS FOR RANDOM ACCESS IN VIDEO CODING - In an example, the present disclosure provides for receiving in a video bitstream an access unit having a first random access point (RAP) picture and receiving in the video bitstream, after the access unit in the bitstream, a subsequent access unit having a second RAP picture. In a case that one or more random access skipped leading (RASL) pictures for the subsequent access unit are not present in the received video bitstream, shifting a picture buffer removal time earlier based on a picture buffer removal delay offset. Another example provides for receiving an access unit after an earlier initialization of the hypothetical reference decoder (HRD), the access unit having a RAP picture, wherein associated access units containing RASL pictures are not received and initializing a picture buffer removal time and a picture buffer removal delay offset in response to receiving the access unit and not receiving the associated access units containing RASL pictures. | 07-10-2014 |
20140192885 | BITSTREAM CONSTRAINTS AND MOTION VECTOR RESTRICTION FOR INTER-VIEW OR INTER-LAYER REFERENCE PICTURES - Techniques are described for motion vector restriction where information in a bitstream ensures that a derived motion vector from a motion vector predictor is compliant with a motion vector restriction. Techniques are also described for indicating the motion vector restriction for parallel decoding. | 07-10-2014 |
20140192896 | GRADUAL DECODING REFRESH WITH TEMPORAL SCALABILITY SUPPORT IN VIDEO CODING - An example method of decoding video data includes receiving a plurality of pictures, receiving, in a message associated with a first picture of the plurality of pictures, information indicating a picture order count (POC) value of a recovery point picture of a gradual decoder refresh (GDR) set, when a picture that follows the first picture in decoding order has the POC value that is equal to a POC value of the recovery point picture, identifying the picture having the POC value equal to the POC value of the recovery point picture, as a recovery point picture, and when none of the pictures that follow the first picture has the POC value that is equal to the POC value of the recovery point picture, identifying one of the pictures that has a POC value greater than the POC value of the recovery point picture as the recovery point picture. | 07-10-2014 |
20140192897 | GRADUAL DECODING REFRESH WITH TEMPORAL SCALABILITY SUPPORT IN VIDEO CODING - An example method of decoding video data includes receiving, in a message associated with a picture, information indicating a refreshed region of the picture, determining whether the picture comprises a last picture in a gradual decoder refresh (GDR) set, determining whether the picture comprises a recovery point picture, and responsive to determining that the picture comprises the last picture in the GDR set and the recovery point picture, determining that the message indicates that the entire picture belongs to the refreshed region of the picture. | 07-10-2014 |
20140254679 | INTER-LAYER REFERENCE PICTURE CONSTRUCTION FOR SPATIAL SCALABILITY WITH DIFFERENT ASPECT RATIOS - A method of coding video data includes upsampling at least a portion of a reference layer picture to an upsampled picture having an upsampled picture size. The upsampled picture size has a horizontal upsampled picture size and a vertical upsampled picture size. At least one of the horizontal or vertical upsampled picture sizes may be different than a horizontal picture size or vertical picture size, respectively, of an enhancement layer picture. In addition, position information associated with the upsampled picture may be signaled. An inter-layer reference picture may be generated based on the upsampled picture and the position information. | 09-11-2014 |
20140301436 | CROSS-LAYER ALIGNMENT IN MULTI-LAYER VIDEO CODING - An apparatus for coding video information according to certain aspects includes a memory unit and a video processor in communication with the memory unit. The video processor is configured to identify a first picture included in the first set of pictures, wherein pictures within the first set of pictures having an output position after the output position of the first picture also have a decoding position after the decoding position of the first picture. The video processor is further configured to identify a second picture included in the second set of pictures, wherein pictures within the second set of pictures having an output position after the output position of the second pictures also have a decoding position after the decoding position of the second picture. The video processor is also configured to code the identified first picture and the identified second picture via one syntax element into one access unit. | 10-09-2014 |
20140301439 | CROSS-LAYER POC ALIGNMENT FOR MULTI-LAYER BITSTREAMS THAT MAY INCLUDE NON-ALIGNED IRAP PICTURES - In one example, a video coder is configured to code a value for a syntax element indicating whether at least a portion of a picture order count (POC) value of a picture is to be reset to a value of zero, when the value for the syntax element indicates that the portion of the POC value is to be reset to the value of zero, reset at least the portion of the POC value such that the portion of the POC value is equal to zero, and code video data using the reset POC value. Coding video data using the reset POC value may include inter-predicting a block of a subsequent picture relative to the picture, where the block may include a motion parameter that identifies the picture using the reset POC value. The block may be coded using temporal inter-prediction or inter-layer prediction. | 10-09-2014 |
20140301441 | NON-ENTROPY ENCODED SET OF PROFILE, TIER, AND LEVEL SYNTAX STRUCTURES - Systems, methods, and devices for coding multilayer video data are disclosed that may include, encoding, decoding, transmitting, or receiving a non-entropy encoded set of profile, tier, and level syntax structures, potentially at a position within a video parameter set (VPS) extension. The systems, methods, and devices may refer to one of the profile, tier, and level syntax structures for each of a plurality of output layer sets. The systems, methods, and devices may encode or decode video data of one of the output layer sets based on information from the profile, tier, and level syntax structure referred to for the output layer set. | 10-09-2014 |
20140301466 | GENERALIZED RESIDUAL PREDICTION IN HIGH-LEVEL SYNTAX ONLY SHVC AND SIGNALING AND MANAGEMENT THEREOF - An apparatus for decoding video information according to certain aspects includes a memory unit and a processor operationally coupled to the memory unit. The memory unit is configured to store at least one reference picture list of an enhancement layer, the at least one reference picture list comprising residual prediction reference picture information. The processor is configured to: decode signaled information about residual prediction reference picture generation; generate a residual prediction reference picture based on an enhancement layer reference picture and the decoded signaled information such that the generated residual prediction reference picture has the same motion field and the same picture order count (POC) as the enhancement layer reference picture from which it is generated; and store the generated residual prediction reference picture in the at least one reference picture list of the enhancement layer. | 10-09-2014 |
20140301469 | CODING VIDEO DATA FOR AN OUTPUT LAYER SET - Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may receive or transmit a first output layer set for a layer set and receive or transmit a second output layer set for the layer set. The systems, methods, and devices may code (encode or decode) video data for at least one of the first output layer set and the second output layer set. | 10-09-2014 |
20140301483 | NON-ENTROPY ENCODED REPRESENTATION FORMAT - Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may receive or transmit a non-entropy coded representation format within a video parameter set (VPS). The systems, methods, and devices may code (encode or decode) video data based on the non-entropy coded representation format within the VPS, wherein the representation format includes one or more of chroma format, whether different colour planes are separately coded, picture width, picture height, luma bit depth, and chroma bit depth. | 10-09-2014 |
20140301484 | PARAMETER SET DESIGNS FOR VIDEO CODING EXTENSIONS - Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving multilayer video data. The systems, methods, and devices may transmit or receive a video parameter set (VPS) including information for a series of layers, each layer including visual signal information. The systems, methods, and devices may code (encode or decode) video data based on the visual signal information signaled per layer in the VPS. | 10-09-2014 |
20140301485 | IRAP ACCESS UNITS AND BITSTREAM SWITCHING AND SPLICING - In one example, a device for coding video data includes a video coder configured to code an intra random access point (IRAP) picture of a partially aligned IRAP access unit of video data, and code data that indicates, when performing random access from the partially aligned IRAP access unit, at least one picture of a video coding layer that is not correctly decodable. When the video coder comprises a video decoder, the video decoder may skip decoding of the pictures that are not correctly decodable, assuming random access has been performed starting from the partially aligned IRAP access unit. | 10-09-2014 |
20140307803 | NON-ENTROPY ENCODED LAYER DEPENDENCY INFORMATION - Systems, methods, and devices for coding multilayer video data are disclosed that may include encoding, decoding, transmitting, or receiving a non-entropy encoded layer dependency information at a position within a video parameter set (VPS) extension prior to syntax elements of the VPS extension that are entropy encoded. The systems, methods, and devices may encode or decode the non-entropy encoded layer dependency information before an entropy encoded syntax element. The systems, methods, and devices may encode or decode video data of one or more of the layers of video data based on the non-entropy encoded layer dependency information. The layer dependency information indicates whether one of the layers is a direct reference layer for another of the layers. | 10-16-2014 |
20140355692 | HYPOTHETICAL REFERENCE DECODER MODEL AND CONFORMANCE FOR CROSS-LAYER RANDOM ACCESS SKIPPED PICTURES - A device may determine, based on a value, whether all cross-layer random access skipped (CL-RAS) pictures of an intra random access point (IRAP) access unit are present in a video data bitstream. In addition, the device may reconstruct pictures of the video data based at least in part on syntax elements decoded from the video data bitstream. | 12-04-2014 |
20150016520 | CROSS-LAYER PARALLEL PROCESSING AND OFFSET DELAY PARAMETERS FOR VIDEO CODING - In an example, a method of coding video data includes coding data of a video parameter set (VPS) of a multi-layer bitstream, including at least one of data that indicates whether any layers of the multi-layer bitstream have an inter-layer prediction restriction or data that indicates whether tile boundaries are aligned between at least two of the layers of the multi-layer bitstream, and coding the multi-layer bitstream in accordance with the data of the VPS. | 01-15-2015 |
20150016532 | SELECTION OF TARGET OUTPUT LAYERS IN HIGH EFFICIENCY VIDEO CODING EXTENSIONS - A method, apparatus, and manufacture for processing video data. A list of output layer sets in a video bitstream is received, and an index to at least one target output layer set in the list of output layer sets is received. Next, target output layers in the at least one target output layer set is determined based on the index. At least the target output layers from the video bitstream are decoded. Then, the decoded target output layers are output without outputting layers that are not targeted for output. | 01-15-2015 |
20150016540 | CROSS-LAYER PARALLEL PROCESSING AND OFFSET DELAY PARAMETERS FOR VIDEO CODING - In an example, a method of coding video data includes determining a location of a reference sample associated with a reference picture of video data based on one or more scaled offset values, where the reference picture is included in a first layer of a multi-layer bitstream and the one or more scaled offset values indicate a difference in scale between the first layer and a second, different layer. The method also includes determining a location of a collocated reference block of video data in the first layer based on the location of the reference sample, and coding a current block of video data in the second layer relative to the collocated reference block. | 01-15-2015 |
20150016543 | BITSTREAM RESTRICTIONS ON PICTURE PARTITIONS ACROSS LAYERS - Bitstream restrictions or constraints on the partitioning of pictures across layers of video data are described. In some examples, the number of tiles per picture for each layer of a plurality of layers is constrained based on a maximum number of tiles per picture for the layer. In some examples, the number of tiles per picture for each layer of the plurality of layers is no greater than the maximum number of tiles per picture for the layer. In some examples, a sum of the numbers of tiles per picture for the plurality of layers is no greater than a sum of the maximum numbers of tiles per picture for the plurality of layers. In some examples, a second largest coding unit (LCU) or coding tree block (CTB) size for a second layer is constrained based on, e.g., to be equal to, a first LCU size for a first layer. | 01-15-2015 |
20150016545 | DECODED PICTURE BUFFER OPERATIONS FOR VIDEO CODING - A method of decoding video data comprising partitioning a decoded picture buffer (DPB) into a plurality of sub-DPBs, receiving at least one indication of a sub-DPB size for the plurality of sub-DPBs for one or more operation points of a multi-layer video coding process, and allocating memory space for the plurality of sub-DPBs based on the at least one indication. | 01-15-2015 |
20150016546 | CROSS-LAYER ALIGNMENT OF INTRA RANDOM ACCESS POINT PICTURES - A video processing device includes a memory storing video data and one or more processors configured to: receive a first network abstraction layer (NAL) unit comprising a first picture of an access unit; in response to determining the first NAL unit comprises an intra random access point (IRAP) picture and in response to a NAL unit type for the first NAL unit indicating the presence of an instantaneous decoding refresh (IDR) picture without any associated leading pictures for a second NAL unit of the access unit comprising another IRAP picture, determine a NAL unit type for the second NAL unit to be a NAL unit type indicating the presence of an IDR picture without any associated leading pictures; and, process the first NAL unit and the second NAL unit based on the NAL unit type for the second NAL unit. | 01-15-2015 |